Commit graph

951 commits

Author SHA1 Message Date
rebuilt
904d0d2f2c It's possible for admin data Likert score values to be above 5. If that happens, we
cap the Likert score at 5. This was already happening at the scraper
level, but it is now also done by the admin data loader for safety.
Also make sure to update admin data in place instead of deleting and
reloading all values on each load. Add tests to confirm this behavior.
2023-06-03 16:06:50 -07:00
rebuilt
3589878700 Don't check standard deviation for early education surveys 2023-06-02 16:10:35 -07:00
rebuilt
89295f8832 Missing progress or duration information does not cause a row to be removed in the cleaning process 2023-06-02 15:13:53 -07:00
rebuilt
83fa0c4e63 Removed duplicate parameter 2023-06-02 10:36:46 -07:00
rebuilt
6022739f07 use district short name when writing filename 2023-05-31 17:12:40 -07:00
rebuilt
f749b96006 update response_date to recorded_date 2023-05-31 17:07:31 -07:00
rebuilt
d031179dc0 add changes to duration for short form items 2023-05-31 17:06:47 -07:00
rebuilt
1e3cebf223 add oneoff script to load wareham data 2023-05-27 17:05:40 -07:00
rebuilt
28e7f1fcb2 Change Dese ID of Minot Forest in Wareham to 310001799. Change Dese ID of Wareham Elementary School to 3100017 2023-05-27 15:44:04 -07:00
rebuilt
9d0f8659f1 Adjust valid duration threshold of short form items 2023-05-26 19:01:33 -07:00
rebuilt
37e932e078 Fix regression in student loader 2023-05-24 12:04:39 -07:00
rebuilt
bd4e64167d Add livereload 2023-05-24 12:04:13 -07:00
rebuilt
0dfc9726d0 Add automated data cleaning. Modify SurveyItemValues class to use regex
instead of hard-coded values. Produce a clean csv and a csv with all
the removed values and columns with the reason for removal. Add script for
running cleaning for each project.
2023-05-24 11:59:53 -07:00
rebuilt
b5dc933187 Add subcategory report 2023-05-23 18:43:55 -07:00
rebuilt
c57ec6157c Round all averages to two decimal places 2023-05-12 13:12:42 -07:00
rebuilt
6b2555c1f0 Fix response rates for small schools by accounting for expected minimum counts of responses less than 10 2023-05-12 12:27:46 -07:00
rebuilt
097edb1744 update ship it to use 'yarn jest' instead of 'yarn test' so JavaScript tests will run correctly 2023-05-01 10:32:29 -07:00
rebuilt
a066f464c7 fix failing tests 2023-04-30 16:35:08 -07:00
rebuilt
a21a11f809 specify gender column 2023-04-30 12:19:31 -07:00
rebuilt
80431e7660 Add most recent admin data 2023-04-30 12:04:36 -07:00
rebuilt
f1022728fa Fix problem with dese scraper lumping in 2021-22 data as 2022-23 data.
Deleted unused csvs. Turned off puts statements in admin loader.
Remove old, now unused admin data loader class.
2023-04-30 12:04:20 -07:00
rebuilt
c42b569e78 cap response rate per grade at 100 2023-04-30 11:59:22 -07:00
rebuilt
798ba1f340 Only return files in sftp directory, not other directories 2023-04-30 11:55:45 -07:00
rebuilt
03efd88284 Do not create response rate entries in the database when loading survey item responses 2023-04-25 17:16:44 -07:00
rebuilt
0ed6cbe6b4 Rename DataAvailability to Summary 2023-04-22 14:24:39 -07:00
rebuilt
09963c3323 Remove surveys table from codebase since it's no longer needed after changing logic surrounding response rates and score calculations 2023-04-22 14:21:48 -07:00
rebuilt
001d3083c8 Calculate response rates on the fly instead of looking them up from the db.
2023-04-22 14:03:22 -07:00
rebuilt
7f9b96fdb3 Add updated admin data 2023-04-22 14:03:00 -07:00
rebuilt
cee7aa4c59 Remove unused filename 2023-04-22 14:01:21 -07:00
rebuilt
1f8aea31de Add older admin data file archive 2023-04-22 14:01:10 -07:00
rebuilt
d3a28f7635 Fix ThreeATwo scraper 2023-04-22 14:01:01 -07:00
rebuilt
30e006ee0d Set a minimum threshold of 10 survey item responses to calculate scores per survey item 2023-04-22 14:00:51 -07:00
rebuilt
982f216c6c rename counts_by_grade method to the more descriptive enrollment_by_grade 2023-04-22 14:00:40 -07:00
rebuilt
07ed8dd259 Update logic for calculating student response rate. Remove references
to survey table. We no longer check or keep track of the survey type.
Instead we look in the database to see if a survey item has at least 10
responses. If it does, that survey item was presented to the respondent
and we count it, along with all responses, when calculating the response rate.

Remove response rate timestamp from caching logic because we no longer
add the response rate to the database. All response rates are calculated
on the fly.

Update three_b_two scraper to use teacher-only numbers.

Swap over to using https://profiles.doe.mass.edu/statereport/gradesubjectstaffing.aspx as the source of staffing information.
2023-04-22 14:00:20 -07:00
rebuilt
0bfde2805a temporarily turn off caching based on the response rate timestamp 2023-04-22 13:28:09 -07:00
rebuilt
63f453d258 Update seeder to use ActiveRecord imports 2023-04-22 13:20:26 -07:00
rebuilt
357c7427d1 Batch imports for staffing data 2023-04-22 13:19:34 -07:00
rebuilt
d272e48adc load total students and batch importing records 2023-04-22 13:18:50 -07:00
Nelson Jovel
98ba54c305 Convert dese::loader from using seeder to factories 2023-04-22 13:15:13 -07:00
Nelson Jovel
9228e3dbaf convert student loader from seeding to factories 2023-04-22 13:15:01 -07:00
rebuilt
049ae4548f Don't add another response rate record to the database. Just calculate on the fly. Otherwise we can run into a situation where we create two response rates for a given school/academic_year/subcategory 2023-04-08 14:49:21 -07:00
rebuilt
283845edd3 Add tests to make sure teacher survey items don't pollute the graphs for
students of unknown genders
2023-04-05 16:21:41 -07:00
rebuilt
7208cbb661 Add tests to make sure teacher survey items don't pollute the graphs for
students of unknown genders
2023-04-05 15:45:54 -07:00
rebuilt
63b1615bef Make sure when checking for sufficient gender responses that we only check student survey items 2023-04-04 17:03:45 -07:00
rebuilt
098110433a Remove duplicate line 2023-03-23 19:57:36 -07:00
rebuilt
5f58a5516e print the number of student and teacher respondents 2023-03-23 17:29:02 -07:00
rebuilt
f7818fc9f6 make sure test database is cleaned before and after every test 2023-03-23 15:55:07 -07:00
rebuilt
9749cd1abb Remove standard from gemfile 2023-03-23 13:05:09 -07:00
rebuilt
de478b7761 update reline 2023-03-17 19:06:26 -07:00
rebuilt
1290e52626 Correct branch name for ci 2023-03-17 19:02:28 -07:00