Commit graph

126 commits

Author SHA1 Message Date
rebuilt
245cde85cd Add disaggregation by ELL 2023-10-17 11:18:47 -07:00
rebuilt
6db93cb116 feat: Rename income labels to 'Economically Disadvantaged' and 'Not Economically Disadvantaged' 2023-08-09 12:13:04 -07:00
rebuilt
a1b580048b chore: remove precalculated race scores. Calculate race scores on every reload 2023-08-08 15:54:15 -07:00
rebuilt
76ebcc6ef3 feat: Add income table to the database. Add seeder for income. Add a reference to income from survey item response. Update the loader to import income data from the survey response csv. Refactor analyze controller to extract presenter. Add corresponding specs. Add income graph to analyze page 2023-08-08 15:52:12 -07:00
rebuilt
7373e2e52f fix: Filter out responses that don't correspond to the grades the school serves 2023-08-08 11:46:14 -07:00
rebuilt
1265a164b9 feat: add popover to analyze graphs that displays the n-size of the different columns. Make sure to only calculate a score for a race if there are more than 10 respondents to a question. 2023-08-08 11:43:40 -07:00
rebuilt
e42aa05324 chore: remove outdated admin data loader file. We now use Dese::Loader to load school level data 2023-07-21 12:58:01 -07:00
rebuilt
8f276a5f1a feat: if admin data value is above 5, round down to 5 2023-07-21 12:55:39 -07:00
rebuilt
22cc303a95 fix: There was an n+1 problem where we looked up the list of schools for
every row. Now we query the list of schools just once per file
2023-06-26 11:25:32 -07:00
rebuilt
784e23982e feat: update survey_item_response table to indlude recorded date and import recorded date when loading responses 2023-06-23 11:28:14 -07:00
rebuilt
25a2698ac9 Fix: make sure values don't get reordered after copying over row values from survey item variants. This fixes a problem where cleaner would produce a row with likert scores that got shifted to align with the wrong column 2023-06-08 09:27:44 -07:00
rebuilt
f7c40c2da2 Fix: enable correct detection of student surveys types by rejecting any
headers ending with '-1' (the variants of standard questions)
2023-06-07 12:40:14 -07:00
rebuilt
2445642586 Don't write a file if there's an empty dataset 2023-06-07 12:40:03 -07:00
rebuilt
6b2bceceb6 Fix: Parse headers when they are surrounded by quotes. This helps load recent csv files correctly 2023-06-06 18:29:47 -07:00
rebuilt
ce76c979a4 Add scraper for 3B-i student/#courses ratio 2023-06-05 11:39:08 -07:00
rebuilt
904d0d2f2c It's possible for admin data likert score values to be above 5. If that happens, we
cap the likert score at 5.   This was happening already at the scraper
level but it's also now being done by the admin data loader for safety.
Also make sure to just update admin data instead of deleting and
reloading all values each load. Add tests to confirm this behavior
2023-06-03 16:06:50 -07:00
rebuilt
3589878700 Don't check standard deviation for early education surveys 2023-06-02 16:10:35 -07:00
rebuilt
89295f8832 Missing progress or duration information does not result in a row removed in the cleaning process 2023-06-02 15:13:53 -07:00
rebuilt
6022739f07 use district short name when writing filename 2023-05-31 17:12:40 -07:00
rebuilt
f749b96006 update response_date to recorded_date 2023-05-31 17:07:31 -07:00
rebuilt
9d0f8659f1 Adjust valid duration threshold of short form items 2023-05-26 19:01:33 -07:00
rebuilt
37e932e078 Fix regression in student loader 2023-05-24 12:04:39 -07:00
rebuilt
0dfc9726d0 Add automated data cleaning. Modify SurveyItemValues class to use regex
instead of hard coded values.  Produce a clean csv and a csv with all
the removed values and columns with reason for removal. Add script for
running cleaning for each project
2023-05-24 11:59:53 -07:00
rebuilt
a066f464c7 fix failing tests 2023-04-30 16:35:08 -07:00
rebuilt
f1022728fa Fix problem with dese scraper lumping in 2021-22 data as 2022-23 data.
Deleted unused csvs.  Turned off puts statements in admin loader.
Remove old, now unused admin data loader class.
2023-04-30 12:04:20 -07:00
rebuilt
798ba1f340 Only return files in sftp directory, not other directories 2023-04-30 11:55:45 -07:00
rebuilt
001d3083c8 Calculate response rate on the fly instead of looking it up from the db
when calculating response rates.
2023-04-22 14:03:22 -07:00
rebuilt
cee7aa4c59 Remove unused filename 2023-04-22 14:01:21 -07:00
rebuilt
d3a28f7635 Fix ThreeATwo scraper 2023-04-22 14:01:01 -07:00
rebuilt
07ed8dd259 Update logic for calculating student response rate. Remove references
to survey table.  We no longer check or keep track of the survey type.
Instead we look in the database to see if a survey item has at least 10
responses.  If it does, that survey item was presented to the respondent
and we count it, and all responses when calculating the response rate.

Remove response rate timestamp from caching logic because we no longer
add the response rate to the database. All response rates are calculated
on the fly

Update three_b_two scraper to use teacher only numbers

swap over to using https://profiles.doe.mass.edu/statereport/gradesubjectstaffing.aspx as the source of staffing information
2023-04-22 14:00:20 -07:00
rebuilt
357c7427d1 Batch imports for staffing data 2023-04-22 13:19:34 -07:00
rebuilt
d272e48adc load total students and batch importing records 2023-04-22 13:18:50 -07:00
rebuilt
cf2b2433e9 Use an sftp uri unique to MCIEA 2023-02-19 19:36:14 -08:00
rebuilt
a5da0fb0c6 Fix bug with not all survey responses loading when using sftp loader 2023-02-19 19:36:14 -08:00
rebuilt
ef087a6cd0 update default folder for survey responses 2023-02-19 19:36:14 -08:00
rebuilt
47c1856281 Process 1000 rows at a time to limit memory usage in production 2023-02-19 19:36:14 -08:00
rebuilt
640de1c8df Don't print sftptogo_url 2023-02-19 19:36:14 -08:00
rebuilt
380048f5cd Load survey responses from sftp folder 2023-02-19 19:36:14 -08:00
rebuilt
6f986ff8a1 add proof of concept sftp downloader 2023-02-19 19:36:05 -08:00
rebuilt
d5533bb756 Support survey item variants. Survey item id's in the format s-sbel-q1-1 will load into s-sbel-q1 2023-01-12 17:02:05 -08:00
rebuilt
ed421296c7 Fix bug that caused an incorrect score of 1 to be assigned for admin
data item related to parity index of teachers of color to students of
color.  Now if there's not enough data to calculate a score, it produces
'NA' instead of 1. Also added admin data for a-reso-i1 and a-curv-i1
2023-01-06 17:43:48 -08:00
rebuilt
7b9abe3793 Scrape latest data from DESE 2022-12-20 15:26:11 -08:00
rebuilt
850ef9d1cd Make sure demographic loader doesn't break when changing the designation of a Race 2022-12-12 12:03:53 -08:00
rebuilt
53a08d41bf Change label for unknown race 2022-12-10 15:44:56 -08:00
rebuilt
a1b7db9b2e Add ability to load student demographic information for only lowell 2022-12-03 11:33:11 -08:00
rebuilt
d49cf918ac Add a way to seed only lowell schools 2022-12-03 10:23:02 -08:00
rebuilt
fc9e73f656 make sure survey responses for Butler get loaded correctly 2022-12-02 14:11:54 -08:00
rebuilt
9bbba64e9e Add rules to the survey data loader to allow skipping non-lowell schools 2022-12-01 16:15:45 -08:00
Nelson Jovel
43bc55ed32 Query for survey items once instead of on each row 2022-11-28 11:58:58 -08:00
rebuilt
83fa255e5d Make utility method private 2022-11-25 03:50:17 -08:00