Commit graph

156 commits

Author SHA1 Message Date
Nelson Jovel
d4db4e0fa2 add disaggregation glossary 2024-05-15 10:54:34 -07:00
Nelson Jovel
5851ab1cbf chore: add parsing rules for disaggregation columns; ell, income, sped. Add tests for new rules 2024-05-09 19:16:57 -07:00
Nelson Jovel
6a9d04f7ff fix: make sure enrollment loader works 2024-04-27 14:34:07 -07:00
Nelson Jovel
ac5c1edfbf feat: Split academic year into seasons if the academic year's range is
initialized with a season, i.e. "2024-25 Fall".  Update scapers for
admin data, enrollment and staffing to use the new range standard
correctly.   Update the loaders for admin data, enrollment and staffing
so that it populates all seasons in a given year.  So admin data for
2024-25 gets loaded into "2024-25 Fall" and "2024-25 Spring".  Add tests
for the new range format.  Set the default cutoff for the start of Spring season will be the last Sunday in February
2024-04-27 14:24:05 -07:00
b3e6efdb2e Faster admin data loader + rename School.school_hash 2024-04-22 16:21:43 -04:00
b5f90376f7 Memoize admin data loader 2024-04-18 17:23:49 -04:00
Nelson Jovel
3c3b0d2cf5 fix: loosen restrictions on date format 2024-03-01 10:16:15 -08:00
Nelson Jovel
bd1a7b3877 match an additional format for Dates. Supported dates are now '1/10/2022 14:21:45' '2022-1-10T14:21:45' '2022-1-10 14:21:45' 2024-03-01 09:30:50 -08:00
Nelson Jovel
c4d8b927af feat: Support two date formats: ISO 8601 and the standard US date format
used in google sheets
2024-02-27 12:00:45 -08:00
Nelson Jovel
7d527b5639 chore: support alternate format for recorded date 2024-02-23 14:23:46 -08:00
Nelson Jovel
8d596faeeb chore: alphabetize districts when writing filename for cleaned files 2024-02-23 11:28:42 -08:00
Nelson Jovel
c6c9fd38de fix: batch in smaller size to stay within memory limits 2024-02-23 10:34:18 -08:00
Nelson Jovel
e97e7047b1 fix: no longer support 'form' in filename when cleaning. Only look for 'part X' and add that to the filename if it exists 2024-02-22 12:04:36 -08:00
Nelson Jovel
e1fc5dbbaa feat: add 2023-24 academic year and make sure previous year enrollment
and staffing data get loaded when missing
2024-02-20 21:06:55 -08:00
Nelson Jovel
2a125aa058 chore: bring in some refactoring changes that didn't get copied over from the main branch 2023-12-20 19:27:06 -08:00
Nelson Jovel
75451648e5 chore: move code that normalizes race data to the Race class 2023-12-20 19:07:52 -08:00
Nelson Jovel
e7f06ef4e9 chore: remove lowell seeding rules 2023-12-20 14:08:38 -08:00
Nelson Jovel
bbefdcb5bb chore: remove student loader since we load race information in data loader 2023-12-20 13:47:09 -08:00
Nelson Jovel
063810a9d7 chore: make sure to load races in data loader instead of student loader 2023-12-20 13:39:03 -08:00
Nelson Jovel
381625f58b chore: reenable test spec that tests data loader for races 2023-12-20 12:40:22 -08:00
Nelson Jovel
317fe3239a chore: refactor Race out of survey_item_values 2023-12-20 12:35:14 -08:00
Nelson Jovel
a295b8afb9 chore: refactor Gender out out of survey_item_values row 2023-12-20 12:33:18 -08:00
Nelson Jovel
0359dae88a chore: rename counts_by_grade to enrollment_by_grade for clarity 2023-12-18 15:59:30 -08:00
Nelson Jovel
ee5e53f992 chore: Make sure 'hispanic' column only gets applied when using SIS race information 2023-12-18 15:58:54 -08:00
Nelson Jovel
6f265302ea feat: if the filename includes the words 'form' or 'part' add that to the resulting cleaned filename 2023-12-18 15:58:40 -08:00
Nelson Jovel
fe039e3d04 chore: various fixes for race and gender categorization during cleaning.
Also add tests for race and gender categorization
2023-12-18 15:38:31 -08:00
rebuilt
a07728fcd6 feat: We no longer trust the progress number that gets exported from qualtrics. Instead during the cleaning progress, perform a manual count of the number of responses to filter out rows that don't meet the minimum threshold. 2023-10-27 15:13:17 -07:00
rebuilt
9d680c5159 chore: upgrade to rails 7.1.
upgrade rspec

fix failing tests

upgrade devise
2023-10-17 12:34:11 -07:00
rebuilt
acfdaf5587 feat: add special education disaggregation 2023-10-17 12:29:00 -07:00
rebuilt
5bdffec8f9 Add disaggregation by ELL 2023-10-17 12:06:19 -07:00
rebuilt
245cde85cd Add disaggregation by ELL 2023-10-17 11:18:47 -07:00
rebuilt
6db93cb116 feat: Rename income labels to 'Economically Disadvantaged' and 'Not Economically Disadvantaged' 2023-08-09 12:13:04 -07:00
rebuilt
a1b580048b chore: remove precalculated race scores. Calculate race scores on every reload 2023-08-08 15:54:15 -07:00
rebuilt
76ebcc6ef3 feat: Add income table to the database. Add seeder for income. Add a reference to income from survey item response. Update the loader to import income data from the survey response csv. Refactor analyze controller to extract presenter. Add corresponding specs. Add income graph to analyze page 2023-08-08 15:52:12 -07:00
rebuilt
7373e2e52f fix: Filter out responses that don't correspond to the grades the school serves 2023-08-08 11:46:14 -07:00
rebuilt
1265a164b9 feat: add popover to analyze graphs that displays the n-size of the different columns. Make sure to only calculate a score for a race if there are more than 10 respondents to a question. 2023-08-08 11:43:40 -07:00
rebuilt
e42aa05324 chore: remove outdated admin data loader file. We now use Dese::Loader to load school level data 2023-07-21 12:58:01 -07:00
rebuilt
8f276a5f1a feat: if admin data value is above 5, round down to 5 2023-07-21 12:55:39 -07:00
rebuilt
22cc303a95 fix: There was an n+1 problem where we looked up the list of schools for
every row. Now we query the list of schools just once per file
2023-06-26 11:25:32 -07:00
rebuilt
784e23982e feat: update survey_item_response table to indlude recorded date and import recorded date when loading responses 2023-06-23 11:28:14 -07:00
rebuilt
25a2698ac9 Fix: make sure values don't get reordered after copying over row values from survey item variants. This fixes a problem where cleaner would produce a row with likert scores that got shifted to align with the wrong column 2023-06-08 09:27:44 -07:00
rebuilt
f7c40c2da2 Fix: enable correct detection of student surveys types by rejecting any
headers ending with '-1' (the variants of standard questions)
2023-06-07 12:40:14 -07:00
rebuilt
2445642586 Don't write a file if there's an empty dataset 2023-06-07 12:40:03 -07:00
rebuilt
6b2bceceb6 Fix: Parse headers when they are surrounded by quotes. This helps load recent csv files correctly 2023-06-06 18:29:47 -07:00
rebuilt
ce76c979a4 Add scraper for 3B-i student/#courses ratio 2023-06-05 11:39:08 -07:00
rebuilt
904d0d2f2c It's possible for admin data likert score values to be above 5. If that happens, we
cap the likert score at 5.   This was happening already at the scraper
level but it's also now being done by the admin data loader for safety.
Also make sure to just update admin data instead of deleting and
reloading all values each load. Add tests to confirm this behavior
2023-06-03 16:06:50 -07:00
rebuilt
3589878700 Don't check standard deviation for early education surveys 2023-06-02 16:10:35 -07:00
rebuilt
89295f8832 Missing progress or duration information does not result in a row removed in the cleaning process 2023-06-02 15:13:53 -07:00
rebuilt
6022739f07 use district short name when writing filename 2023-05-31 17:12:40 -07:00
rebuilt
f749b96006 update response_date to recorded_date 2023-05-31 17:07:31 -07:00