Commit graph

192 commits

Author SHA1 Message Date
Nelson Jovel
e5e969b968 feat: add parent survey gauges 2024-09-16 15:24:43 -07:00
Nelson Jovel
d2968c5cef chore: add another variant for column name 2024-06-26 14:19:25 -07:00
Nelson Jovel
ed3ac25a7b chore: During cleaning, stop execution if grade column isn't found. Also stop execution if a duplicate header is found. Turn off spec for duplicate header check 2024-05-23 12:52:13 -07:00
Nelson Jovel
532ca39e84 chore: speed up survey data loader 2024-05-23 12:23:52 -07:00
Nelson Jovel
d820172ff7 chore: Add 'English Learner' and 'DirectCert' as valid column headers 2024-05-17 09:41:18 -07:00
Nelson Jovel
c2c8ce3b8d chore: add a way to print out errors in dese id when cleaning files 2024-05-16 15:09:14 -07:00
Nelson Jovel
ea2feb138b add disaggregation glossary 2024-05-15 10:15:16 -07:00
Nelson Jovel
e60a30911c chore: add more rules for finding disaggregation columns and parsing sped values 2024-05-09 12:53:13 -07:00
Nelson Jovel
8bb6f5e8f0 Add ell income and sped parsing rules for SIS data. Add tests for the
new inputs.
2024-05-07 18:46:53 -07:00
Nelson Jovel
33da0859b9 Split academic year into seasons if the academic year's range is
initialized with a season, i.e. "2024-25 Fall".  Update scrapers for
admin data, enrollment and staffing to use the new range standard
correctly.   Update the loaders for admin data, enrollment and staffing
so that they populate all seasons in a given year.  So admin data for
2024-25 gets loaded into "2024-25 Fall" and "2024-25 Spring".  Add tests
for the new range format.  Set the default cutoff for the start of the Spring season to the last Sunday in February
2024-04-25 09:21:04 -07:00
5789ebf564 Faster admin data loader + rename School.school_hash 2024-04-22 14:46:37 -04:00
04d61c0b1a Memoize admin data loader 2024-04-18 17:13:03 -04:00
Nelson Jovel
2cb43058a0 fix: loosen restrictions on date format 2024-03-01 10:15:32 -08:00
Nelson Jovel
289b04bc69 match an additional format for Dates. Supported dates are now '1/10/2022 14:21:45' '2022-1-10T14:21:45' '2022-1-10 14:21:45' 2024-03-01 09:30:23 -08:00
Nelson Jovel
d6735d449d feat: Support two date formats: ISO 8601 and the standard US date format
used in google sheets
2024-02-27 11:55:47 -08:00
Nelson Jovel
03349b0ed5 chore: support alternate format for recorded date 2024-02-23 14:21:16 -08:00
Nelson Jovel
37610e732f chore: alphabetize districts when writing filename for cleaned files 2024-02-23 11:27:59 -08:00
Nelson Jovel
0a32fb50ff fix: no longer support 'form' in filename when cleaning. Only look for 'part X' and add that to the filename if it exists 2024-02-22 10:55:48 -08:00
Nelson Jovel
ebf4ca4166 chore: move code that normalizes race data to the Race class 2023-12-20 19:07:16 -08:00
Nelson Jovel
56b512810c chore: remove lowell seeding rules 2023-12-20 14:03:15 -08:00
Nelson Jovel
880b438eb4 chore: reenable test spec that tests data loader for races 2023-12-20 12:39:44 -08:00
Nelson Jovel
36e21515c3 chore: refactor Race out of survey_item_values 2023-12-20 12:25:23 -08:00
Nelson Jovel
e7fb009425 chore: refactor Gender out of survey_item_values row 2023-12-20 11:08:23 -08:00
Nelson Jovel
ea079e927b chore: rename counts_by_grade to enrollment_by_grade for clarity 2023-12-18 15:01:33 -08:00
Nelson Jovel
41d942c214 chore: Make sure 'hispanic' column only gets applied when using SIS race information 2023-12-12 10:53:07 -08:00
Nelson Jovel
24dbf33bcd chore: remove unneeded comment 2023-12-11 15:55:00 -08:00
Nelson Jovel
f028e6c884 feat: if the filename includes the words 'form' or 'part' add that to the resulting cleaned filename 2023-12-11 15:39:20 -08:00
Nelson Jovel
3f44613085 chore: various fixes for race and gender categorization during cleaning.
Also add tests for race and gender categorization
2023-12-08 13:12:19 -08:00
Nelson Jovel
883731bce1 feat: Support SIS categories for race in the form of 'White and Asian'
or 'Black, Asian, and white'
2023-12-07 13:40:02 -08:00
Nelson Jovel
0931950eaf chore: make sure to output race and gender columns during cleaning 2023-12-07 13:39:25 -08:00
Nelson Jovel
3db0f9f757 fix: Print out message to make clean when there are duplicate headers
present in the raw survey file
2023-12-07 12:28:24 -08:00
Nelson Jovel
6541b87e9c feat: add 2023-24 academic year and make sure previous year enrollment
and staffing data get loaded when missing
2023-12-07 12:27:19 -08:00
Nelson Jovel
b7e670bb60 Lower threshold for the number of valid student responses from 17 to 11 2023-12-06 14:15:19 -08:00
Nelson Jovel
7dc881f654 chore: refactor code to put logic into models 2023-12-06 14:14:56 -08:00
Nelson Jovel
e325f38c43 Convert gender and race text into qualtrics codes during cleaning. Abide by 'prefer not to disclose' for self reported race. Give priority to self reported data but use SIS information as backup 2023-12-06 14:10:16 -08:00
Nelson Jovel
9efc1f41c6 chore: Add comment about StaffingLoader also cloning enrollment data when it clones staffing data 2023-12-06 14:09:54 -08:00
Nelson Jovel
f6f78bcd58 fix: make sure to grab the 'Gender- Qcode' column 2023-12-06 14:09:13 -08:00
Nelson Jovel
8bebe7db42 chore: since it's now possible for there to be multiple district and dese id columns separated by a dash and a number, be more explicit when we only want to filter out survey item ids that end in a -1 2023-11-06 14:52:21 -08:00
Nelson Jovel
b63c327d33 chore: when searching for dese id, split up the pattern to be more explicit about the order in which to search the columns that might have the dese ID we're looking for. 2023-11-06 13:15:50 -08:00
rebuilt
cddea60c8b feat: reduce number of rows to process at one time to reduce memory use 2023-11-02 12:21:32 -07:00
rebuilt
1a707eb6bc feat: load student responses in the same pass as loading the survey responses
chore: remove student loader since loading students is now done with the survey response loader
2023-11-02 09:52:39 -07:00
rebuilt
e3fbbabce5 feat: We no longer trust the progress number that gets exported from qualtrics. Instead, during the cleaning process, perform a manual count of the number of responses to filter out rows that don't meet the minimum threshold. 2023-10-27 15:12:24 -07:00
rebuilt
83661540b7 chore: upgrade to rails 7.1.
upgrade rspec

fix failing tests

upgrade devise
2023-10-11 10:58:52 -07:00
rebuilt
48e795fcfb feat: add special education disaggregation 2023-10-06 11:41:52 -07:00
rebuilt
060d7aa55a Add disaggregation by ELL 2023-09-29 19:29:23 -07:00
rebuilt
abea2cb8fa feat: support multiple columns for race and gender information 2023-08-25 15:37:20 -07:00
rebuilt
463e4c9452 fix: hide scores on analyze page for scores that don't meet the student threshold of 25% 2023-08-23 15:56:46 -07:00
rebuilt
714b90b3eb fix: ensure cleaner outputs columns for all survey items. Before the fix, if a survey item variant (ending in -1, i.e. s-tint-q1-1) did not have a matching survey item s-tint-q1, the resulting csv would not include that column 2023-08-23 15:30:47 -07:00
rebuilt
2321897283 fix: start fixing problem with variants not getting added to the cleaned csv 2023-08-23 15:30:47 -07:00
rebuilt
a785c69c44 Add Overall Response Rate 2023-08-09 15:13:58 -07:00