Commit graph

180 commits

Author SHA1 Message Date
Nelson Jovel
d041a5a567 chore: During cleaning, stop execution if grade column isn't found. Also stop execution if a duplicate header is found. Turn off spec for duplicate header check 2024-05-23 12:52:34 -07:00
Nelson Jovel
26bcc2c736 chore: speed up survey data loader 2024-05-23 12:24:27 -07:00
Nelson Jovel
2c07ea6317 chore: Add 'English Learner' and 'DirectCert' as valid column headers 2024-05-17 09:42:22 -07:00
Nelson Jovel
2bf22758c1 chore: add a way to print out errors in dese id when cleaning files 2024-05-16 15:10:21 -07:00
Nelson Jovel
8e7fbdfb2c add disaggregation glossary 2024-05-15 10:53:59 -07:00
Nelson Jovel
4c360c9e62 chore: add more rules for finding disaggregation columns and parsing sped values 2024-05-09 12:53:35 -07:00
Nelson Jovel
fdc4c49bf0 chore: copy changes from main 2024-05-09 11:20:24 -07:00
Nelson Jovel
cf5c4f2d13 feat: add seasonal academic years 2024-05-08 15:22:26 -07:00
Nelson Jovel
2561fa28fc feat: Split academic year into seasons if the academic year's range is
initialized with a season, e.g. "2024-25 Fall". Update scrapers for
admin data, enrollment and staffing to use the new range standard
correctly. Update the loaders for admin data, enrollment and staffing
so that they populate all seasons in a given year, so admin data for
2024-25 gets loaded into "2024-25 Fall" and "2024-25 Spring". Add tests
for the new range format. Set the default cutoff for the start of the Spring season to the last Sunday in February
2024-04-27 14:05:02 -07:00
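The cutoff rule in the commit above (Spring starts on the last Sunday in February) can be sketched roughly as follows. This is a minimal illustration, not the repository's code: the helper names `last_sunday_in_february` and `season_for` are hypothetical, and it assumes that the cutoff day itself counts as Spring and that the cutoff falls in the second calendar year of the range.

```ruby
require "date"

# Hypothetical helper: the last Sunday in February of a calendar year.
def last_sunday_in_february(year)
  last_day = Date.new(year, 2, -1) # day -1 means the last day of the month
  last_day - last_day.wday         # wday is 0 on Sunday, so step back to it
end

# Hypothetical helper: name the season a date falls in, given an academic
# year range written like "2024-25". Assumes the cutoff day is already Spring.
def season_for(date, range)
  start_year = Integer(range[0, 4])
  cutoff = last_sunday_in_february(start_year + 1)
  date < cutoff ? "#{range} Fall" : "#{range} Spring"
end
```

Under these assumptions, `season_for(Date.new(2025, 2, 23), "2024-25")` gives "2024-25 Spring", since 23 February 2025 is the last Sunday of that month.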
Nelson Jovel
67ffc996a8 Revert "Split academic year into seasons if the academic year's range is"
This reverts commit a5d4cccb37.
2024-04-26 13:48:30 -07:00
Nelson Jovel
a5d4cccb37 Split academic year into seasons if the academic year's range is
initialized with a season, e.g. "2024-25 Fall". Update scrapers for
admin data, enrollment and staffing to use the new range standard
correctly. Update the loaders for admin data, enrollment and staffing
so that they populate all seasons in a given year, so admin data for
2024-25 gets loaded into "2024-25 Fall" and "2024-25 Spring". Add tests
for the new range format. Set the default cutoff for the start of the Spring season to the last Sunday in February
2024-04-26 13:31:50 -07:00
6fac759ec2 Faster admin data loader + rename School.school_hash 2024-04-22 15:43:54 -04:00
deb70d4b88 Memoize admin data loader 2024-04-18 17:20:26 -04:00
Nelson Jovel
1e560294e1 fix: loosen restrictions on date format 2024-03-01 10:15:55 -08:00
Nelson Jovel
9bfb76db5a match an additional format for dates. Supported formats are now '1/10/2022 14:21:45', '2022-1-10T14:21:45' and '2022-1-10 14:21:45' 2024-03-01 09:30:43 -08:00
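As an illustration of the three formats named in the commit above, a parser along these lines would accept all of them. This is only a sketch: `RECORDED_DATE_PATTERNS` and `parse_recorded_date` are hypothetical names, and the first format is assumed to be month/day/year, in line with the "standard US date format" mentioned in a nearby commit.

```ruby
require "date"

# Hypothetical patterns for the three supported shapes.
RECORDED_DATE_PATTERNS = [
  # US style: 1/10/2022 14:21:45 (assumed month/day/year)
  %r{\A(?<month>\d{1,2})/(?<day>\d{1,2})/(?<year>\d{4}) (?<h>\d{1,2}):(?<mi>\d{2}):(?<s>\d{2})\z},
  # ISO-like: 2022-1-10T14:21:45 or 2022-1-10 14:21:45
  /\A(?<year>\d{4})-(?<month>\d{1,2})-(?<day>\d{1,2})[T ](?<h>\d{1,2}):(?<mi>\d{2}):(?<s>\d{2})\z/
].freeze

# Returns a DateTime for a recognized string, or nil when nothing matches.
def parse_recorded_date(text)
  RECORDED_DATE_PATTERNS.each do |pattern|
    m = pattern.match(text.strip)
    next unless m

    return DateTime.new(m[:year].to_i, m[:month].to_i, m[:day].to_i,
                        m[:h].to_i, m[:mi].to_i, m[:s].to_i)
  end
  nil
end
```

Using explicit patterns rather than a permissive parser keeps unrecognized inputs visible (they come back as `nil`) instead of being guessed at.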
Nelson Jovel
c3cb05701f feat: Support two date formats: ISO 8601 and the standard US date format
used in google sheets
2024-02-27 11:57:20 -08:00
Nelson Jovel
23d23c4962 chore: support alternate format for recorded date 2024-02-23 14:21:53 -08:00
Nelson Jovel
be5b63637b chore: alphabetize districts when writing filename for cleaned files 2024-02-23 11:28:30 -08:00
Nelson Jovel
cc8ed48204 fix: no longer support 'form' in filename when cleaning. Only look for 'part X' and add that to the filename if it exists 2024-02-22 12:02:25 -08:00
Nelson Jovel
080a054cad chore: move code that normalizes race data to the Race class 2023-12-20 19:07:39 -08:00
Nelson Jovel
c565dbe46f chore: remove lowell seeding rules 2023-12-20 14:06:53 -08:00
Nelson Jovel
a0c0b1d01d chore: reenable test spec that tests data loader for races 2023-12-20 12:40:04 -08:00
Nelson Jovel
d4109fda6f chore: refactor Race out of survey_item_values 2023-12-20 12:27:53 -08:00
Nelson Jovel
3e4ef9cb08 chore: refactor Gender out of survey_item_values row 2023-12-20 12:27:44 -08:00
Nelson Jovel
6e7e64c039 chore: rename counts_by_grade to enrollment_by_grade for clarity 2023-12-18 15:28:29 -08:00
Nelson Jovel
6a24d4fa23 chore: Make sure 'hispanic' column only gets applied when using SIS race information 2023-12-18 15:26:21 -08:00
Nelson Jovel
f28713d1fe chore: remove unneeded comment 2023-12-18 15:25:46 -08:00
Nelson Jovel
ce43f52bd5 feat: if the filename includes the words 'form' or 'part' add that to the resulting cleaned filename 2023-12-18 15:25:36 -08:00
Nelson Jovel
8a0ba0dbea chore: various fixes for race and gender categorization during cleaning.
Also add tests for race and gender categorization
2023-12-08 14:22:33 -08:00
Nelson Jovel
8364d26e11 feat: Support SIS categories for race in the form of 'White and Asian'
or 'Black, Asian, and white'
2023-12-08 09:36:48 -08:00
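Splitting SIS race strings of the two shapes quoted in the commit above might look like the following. This is a hedged sketch, not the project's parser: `split_sis_races` is a hypothetical name, and the mapping from the resulting names to Qualtrics codes is deliberately left out because the log does not give it.

```ruby
# Hypothetical helper: break a combined SIS race string into lowercase names.
# Splits on commas (optionally followed by "and") and on a standalone "and".
def split_sis_races(text)
  text.split(/\s*,\s*(?:and\s+)?|\s+and\s+/i)
      .map { |part| part.strip.downcase }
      .reject(&:empty?)
end
```

Both example strings from the commit message yield lists of individual races, for instance `split_sis_races("White and Asian")` returns `["white", "asian"]`. One limitation of this approach is that a single category whose own name contains the word "and" would be split in two, so a real implementation would need to handle such names first.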
Nelson Jovel
55b73279bf feat: add 2023-24 academic year and make sure previous year enrollment
and staffing data get loaded when missing
2023-12-08 09:35:53 -08:00
Nelson Jovel
2ef24caf70 Lower threshold for the number of valid student responses from 17 to 11 2023-12-06 13:56:14 -08:00
Nelson Jovel
6365cad71b chore: refactor code to put logic into models 2023-12-06 13:55:37 -08:00
Nelson Jovel
f27a590c5a Convert gender and race text into qualtrics codes during cleaning. Abide by 'prefer not to disclose' for self reported race. Give priority to self reported data but use SIS information as backup 2023-11-30 20:57:04 -08:00
Nelson Jovel
81b683e95c chore: Add comment about StaffingLoader also cloning enrollment data when it clones staffing data 2023-11-09 14:34:31 -08:00
Nelson Jovel
11617abd89 fix: make sure to grab the 'Gender- Qcode' column 2023-11-07 10:15:16 -08:00
Nelson Jovel
3d9bbb52eb fix: make sure to populate total_students when loading enrollment 2023-11-06 23:06:15 -08:00
Nelson Jovel
123b570bb9 Detect and print message when there are duplicate headers during
cleaning process
2023-11-06 21:55:37 -08:00
Nelson Jovel
6d84204f83 Add race and gender columns to cleaned csv files when those headers are
missing
2023-11-06 20:30:51 -08:00
Nelson Jovel
a11a134805 fix: also load enrollment data when loading staffing information for the latest year 2023-11-06 20:28:27 -08:00
Nelson Jovel
1b82ff9413 chore: since it's now possible for there to be multiple district and dese id columns separated by a dash and a number, be more explicit when we only want to filter out survey item ids that end in a -1 2023-11-06 14:51:56 -08:00
Nelson Jovel
a3f9e46414 chore: when searching for dese id, split up the pattern to be more explicit about the order in which to search the columns that might have the dese ID we're looking for. 2023-11-06 13:13:37 -08:00
rebuilt
02b335ef63 feat: reduce number of rows to process at one time to reduce memory use 2023-11-02 12:21:07 -07:00
rebuilt
019b954ffa feat: load student responses in the same pass as loading the survey responses
chore: remove student loader since loading students is now done with the survey response loader
2023-11-02 11:38:03 -07:00
rebuilt
b2fdbe5756 feat: We no longer trust the progress number that gets exported from qualtrics. Instead, during the cleaning process, perform a manual count of the number of responses to filter out rows that don't meet the minimum threshold. 2023-10-27 15:12:58 -07:00
rebuilt
ef44c41965 feat: add special education disaggregation 2023-10-24 13:05:57 -07:00
rebuilt
18ab51c860 chore: upgrade to rails 7.1.
upgrade rspec
fix failing tests
upgrade devise
2023-10-24 13:04:05 -07:00
rebuilt
2fd56047d4 Add disaggregation by ELL 2023-10-24 12:51:12 -07:00
rebuilt
490522eb1e feat: support multiple columns for race and gender information 2023-10-24 10:27:39 -07:00
rebuilt
ce5340648f fix: hide scores on analyze page for scores that don't meet the student threshold of 25% 2023-10-24 10:27:25 -07:00