Nelson Jovel
ed3ac25a7b
chore: During cleaning, stop execution if grade column isn't found. Also stop execution if a duplicate header is found. Turn off spec for duplicate header check
2024-05-23 12:52:13 -07:00
Nelson Jovel
33da0859b9
Split academic year into seasons if the academic year's range is
initialized with a season, e.g. "2024-25 Fall". Update scrapers for
admin data, enrollment and staffing to use the new range standard
correctly. Update the loaders for admin data, enrollment and staffing
so that they populate all seasons in a given year. So admin data for
2024-25 gets loaded into "2024-25 Fall" and "2024-25 Spring". Add tests
for the new range format. Set the default cutoff for the start of the Spring season to the last Sunday in February
2024-04-25 09:21:04 -07:00
5789ebf564
Faster admin data loader + rename School.school_hash
2024-04-22 14:46:37 -04:00
Nelson Jovel
d6735d449d
feat: Support two date formats: ISO 8601 and the standard US date format
used in Google Sheets
2024-02-27 11:55:47 -08:00
Nelson Jovel
9696a2b2fa
fix: fix failing test
2024-02-23 11:54:32 -08:00
Nelson Jovel
0a32fb50ff
fix: no longer support 'form' in filename when cleaning. Only look for 'part X' and add that to the filename if it exists
2024-02-22 10:55:48 -08:00
Nelson Jovel
e7fb009425
chore: refactor Gender out of survey_item_values row
2023-12-20 11:08:23 -08:00
Nelson Jovel
f028e6c884
feat: if the filename includes the word 'form' or 'part', add it to the resulting cleaned filename
2023-12-11 15:39:20 -08:00
Nelson Jovel
e325f38c43
Convert gender and race text into qualtrics codes during cleaning. Abide by 'prefer not to disclose' for self reported race. Give priority to self reported data but use SIS information as backup
2023-12-06 14:10:16 -08:00
Nelson Jovel
305ddf2b1a
chore: add test for checking duplicate headers during cleaning process
2023-12-06 14:10:08 -08:00
Nelson Jovel
b63c327d33
chore: when searching for dese id, split up the pattern to be more explicit about the order in which to search the columns that might have the dese ID we're looking for.
2023-11-06 13:15:50 -08:00
rebuilt
e3fbbabce5
feat: We no longer trust the progress number that gets exported from qualtrics. Instead, during the cleaning process, perform a manual count of the number of responses to filter out rows that don't meet the minimum threshold.
2023-10-27 15:12:24 -07:00
rebuilt
48e795fcfb
feat: add special education disaggregation
2023-10-06 11:41:52 -07:00
rebuilt
060d7aa55a
Add disaggregation by ELL
2023-09-29 19:29:23 -07:00
rebuilt
714b90b3eb
fix: ensure cleaner outputs columns for all survey items. Before the fix, if a survey item variant (ending in -1, e.g. s-tint-q1-1) did not have a matching survey item s-tint-q1, the resulting csv would not include that column
2023-08-23 15:30:47 -07:00
rebuilt
a785c69c44
Add Overall Response Rate
2023-08-09 15:13:58 -07:00
Nelson Jovel
0a2c5e02c5
feat: add ability to merge disaggregation data with raw survey data to
produce a cleaned csv with merged income disaggregation columns
2023-06-20 12:22:24 -07:00
rebuilt
e3ae12b425
update response_date to recorded_date
2023-05-31 16:57:47 -07:00
rebuilt
4509c157fa
Add automated data cleaning. Modify SurveyItemValues class to use regex
instead of hard coded values. Produce a clean csv and a csv with all
the removed values and columns with reason for removal. Add script for
running cleaning for each project
2023-05-16 13:38:29 -07:00