Nelson Jovel
e5e969b968
feat: add parent survey gauges
1 year ago
Nelson Jovel
ed3ac25a7b
chore: During cleaning, stop execution if grade column isn't found. Also stop execution if a duplicate header is found. Turn off spec for duplicate header check
2 years ago
Nelson Jovel
c2c8ce3b8d
chore: add a way to print out errors in dese id when cleaning files
2 years ago
Nelson Jovel
33da0859b9
Split academic year into seasons if the academic year's range is
...
initialized with a season, i.e. "2024-25 Fall". Update scapers for
admin data, enrollment and staffing to use the new range standard
correctly. Update the loaders for admin data, enrollment and staffing
so that it populates all seasons in a given year. So admin data for
2024-25 gets loaded into "2024-25 Fall" and "2024-25 Spring". Add tests
for the new range format. Set the default cutoff for the start of Spring season will be the last Sunday in February
2 years ago
Gabe Farrell
5789ebf564
Faster admin data loader + rename School.school_hash
2 years ago
Nelson Jovel
37610e732f
chore: alphabetize districts when writing filename for cleaned files
2 years ago
Nelson Jovel
0a32fb50ff
fix: no longer support 'form' in filename when cleaning. Only look for 'part X' and add that to the filename if it exists
2 years ago
Nelson Jovel
e7fb009425
chore: refactor Gender out out of survey_item_values row
2 years ago
Nelson Jovel
24dbf33bcd
chore: remove unneeded comment
2 years ago
Nelson Jovel
f028e6c884
feat: if the filename includes the words 'form' or 'part' add that to the resulting cleaned filename
2 years ago
Nelson Jovel
3db0f9f757
fix: Print out message to make clean when there are duplicate headers
...
present in the raw survey file
2 years ago
Nelson Jovel
e325f38c43
Convert gender and race text into qualtrics codes during cleaning. Abide by 'prefer not to disclose' for self reported race. Give priority to self reported data but use SIS information as backup
2 years ago
Nelson Jovel
8bebe7db42
chore: since it's now possible for there to be multiple district and dese id columns separated by a dash and a number, be more explicit when we only want to filter out survey item ids that end in a -1
2 years ago
rebuilt
e3fbbabce5
feat: We no longer trust the progress number that gets exported from qualtrics. Instead during the cleaning progress, perform a manual count of the number of responses to filter out rows that don't meet the minimum threshold.
2 years ago
rebuilt
48e795fcfb
feat: add special education disaggregation
2 years ago
rebuilt
060d7aa55a
Add disaggregation by ELL
2 years ago
rebuilt
714b90b3eb
fix: ensure cleaner outputs columns for all survey items. Before the fix, if a survey item varient (ending in -1, ie s-tint-q1-1) did not have a matching survey item s-tint-q1, the resulting csv would not include that column
2 years ago
Nelson Jovel
0a2c5e02c5
feat: add ability to merge disaggregation data with raw survey data to
...
produce a cleaned csv with merged income disaggregation columns
3 years ago
rebuilt
896f0d9961
Don't write a file if there's an empty dataset
3 years ago
rebuilt
93d087a5de
Use short district name for cleaned csv
3 years ago
rebuilt
4509c157fa
Add automated data cleaning. Modify SurveyItemValues class to use regex
...
instead of hard coded values. Produce a clean csv and a csv with all
the removed values and columns with reason for removal. Add script for
running cleaning for each project
3 years ago