Commit graph

11 commits

Author SHA1 Message Date
Nelson Jovel
f28713d1fe chore: remove unneeded comment 2023-12-18 15:25:46 -08:00
Nelson Jovel
ce43f52bd5 feat: if the filename includes the words 'form' or 'part' add that to the resulting cleaned filename 2023-12-18 15:25:36 -08:00
Nelson Jovel
f27a590c5a Convert gender and race text into qualtrics codes during cleaning. Abide by 'prefer not to disclose' for self reported race. Give priority to self reported data but use SIS information as backup 2023-11-30 20:57:04 -08:00
Nelson Jovel
123b570bb9 Detect and print message when there are duplicate headers during cleaning process 2023-11-06 21:55:37 -08:00
Nelson Jovel
6d84204f83 Add race and gender columns to cleaned csv files when those headers are missing 2023-11-06 20:30:51 -08:00
Nelson Jovel
1b82ff9413 chore: since it's now possible for there to be multiple district and dese id columns separated by a dash and a number, be more explicit when we only want to filter out survey item ids that end in a -1 2023-11-06 14:51:56 -08:00
rebuilt
b2fdbe5756 feat: We no longer trust the progress number that gets exported from Qualtrics. Instead, during the cleaning process, perform a manual count of the number of responses to filter out rows that don't meet the minimum threshold. 2023-10-27 15:12:58 -07:00
rebuilt
2fd56047d4 Add disaggregation by ELL 2023-10-24 12:51:12 -07:00
rebuilt
7bd7923d41 fix: ensure cleaner outputs columns for all survey items. Before the fix, if a survey item variant (ending in -1, e.g. s-tint-q1-1) did not have a matching survey item s-tint-q1, the resulting csv would not include that column 2023-10-24 10:24:57 -07:00
rebuilt
02a2c6cd97 Don't write a file if there's an empty dataset 2023-06-07 12:24:39 -07:00
rebuilt
dbfc9d1d3a Add automated data cleaning. Modify SurveyItemValues class to use regex instead of hard coded values. Produce a clean csv and a csv with all the removed values and columns with reason for removal. Add script for running cleaning for each project 2023-05-16 13:39:12 -07:00
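
The manual response count described in b2fdbe5756 could look roughly like the sketch below. It is an illustration only, not the repository's code: the function names, the header pattern, and the row-as-dictionary shape are all assumptions, and the pattern is modeled on the one item id quoted in the log (s-tint-q1 and its variant s-tint-q1-1).

```python
import re

# Hypothetical header pattern, based on the ids quoted in the commit log
# ("s-tint-q1" and the variant "s-tint-q1-1"). The real cleaner may differ.
SURVEY_ITEM_PATTERN = re.compile(r"^[a-z]-[a-z]+-q\d+(-1)?$")


def count_responses(row):
    """Count the survey item columns in a row that hold a non-blank answer."""
    return sum(
        1
        for header, value in row.items()
        if SURVEY_ITEM_PATTERN.match(header)
        and value is not None
        and str(value).strip() != ""
    )


def split_by_threshold(rows, minimum_responses):
    """Split rows into (kept, removed) using the counted answers,
    ignoring any exported progress column."""
    kept, removed = [], []
    for row in rows:
        if count_responses(row) >= minimum_responses:
            kept.append(row)
        else:
            removed.append(row)
    return kept, removed
```

Returning the removed rows alongside the kept ones mirrors the log's note that the cleaner produces both a clean csv and a csv of removed values.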