Commit graph

22 commits

Author SHA1 Message Date
Nelson Jovel
b4c4522956 fix: tests 2024-11-26 10:18:17 -08:00
Nelson Jovel
308efbc6dc fix: fix merges from main 2024-11-19 19:23:22 -08:00
Nelson Jovel
0bbd7c5cf1 feat: add parent survey gauges 2024-09-27 13:25:10 -07:00
Nelson Jovel
d041a5a567 chore: During cleaning, stop execution if grade column isn't found. Also stop execution if a duplicate header is found. Turn off spec for duplicate header check 2024-05-23 12:52:34 -07:00
Nelson Jovel
2561fa28fc feat: Split academic year into seasons if the academic year's range is
initialized with a season, i.e. "2024-25 Fall".  Update scapers for
admin data, enrollment and staffing to use the new range standard
correctly.   Update the loaders for admin data, enrollment and staffing
so that it populates all seasons in a given year.  So admin data for
2024-25 gets loaded into "2024-25 Fall" and "2024-25 Spring".  Add tests
for the new range format.  Set the default cutoff for the start of Spring season will be the last Sunday in February
2024-04-27 14:05:02 -07:00
Nelson Jovel
67ffc996a8 Revert "Split academic year into seasons if the academic year's range is"
This reverts commit a5d4cccb37.
2024-04-26 13:48:30 -07:00
Nelson Jovel
a5d4cccb37 Split academic year into seasons if the academic year's range is
initialized with a season, i.e. "2024-25 Fall".  Update scapers for
admin data, enrollment and staffing to use the new range standard
correctly.   Update the loaders for admin data, enrollment and staffing
so that it populates all seasons in a given year.  So admin data for
2024-25 gets loaded into "2024-25 Fall" and "2024-25 Spring".  Add tests
for the new range format.  Set the default cutoff for the start of Spring season will be the last Sunday in February
2024-04-26 13:31:50 -07:00
6fac759ec2 Faster admin data loader + rename School.school_hash 2024-04-22 15:43:54 -04:00
Nelson Jovel
c3cb05701f feat: Support two date formats: ISO 8601 and the standard US date format
used in google sheets
2024-02-27 11:57:20 -08:00
Nelson Jovel
aa7af11a4e fix: fix failing test 2024-02-23 11:54:57 -08:00
Nelson Jovel
cc8ed48204 fix: no longer support 'form' in filename when cleaning. Only look for 'part X' and add that to the filename if it exists 2024-02-22 12:02:25 -08:00
Nelson Jovel
3e4ef9cb08 chore: refactor Gender out out of survey_item_values row 2023-12-20 12:27:44 -08:00
Nelson Jovel
ce43f52bd5 feat: if the filename includes the words 'form' or 'part' add that to the resulting cleaned filename 2023-12-18 15:25:36 -08:00
Nelson Jovel
f27a590c5a Convert gender and race text into qualtrics codes during cleaning. Abide by 'prefer not to disclose' for self reported race. Give priority to self reported data but use SIS information as backup 2023-11-30 20:57:04 -08:00
Nelson Jovel
97ddb09167 chore: add test for checking duplicate headers during cleaning process 2023-11-09 14:50:51 -08:00
Nelson Jovel
6d84204f83 Add race and gender columns to cleaned cvs files when those headers are
missing
2023-11-06 20:30:51 -08:00
Nelson Jovel
a3f9e46414 chore: when searching for dese id, split up pattern so that to be more explicit about the order in which to search out the columns that might have the dese ID we're looking for. 2023-11-06 13:13:37 -08:00
rebuilt
b2fdbe5756 feat: We no longer trust the progress number that gets exported from qualtrics. Instead during the cleaning progress, perform a manual count of the number of responses to filter out rows that don't meet the minimum threshold. 2023-10-27 15:12:58 -07:00
rebuilt
2fd56047d4 Add disaggregation by ELL 2023-10-24 12:51:12 -07:00
rebuilt
7bd7923d41 fix: ensure cleaner outputs columns for all survey items. Before the fix, if a survey item varient (ending in -1, ie s-tint-q1-1) did not have a matching survey item s-tint-q1, the resulting csv would not include that column 2023-10-24 10:24:57 -07:00
rebuilt
a71ebbc4e4 Add Overall Response Rate 2023-05-22 16:03:34 +00:00
rebuilt
dbfc9d1d3a Add automated data cleaning. Modify SurveyItemValues class to use regex
instead of hard coded values.  Produce a clean csv and a csv with all
the removed values and columns with reason for removal. Add script for
running cleaning for each project
2023-05-16 13:39:12 -07:00