Commit graph

34 commits

Author SHA1 Message Date
Nelson Jovel
6a24d4fa23 chore: Make sure 'hispanic' column only gets applied when using SIS race information 2023-12-18 15:26:21 -08:00
Nelson Jovel
8a0ba0dbea chore: various fixes for race and gender categorization during cleaning.
Also add tests for race and gender categorization
2023-12-08 14:22:33 -08:00
Nelson Jovel
8364d26e11 feat: Support SIS categories for race in the form of 'White and Asian'
or 'Black, Asian, and white'
2023-12-08 09:36:48 -08:00
Nelson Jovel
2ef24caf70 Lower threshold for the number of valid student responses from 17 to 11 2023-12-06 13:56:14 -08:00
Nelson Jovel
6365cad71b chore: refactor code to put logic into models 2023-12-06 13:55:37 -08:00
Nelson Jovel
f27a590c5a Convert gender and race text into qualtrics codes during cleaning. Abide by 'prefer not to disclose' for self reported race. Give priority to self reported data but use SIS information as backup 2023-11-30 20:57:04 -08:00
Nelson Jovel
11617abd89 fix: make sure to grab the 'Gender- Qcode' column 2023-11-07 10:15:16 -08:00
Nelson Jovel
6d84204f83 Add race and gender columns to cleaned csv files when those headers are
missing
2023-11-06 20:30:51 -08:00
Nelson Jovel
1b82ff9413 chore: since it's now possible for there to be multiple district and dese id columns separated by a dash and a number, be more explicit when we only want to filter out survey item ids that end in a -1 2023-11-06 14:51:56 -08:00
Nelson Jovel
a3f9e46414 chore: when searching for dese id, split up the pattern to be more explicit about the order in which to search out the columns that might have the dese ID we're looking for. 2023-11-06 13:13:37 -08:00
rebuilt
019b954ffa feat: load student responses in the same pass as loading the survey responses
chore: remove student loader since loading students is now done with the survey response loader
2023-11-02 11:38:03 -07:00
rebuilt
b2fdbe5756 feat: We no longer trust the progress number that gets exported from qualtrics. Instead, during the cleaning process, perform a manual count of the number of responses to filter out rows that don't meet the minimum threshold. 2023-10-27 15:12:58 -07:00
rebuilt
ef44c41965 feat: add special education disaggregation 2023-10-24 13:05:57 -07:00
rebuilt
2fd56047d4 Add disaggregation by ELL 2023-10-24 12:51:12 -07:00
rebuilt
490522eb1e feat: support multiple columns for race and gender information 2023-10-24 10:27:39 -07:00
rebuilt
ce5340648f fix: hide scores on analyze page for scores that don't meet the student threshold of 25% 2023-10-24 10:27:25 -07:00
rebuilt
7bd7923d41 fix: ensure cleaner outputs columns for all survey items. Before the fix, if a survey item variant (ending in -1, e.g. s-tint-q1-1) did not have a matching survey item s-tint-q1, the resulting csv would not include that column 2023-10-24 10:24:57 -07:00
rebuilt
2c9df34fac fix: start fixing problem with variants not getting added to the cleaned csv 2023-10-24 09:55:43 -07:00
rebuilt
2ac30bb107 feat: Add income table to the database. Add seeder for income. Add a reference to income from survey item response. Update the loader to import income data from the survey response csv. Refactor analyze controller to extract presenter. Add corresponding specs. Add income graph to analyze page 2023-10-24 09:05:27 -07:00
rebuilt
c4b90a711c Fix: make sure values don't get reordered after copying over row values from survey item variants. This fixes a problem where cleaner would produce a row with likert scores that got shifted to align with the wrong column 2023-06-08 09:27:22 -07:00
rebuilt
5f66d3d99a Fix: enable correct detection of student survey types by rejecting any
headers ending with '-1' (the variants of standard questions)
2023-06-07 12:25:04 -07:00
rebuilt
0f23053294 It's possible for admin data likert score values to be above 5. If that happens, we
cap the likert score at 5. This was happening already at the scraper
level but it's also now being done by the admin data loader for safety.
Also make sure to just update admin data instead of deleting and
reloading all values each load. Add tests to confirm this behavior
2023-06-03 16:47:03 -07:00
rebuilt
c789c46032 Don't check standard deviation for early education surveys 2023-06-02 16:09:12 -07:00
rebuilt
e058c523b6 Missing progress or duration information does not result in a row removed in the cleaning process 2023-06-02 15:18:03 -07:00
rebuilt
a71ebbc4e4 Add Overall Response Rate 2023-05-22 16:03:34 +00:00
rebuilt
dbfc9d1d3a Add automated data cleaning. Modify SurveyItemValues class to use regex
instead of hard coded values.  Produce a clean csv and a csv with all
the removed values and columns with reason for removal. Add script for
running cleaning for each project
2023-05-16 13:39:12 -07:00
rebuilt
05bff3ee89 Memoize schools in SurveyItemValues and academic_years in AcademicYear
for performance improvement
2023-03-29 15:47:20 -07:00
rebuilt
282a671531 Change survey data loader spec to use factorybot objects instead of loading seeds. Change databasecleaner to use transaction. Add back babel-preset dependency to fix failing javascript test in production. 2023-03-29 15:45:48 -07:00
rebuilt
01851d8527 Make sure blank grades in csv load as nils instead of zeros 2023-02-10 13:44:48 -08:00
rebuilt
3a744db975 Start adding kindergarten 2023-01-17 15:27:15 -08:00
rebuilt
03c6bff7bb Load survey responses from sftp folder 2023-01-12 17:04:57 -08:00
rebuilt
eeeb4f5d14 Add Stoklosa survey responses 2023-01-12 17:02:35 -08:00
rebuilt
d5533bb756 Support survey item variants. Survey item ids in the format s-sbel-q1-1 will load into s-sbel-q1 2023-01-12 17:02:05 -08:00
rebuilt
a1b7db9b2e Add ability to load student demographic information for only lowell 2022-12-03 11:33:11 -08:00