Commit graph

377 commits

Author SHA1 Message Date
Nelson Jovel
d4109fda6f chore: refactor Race out of survey_item_values 2023-12-20 12:27:53 -08:00
Nelson Jovel
3e4ef9cb08 chore: refactor Gender out out of survey_item_values row 2023-12-20 12:27:44 -08:00
Nelson Jovel
6e7e64c039 chore: rename counts_by_grade to enrollment_by_grade for clarity 2023-12-18 15:28:29 -08:00
Nelson Jovel
6a24d4fa23 chore: Make sure 'hispanic' column only gets applied when using SIS race information 2023-12-18 15:26:21 -08:00
Nelson Jovel
ce43f52bd5 feat: if the filename includes the words 'form' or 'part' add that to the resulting cleaned filename 2023-12-18 15:25:36 -08:00
Nelson Jovel
a15b01a3e1 fix: instead of looking for 'asian' at the start of a word, look for it
after a word boundary.  This means it still doesn't get confused with
caucasian and it's more flexible whan asian appears inside other text
such as 'Caucasian and Asian and Black'
2023-12-08 14:22:43 -08:00
Nelson Jovel
8a0ba0dbea chore: various fixes for race and gender categorization during cleaning.
Also add tests for race and gender categorization
2023-12-08 14:22:33 -08:00
Nelson Jovel
2ef24caf70 Lower threshold for the number of valid student responses from 17 to 11 2023-12-06 13:56:14 -08:00
Nelson Jovel
b983f1e144 chore: fix categorization of gender 2023-12-01 15:32:48 -08:00
Nelson Jovel
f27a590c5a Convert gender and race text into qualtrics codes during cleaning. Abide by 'prefer not to disclose' for self reported race. Give priority to self reported data but use SIS information as backup 2023-11-30 20:57:04 -08:00
Nelson Jovel
97ddb09167 chore: add test for checking duplicate headers during cleaning process 2023-11-09 14:50:51 -08:00
Nelson Jovel
6d84204f83 Add race and gender columns to cleaned cvs files when those headers are
missing
2023-11-06 20:30:51 -08:00
Nelson Jovel
a3f9e46414 chore: when searching for dese id, split up pattern so that to be more explicit about the order in which to search out the columns that might have the dese ID we're looking for. 2023-11-06 13:13:37 -08:00
rebuilt
019b954ffa feat: load student responses in the same pass as loading the survey responses
chore: remove student loader since loading students is now done with the survey response loader
2023-11-02 11:38:03 -07:00
rebuilt
9110be31c6 chore: fix failing test 2023-11-02 11:06:44 -07:00
rebuilt
b2fdbe5756 feat: We no longer trust the progress number that gets exported from qualtrics. Instead during the cleaning progress, perform a manual count of the number of responses to filter out rows that don't meet the minimum threshold. 2023-10-27 15:12:58 -07:00
rebuilt
e45a4f96dd last commit 2023-10-26 13:29:54 -07:00
rebuilt
ef44c41965 feat: add special education disaggregation 2023-10-24 13:05:57 -07:00
rebuilt
18ab51c860 chore: upgrade to rails 7.1.
upgrade rspec

fix failing tests

upgrade devise
2023-10-24 13:04:05 -07:00
rebuilt
2fd56047d4 Add disaggregation by ELL 2023-10-24 12:51:12 -07:00
rebuilt
490522eb1e feat: support multiple columns for race and gender information 2023-10-24 10:27:39 -07:00
rebuilt
7bd7923d41 fix: ensure cleaner outputs columns for all survey items. Before the fix, if a survey item varient (ending in -1, ie s-tint-q1-1) did not have a matching survey item s-tint-q1, the resulting csv would not include that column 2023-10-24 10:24:57 -07:00
rebuilt
e58ea3f1af feat: Round up response rate. Survey response rates above 24.5 will now meet sufficiency 2023-10-24 09:07:47 -07:00
rebuilt
8cf9b4eeb8 Add test for income group 2023-10-24 09:06:05 -07:00
rebuilt
a97838c194 Reorder gender columns 2023-10-24 09:05:41 -07:00
rebuilt
2ac30bb107 feat: Add income table to the database. Add seeder for income. Add a reference to income from survey item response. Update the loader to import income data from the survey response csv. Refactor analyze controller to extract presenter. Add corresponding specs. Add income graph to analyze page 2023-10-24 09:05:27 -07:00
rebuilt
f230f76bc6 fix: Overall response rate was incorrectly using the updated_at date instead of the recorded date. Also, it was just using the last date for all academic years instead of the last date the survey was taken per academic year. 2023-08-17 07:06:13 -07:00
rebuilt
a4332f6a05 chore: remove outdated admin data loader file. We now use Dese::Loader to load school level data 2023-07-21 12:52:18 -07:00
rebuilt
23ddaed2ce feat: if admin data value is above 5, round down to 5 2023-07-21 12:51:18 -07:00
rebuilt
66f78dadf9 chore: Add cypress to codebase 2023-07-19 12:18:06 -07:00
rebuilt
a0a88d497d fix: we deleted non-lowell schools from the master list so the journey spec completely failed. This commit fixes the basic case of viewing the overview page. Navigation to other pages seems broken in the spec but not when running in development. I'm not sure if I'll continue using apparition since it's no longer in active development. I'll wait to fix the rest of the flow until I decide what test framework I'll continue using 2023-06-26 14:05:51 -07:00
rebuilt
846e4a8149 update tests for subcategory report 2023-06-26 11:38:41 -07:00
rebuilt
878ba08a22 fix: There was an n+1 problem where we looked up the list of schools for
every row. Now we query the list of schools just once per file
2023-06-26 11:38:33 -07:00
rebuilt
d025a83a2b chore: remove errant comment 2023-06-12 16:06:07 -07:00
rebuilt
a19a30f5f3 Fix: failing subcategory report specs 2023-06-09 21:25:22 -07:00
rebuilt
a1f72a5273 Add tests to codify behavior of weighted average logic 2023-06-08 15:24:32 -07:00
rebuilt
0f23053294 It's possible for admin data likert score values to be above 5. If that happens, we
cap the likert score at 5.   This was happening already at the scraper
level but it's also now being done by the admin data loader for safety.
Also make sure to just update admin data instead of deleting and
reloading all values each load. Add tests to confirm this behavior
2023-06-03 16:47:03 -07:00
rebuilt
e058c523b6 Missing progress or duration information does not result in a row removed in the cleaning process 2023-06-02 15:18:03 -07:00
rebuilt
2a3f69b813 The overall response rate is artifically lower because we are including the number of all the students at the school, not just the number of students that took the survey. Updated the overall response rate presenter to return the count of only the grades that took the student survey. 2023-05-24 11:22:17 -07:00
rebuilt
08c946a469 Add subcategory report 2023-05-23 18:59:05 -07:00
rebuilt
f590bc9dd5 Add subcategory report 2023-05-22 16:40:39 -07:00
rebuilt
a71ebbc4e4 Add Overall Response Rate 2023-05-22 16:03:34 +00:00
rebuilt
dbfc9d1d3a Add automated data cleaning. Modify SurveyItemValues class to use regex
instead of hard coded values.  Produce a clean csv and a csv with all
the removed values and columns with reason for removal. Add script for
running cleaning for each project
2023-05-16 13:39:12 -07:00
rebuilt
a9316f18cb Make sure to use apparition for system specs 2023-05-16 13:33:55 -07:00
rebuilt
9f21a76ba4 Fix response rates for small schools by accounting for expected minimum counts of responses less than 10 2023-05-12 12:34:42 -07:00
rebuilt
c33c00b845 Rename DataAvailability to Summary 2023-04-22 10:44:06 -07:00
rebuilt
0bd70ca452 Remove surveys table from codebase since it's no longer needed after changing logic surrounding response rates and score calculations 2023-04-22 10:36:00 -07:00
rebuilt
aa3a67406b Set a minimum threshold of 10 survey item responses to calculate scores per survey item 2023-04-17 15:18:10 -07:00
rebuilt
65b8599c6e Update logic for calculating student response rate. Remove references
to survey table.  We no longer check or keep track of the survey type.
Instead we look in the database to see if a survey item has at least 10
responses.  If it does, that survey item was presented to the respondent
and we count it, and all responses when calculating the response rate.

Remove response rate timestamp from caching logic because we no longer
add the response rate to the database. All response rates are calculated
on the fly

Update three_b_two scraper to use teacher only numbers

swap over to using https://profiles.doe.mass.edu/statereport/gradesubjectstaffing.aspx as the source of staffing information
2023-04-08 10:59:48 -07:00
rebuilt
ba018e8f10 specify which unknown class to use 2023-04-08 10:55:09 -07:00