Commit graph

346 commits

Author SHA1 Message Date
Nelson Jovel
6f265302ea feat: if the filename includes the words 'form' or 'part' add that to the resulting cleaned filename 2023-12-18 15:58:40 -08:00
Nelson Jovel
d79635054c fix: instead of looking for 'asian' at the start of a word, look for it
after a word boundary. This means it still doesn't get confused with
caucasian and it's more flexible when asian appears inside other text
such as 'Caucasian and Asian and Black'
2023-12-18 15:44:51 -08:00
Nelson Jovel
fe039e3d04 chore: various fixes for race and gender categorization during cleaning.
Also add tests for race and gender categorization
2023-12-18 15:38:31 -08:00
Nelson Jovel
054e5603b7 chore: fix failing legacy tests 2023-11-03 10:18:46 -07:00
rebuilt
a07728fcd6 feat: We no longer trust the progress number that gets exported from Qualtrics. Instead, during the cleaning process, perform a manual count of the number of responses to filter out rows that don't meet the minimum threshold. 2023-10-27 15:13:17 -07:00
rebuilt
9d680c5159 chore: upgrade to rails 7.1.
upgrade rspec

fix failing tests

upgrade devise
2023-10-17 12:34:11 -07:00
rebuilt
acfdaf5587 feat: add special education disaggregation 2023-10-17 12:29:00 -07:00
rebuilt
5bdffec8f9 Add disaggregation by ELL 2023-10-17 12:06:19 -07:00
rebuilt
245cde85cd Add disaggregation by ELL 2023-10-17 11:18:47 -07:00
rebuilt
5ea01ac916 Reorder gender columns 2023-08-22 12:22:51 -07:00
rebuilt
aafaeeaf41 fix: Overall response rate was incorrectly using the updated_at date instead of the recorded date. Also, it was just using the last date for all academic years instead of the last date the survey was taken per academic year. 2023-08-15 16:13:47 -07:00
rebuilt
6db93cb116 feat: Rename income labels to 'Economically Disadvantaged' and 'Not Economically Disadvantaged' 2023-08-09 12:13:04 -07:00
rebuilt
76ebcc6ef3 feat: Add income table to the database. Add seeder for income. Add a reference to income from survey item response. Update the loader to import income data from the survey response csv. Refactor analyze controller to extract presenter. Add corresponding specs. Add income graph to analyze page 2023-08-08 15:52:12 -07:00
rebuilt
f6f56367fc The overall response rate is artificially lower because we are including the number of all the students at the school, not just the number of students that took the survey. Updated the overall response rate presenter to return the count of only the grades that took the student survey. 2023-08-08 10:06:39 -07:00
rebuilt
0a5500ecc9 feat: Add overall response rate 2023-08-08 10:01:48 -07:00
rebuilt
e42aa05324 chore: remove outdated admin data loader file. We now use Dese::Loader to load school level data 2023-07-21 12:58:01 -07:00
rebuilt
8f276a5f1a feat: if admin data value is above 5, round down to 5 2023-07-21 12:55:39 -07:00
rebuilt
0c18aa2083 chore: Add cypress to codebase 2023-07-19 12:43:56 -07:00
rebuilt
d09e61e1e3 chore: disable flaky legacy test 2023-06-26 14:27:19 -07:00
rebuilt
4e85d756e6 fix: Flaky test might be related to not waiting for go button to appear. Add line to wait for go button 2023-06-26 14:20:56 -07:00
rebuilt
3680f5bc48 update tests for subcategory report 2023-06-26 11:38:04 -07:00
rebuilt
22cc303a95 fix: There was an n+1 problem where we looked up the list of schools for
every row. Now we query the list of schools just once per file
2023-06-26 11:25:32 -07:00
rebuilt
784e23982e feat: update survey_item_response table to include recorded date and import recorded date when loading responses 2023-06-23 11:28:14 -07:00
rebuilt
acb01ec92f Add tests to codify behavior of weighted average logic 2023-06-21 04:45:53 +00:00
rebuilt
bb172aa7d0 chore: remove errant comment 2023-06-12 16:07:21 -07:00
rebuilt
ce035f3cbd Fix: failing subcategory report specs 2023-06-09 21:25:40 -07:00
rebuilt
677cee83e4 turn off three b one scraper spec 2023-06-08 15:31:49 -07:00
rebuilt
7dbb025357 Revert "Add tests to codify behavior of weighted average logic"
This reverts commit be26efbeca.
2023-06-08 15:26:29 -07:00
rebuilt
be26efbeca Add tests to codify behavior of weighted average logic 2023-06-08 15:25:41 -07:00
rebuilt
ce76c979a4 Add scraper for 3B-i student/#courses ratio 2023-06-05 11:39:08 -07:00
rebuilt
904d0d2f2c It's possible for admin data Likert score values to be above 5. If that happens, we
cap the Likert score at 5. This was happening already at the scraper
level but it's also now being done by the admin data loader for safety.
Also make sure to just update admin data instead of deleting and
reloading all values each load. Add tests to confirm this behavior
2023-06-03 16:06:50 -07:00
rebuilt
89295f8832 Missing progress or duration information does not result in a row removed in the cleaning process 2023-06-02 15:13:53 -07:00
rebuilt
f749b96006 update response_date to recorded_date 2023-05-31 17:07:31 -07:00
rebuilt
d031179dc0 add changes to duration for short form items 2023-05-31 17:06:47 -07:00
rebuilt
9d0f8659f1 Adjust valid duration threshold of short form items 2023-05-26 19:01:33 -07:00
rebuilt
0dfc9726d0 Add automated data cleaning. Modify SurveyItemValues class to use regex
instead of hard-coded values. Produce a clean csv and a csv with all
the removed values and columns with reason for removal. Add script for
running cleaning for each project
2023-05-24 11:59:53 -07:00
rebuilt
b5dc933187 Add subcategory report 2023-05-23 18:43:55 -07:00
rebuilt
6b2555c1f0 Fix response rates for small schools by accounting for expected minimum counts of responses less than 10 2023-05-12 12:27:46 -07:00
rebuilt
a066f464c7 fix failing tests 2023-04-30 16:35:08 -07:00
rebuilt
a21a11f809 specify gender column 2023-04-30 12:19:31 -07:00
rebuilt
0ed6cbe6b4 Rename DataAvailability to Summary 2023-04-22 14:24:39 -07:00
rebuilt
09963c3323 Remove surveys table from codebase since it's no longer needed after changing logic surrounding response rates and score calculations 2023-04-22 14:21:48 -07:00
rebuilt
30e006ee0d Set a minimum threshold of 10 survey item responses to calculate scores per survey item 2023-04-22 14:00:51 -07:00
rebuilt
07ed8dd259 Update logic for calculating student response rate. Remove references
to survey table. We no longer check or keep track of the survey type.
Instead we look in the database to see if a survey item has at least 10
responses. If it does, that survey item was presented to the respondent
and we count it, and all responses, when calculating the response rate.

Remove response rate timestamp from caching logic because we no longer
add the response rate to the database. All response rates are calculated
on the fly

Update three_b_two scraper to use teacher only numbers

swap over to using https://profiles.doe.mass.edu/statereport/gradesubjectstaffing.aspx as the source of staffing information
2023-04-22 14:00:20 -07:00
rebuilt
357c7427d1 Batch imports for staffing data 2023-04-22 13:19:34 -07:00
rebuilt
d272e48adc load total students and batch importing records 2023-04-22 13:18:50 -07:00
Nelson Jovel
98ba54c305 Convert dese::loader from using seeder to factories 2023-04-22 13:15:13 -07:00
Nelson Jovel
9228e3dbaf convert student loader from seeding to factories 2023-04-22 13:15:01 -07:00
rebuilt
283845edd3 Add tests to make sure teacher survey items don't pollute the graphs for
students of unknown genders
2023-04-05 16:21:41 -07:00
rebuilt
7208cbb661 Add tests to make sure teacher survey items don't pollute the graphs for
students of unknown genders
2023-04-05 15:45:54 -07:00
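
The word-boundary change described in commit d79635054c can be illustrated with a minimal sketch. It is written in Python purely for illustration; the project is a Rails application and its actual pattern does not appear in this log. The names `ANCHORED`, `BOUNDARY`, and `mentions_asian` are hypothetical, and the idea that the earlier check was anchored to the start of the text is an assumption, not something the log states.

```python
import re

# Assumed earlier behaviour (hypothetical): match only when the text starts with "asian".
ANCHORED = re.compile(r"^asian", re.IGNORECASE)
# Behaviour described in the commit: match "asian" directly after a word boundary.
BOUNDARY = re.compile(r"\basian", re.IGNORECASE)


def mentions_asian(text: str, pattern: re.Pattern = BOUNDARY) -> bool:
    """Return True when the pattern finds 'asian' anywhere in the text."""
    return pattern.search(text) is not None


# Compare both patterns on the kinds of values mentioned in the commit message.
for sample in ("Asian", "Caucasian", "Caucasian and Asian and Black"):
    print(sample, mentions_asian(sample, ANCHORED), mentions_asian(sample))
```

In "Caucasian" the letters "asian" follow another letter, so there is no word boundary in front of them and neither pattern matches. In 'Caucasian and Asian and Black' only the word-boundary pattern matches, because "Asian" follows a space rather than the start of the text.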