rebuilt
22cc303a95
fix: There was an n+1 problem where we looked up the list of schools for
...
every row. Now we query the list of schools just once per file
2 years ago
rebuilt
784e23982e
feat: update survey_item_response table to indlude recorded date and import recorded date when loading responses
2 years ago
rebuilt
25a2698ac9
Fix: make sure values don't get reordered after copying over row values from survey item variants. This fixes a problem where cleaner would produce a row with likert scores that got shifted to align with the wrong column
3 years ago
rebuilt
f7c40c2da2
Fix: enable correct detection of student surveys types by rejecting any
...
headers ending with '-1' (the variants of standard questions)
3 years ago
rebuilt
2445642586
Don't write a file if there's an empty dataset
3 years ago
rebuilt
6b2bceceb6
Fix: Parse headers when they are surrounded by quotes. This helps load recent csv files correctly
3 years ago
rebuilt
ce76c979a4
Add scraper for 3B-i student/#courses ratio
3 years ago
rebuilt
904d0d2f2c
It's possible for admin data likert score values to be above 5. If that happens, we
...
cap the likert score at 5. This was happening already at the scraper
level but it's also now being done by the admin data loader for safety.
Also make sure to just update admin data instead of deleting and
reloading all values each load. Add tests to confirm this behavior
3 years ago
rebuilt
3589878700
Don't check standard deviation for early education surveys
3 years ago
rebuilt
89295f8832
Missing progress or duration information does not result in a row removed in the cleaning process
3 years ago
rebuilt
6022739f07
use district short name when writing filename
3 years ago
rebuilt
f749b96006
update response_date to recorded_date
3 years ago
rebuilt
9d0f8659f1
Adjust valid duration threshold of short form items
3 years ago
rebuilt
37e932e078
Fix regression in student loader
3 years ago
rebuilt
0dfc9726d0
Add automated data cleaning. Modify SurveyItemValues class to use regex
...
instead of hard coded values. Produce a clean csv and a csv with all
the removed values and columns with reason for removal. Add script for
running cleaning for each project
3 years ago
rebuilt
a066f464c7
fix failing tests
3 years ago
rebuilt
f1022728fa
Fix problem with dese scraper lumping in 2021-22 data as 2022-23 data.
...
Deleted unused csvs. Turned off puts statements in admin loader.
Remove old, now unused admin data loader class.
3 years ago
rebuilt
798ba1f340
Only return files in sftp directory, not other directories
3 years ago
rebuilt
001d3083c8
Calculate response rate on the fly instead of looking it up from the db
...
when calculating response rates.
3 years ago
rebuilt
cee7aa4c59
Remove unused filename
3 years ago
rebuilt
d3a28f7635
Fix ThreeATwo scraper
3 years ago
rebuilt
07ed8dd259
Update logic for calculating student response rate. Remove references
...
to survey table. We no longer check or keep track of the survey type.
Instead we look in the database to see if a survey item has at least 10
responses. If it does, that survey item was presented to the respondent
and we count it, and all responses when calculating the response rate.
Remove response rate timestamp from caching logic because we no longer
add the response rate to the database. All response rates are calculated
on the fly
Update three_b_two scraper to use teacher only numbers
swap over to using https://profiles.doe.mass.edu/statereport/gradesubjectstaffing.aspx as the source of staffing information
3 years ago
rebuilt
357c7427d1
Batch imports for staffing data
3 years ago
rebuilt
d272e48adc
load total students and batch importing records
3 years ago
rebuilt
cf2b2433e9
Use an sftp uri unique to MCIEA
3 years ago
rebuilt
a5da0fb0c6
Fix bug with not all survey responses loading when using sftp loader
3 years ago
rebuilt
ef087a6cd0
update default folder for survey responses
3 years ago
rebuilt
47c1856281
Process 1000 rows at a time to limit memory usage in production
3 years ago
rebuilt
640de1c8df
Don't print sftptogo_url
3 years ago
rebuilt
380048f5cd
Load survey responses from sftp folder
3 years ago
rebuilt
6f986ff8a1
add proof of concept sftp downloader
3 years ago
rebuilt
d5533bb756
Support survey item variants. Survey item id's in the format s-sbel-q1-1 will load into s-sbel-q1
3 years ago
rebuilt
ed421296c7
Fix bug that caused an incorrect score of 1 to be assigned for admin
...
data item related to parity index of teachers of color to students of
color. Now if there's not enough data to calculate a score, it produces
'NA' instead of 1. Also added admin data for a-reso-i1 and a-curv-i1
3 years ago
rebuilt
7b9abe3793
Scrape latest data from DESE
3 years ago
rebuilt
850ef9d1cd
Make sure demographic loader doesn't break when changing the designation of a Race
3 years ago
rebuilt
53a08d41bf
Change label for unknown race
3 years ago
rebuilt
a1b7db9b2e
Add ability to load student demographic information for only lowell
3 years ago
rebuilt
d49cf918ac
Add a way to seed only lowell schools
3 years ago
rebuilt
fc9e73f656
make sure survey responses for Butler get loaded correctly
3 years ago
rebuilt
9bbba64e9e
Add rules to the survey data loader to allow skipping non-lowell schools
3 years ago
Nelson Jovel
43bc55ed32
Query for survey items once instead of on each row
3 years ago
rebuilt
83fa255e5d
Make utility method private
3 years ago
rebuilt
3ca270e3b1
Fix more malformed csv errors
3 years ago
rebuilt
360579852c
Send fewer sql queries when loading survey responses
3 years ago
rebuilt
9ae94a2dae
further reduce batch size to avoid exceeding memory quota
3 years ago
rebuilt
bb472ce6c3
Reduce number of rows to process to avoid exceeding memory limit on heroku
3 years ago
rebuilt
4626d43e3e
Add changes to demographic loader so it the tests don't fail when run in parallel
3 years ago
rebuilt
7d7bee6498
Add gender disagreggation
3 years ago
rebuilt
a1f3dfaa33
Load gender information to survey item responses
3 years ago
rebuilt
8cdd1b6285
Create gender class. Load genders to database from demographics file
3 years ago