rebuilt
e42aa05324
chore: remove outdated admin data loader file. We now use Dese::Loader to load school level data
2023-07-21 12:58:01 -07:00
rebuilt
8f276a5f1a
feat: if admin data value is above 5, round down to 5
2023-07-21 12:55:39 -07:00
rebuilt
22cc303a95
fix: There was an n+1 problem where we looked up the list of schools for
...
every row. Now we query the list of schools just once per file
2023-06-26 11:25:32 -07:00
rebuilt
784e23982e
feat: update survey_item_response table to indlude recorded date and import recorded date when loading responses
2023-06-23 11:28:14 -07:00
rebuilt
25a2698ac9
Fix: make sure values don't get reordered after copying over row values from survey item variants. This fixes a problem where cleaner would produce a row with likert scores that got shifted to align with the wrong column
2023-06-08 09:27:44 -07:00
rebuilt
f7c40c2da2
Fix: enable correct detection of student surveys types by rejecting any
...
headers ending with '-1' (the variants of standard questions)
2023-06-07 12:40:14 -07:00
rebuilt
2445642586
Don't write a file if there's an empty dataset
2023-06-07 12:40:03 -07:00
rebuilt
6b2bceceb6
Fix: Parse headers when they are surrounded by quotes. This helps load recent csv files correctly
2023-06-06 18:29:47 -07:00
rebuilt
ce76c979a4
Add scraper for 3B-i student/#courses ratio
2023-06-05 11:39:08 -07:00
rebuilt
904d0d2f2c
It's possible for admin data likert score values to be above 5. If that happens, we
...
cap the likert score at 5. This was happening already at the scraper
level but it's also now being done by the admin data loader for safety.
Also make sure to just update admin data instead of deleting and
reloading all values each load. Add tests to confirm this behavior
2023-06-03 16:06:50 -07:00
rebuilt
3589878700
Don't check standard deviation for early education surveys
2023-06-02 16:10:35 -07:00
rebuilt
89295f8832
Missing progress or duration information does not result in a row removed in the cleaning process
2023-06-02 15:13:53 -07:00
rebuilt
6022739f07
use district short name when writing filename
2023-05-31 17:12:40 -07:00
rebuilt
f749b96006
update response_date to recorded_date
2023-05-31 17:07:31 -07:00
rebuilt
9d0f8659f1
Adjust valid duration threshold of short form items
2023-05-26 19:01:33 -07:00
rebuilt
37e932e078
Fix regression in student loader
2023-05-24 12:04:39 -07:00
rebuilt
0dfc9726d0
Add automated data cleaning. Modify SurveyItemValues class to use regex
...
instead of hard coded values. Produce a clean csv and a csv with all
the removed values and columns with reason for removal. Add script for
running cleaning for each project
2023-05-24 11:59:53 -07:00
rebuilt
a066f464c7
fix failing tests
2023-04-30 16:35:08 -07:00
rebuilt
f1022728fa
Fix problem with dese scraper lumping in 2021-22 data as 2022-23 data.
...
Deleted unused csvs. Turned off puts statements in admin loader.
Remove old, now unused admin data loader class.
2023-04-30 12:04:20 -07:00
rebuilt
798ba1f340
Only return files in sftp directory, not other directories
2023-04-30 11:55:45 -07:00
rebuilt
001d3083c8
Calculate response rate on the fly instead of looking it up from the db
...
when calculating response rates.
2023-04-22 14:03:22 -07:00
rebuilt
cee7aa4c59
Remove unused filename
2023-04-22 14:01:21 -07:00
rebuilt
d3a28f7635
Fix ThreeATwo scraper
2023-04-22 14:01:01 -07:00
rebuilt
07ed8dd259
Update logic for calculating student response rate. Remove references
...
to survey table. We no longer check or keep track of the survey type.
Instead we look in the database to see if a survey item has at least 10
responses. If it does, that survey item was presented to the respondent
and we count it, and all responses when calculating the response rate.
Remove response rate timestamp from caching logic because we no longer
add the response rate to the database. All response rates are calculated
on the fly
Update three_b_two scraper to use teacher only numbers
swap over to using https://profiles.doe.mass.edu/statereport/gradesubjectstaffing.aspx as the source of staffing information
2023-04-22 14:00:20 -07:00
rebuilt
357c7427d1
Batch imports for staffing data
2023-04-22 13:19:34 -07:00
rebuilt
d272e48adc
load total students and batch importing records
2023-04-22 13:18:50 -07:00
rebuilt
cf2b2433e9
Use an sftp uri unique to MCIEA
2023-02-19 19:36:14 -08:00
rebuilt
a5da0fb0c6
Fix bug with not all survey responses loading when using sftp loader
2023-02-19 19:36:14 -08:00
rebuilt
ef087a6cd0
update default folder for survey responses
2023-02-19 19:36:14 -08:00
rebuilt
47c1856281
Process 1000 rows at a time to limit memory usage in production
2023-02-19 19:36:14 -08:00
rebuilt
640de1c8df
Don't print sftptogo_url
2023-02-19 19:36:14 -08:00
rebuilt
380048f5cd
Load survey responses from sftp folder
2023-02-19 19:36:14 -08:00
rebuilt
6f986ff8a1
add proof of concept sftp downloader
2023-02-19 19:36:05 -08:00
rebuilt
d5533bb756
Support survey item variants. Survey item id's in the format s-sbel-q1-1 will load into s-sbel-q1
2023-01-12 17:02:05 -08:00
rebuilt
ed421296c7
Fix bug that caused an incorrect score of 1 to be assigned for admin
...
data item related to parity index of teachers of color to students of
color. Now if there's not enough data to calculate a score, it produces
'NA' instead of 1. Also added admin data for a-reso-i1 and a-curv-i1
2023-01-06 17:43:48 -08:00
rebuilt
7b9abe3793
Scrape latest data from DESE
2022-12-20 15:26:11 -08:00
rebuilt
850ef9d1cd
Make sure demographic loader doesn't break when changing the designation of a Race
2022-12-12 12:03:53 -08:00
rebuilt
53a08d41bf
Change label for unknown race
2022-12-10 15:44:56 -08:00
rebuilt
a1b7db9b2e
Add ability to load student demographic information for only lowell
2022-12-03 11:33:11 -08:00
rebuilt
d49cf918ac
Add a way to seed only lowell schools
2022-12-03 10:23:02 -08:00
rebuilt
fc9e73f656
make sure survey responses for Butler get loaded correctly
2022-12-02 14:11:54 -08:00
rebuilt
9bbba64e9e
Add rules to the survey data loader to allow skipping non-lowell schools
2022-12-01 16:15:45 -08:00
Nelson Jovel
43bc55ed32
Query for survey items once instead of on each row
2022-11-28 11:58:58 -08:00
rebuilt
83fa255e5d
Make utility method private
2022-11-25 03:50:17 -08:00
rebuilt
3ca270e3b1
Fix more malformed csv errors
2022-11-25 03:09:00 -08:00
rebuilt
360579852c
Send fewer sql queries when loading survey responses
2022-11-24 03:14:20 -08:00
rebuilt
9ae94a2dae
further reduce batch size to avoid exceeding memory quota
2022-11-23 13:54:11 -08:00
rebuilt
bb472ce6c3
Reduce number of rows to process to avoid exceeding memory limit on heroku
2022-11-23 12:32:31 -08:00
rebuilt
4626d43e3e
Add changes to demographic loader so it the tests don't fail when run in parallel
2022-11-03 14:46:45 -07:00
rebuilt
7d7bee6498
Add gender disagreggation
2022-10-27 15:56:50 -07:00