Commit graph

227 commits

Author SHA1 Message Date
rebuilt
30285efd69 It's possible for admin data likert score values to be above 5. If that happens, we
cap the likert score at 5.   This was happening already at the scraper
level but it's also now being done by the admin data loader for safety.
Also make sure to just update admin data instead of deleting and
reloading all values each load. Add tests to confirm this behavior
2023-06-03 17:14:41 -07:00
rebuilt
abe7a8804c Don't check standard deviation for early education surveys 2023-06-02 16:05:45 -07:00
rebuilt
9aeb5f92af Missing progress or duration information does not result in a row removed in the cleaning process 2023-06-02 15:23:21 -07:00
rebuilt
e3ae12b425 update response_date to recorded_date 2023-05-31 16:57:47 -07:00
rebuilt
a30921ce06 Add New Jersey enrollment and staffing data 2023-05-28 17:11:52 -07:00
rebuilt
93d087a5de Use short district name for cleaned csv 2023-05-28 17:11:27 -07:00
rebuilt
8ef8cfce58 Adjust valid duration threshold of short form items 2023-05-26 18:30:44 -07:00
rebuilt
d6b2521883 Fix regression in student loader 2023-05-19 13:48:16 -07:00
rebuilt
4509c157fa Add automated data cleaning. Modify SurveyItemValues class to use regex
instead of hard coded values.  Produce a clean csv and a csv with all
the removed values and columns with reason for removal. Add script for
running cleaning for each project
2023-05-16 13:38:29 -07:00
rebuilt
359e266a6c Remove unused TODOs 2023-04-27 15:47:45 -07:00
rebuilt
3f2a7dff50 Fix problem with dese scraper lumping in 2021-22 data as 2022-23 data.
Deleted unused csvs.  Turned off puts statements in admin loader.
Remove old, now unused admin data loader class.
2023-04-27 15:43:17 -07:00
rebuilt
4b7d2e6feb Only return files in sftp directory, not other directories 2023-04-26 15:55:10 -07:00
rebuilt
f68b650d8b Calculate response rate on the fly instead of looking it up from the db
when calculating response rates.
2023-04-20 20:27:01 -07:00
rebuilt
10cedd2310 Remove unused filename 2023-04-20 17:42:27 -07:00
rebuilt
5b2bae0d92 Fix ThreeATwo scraper 2023-04-19 19:20:54 -07:00
rebuilt
128748addd Update logic for calculating student response rate. Remove references
to survey table.  We no longer check or keep track of the survey type.
Instead we look in the database to see if a survey item has at least 10
responses.  If it does, that survey item was presented to the respondent
and we count it, and all responses when calculating the response rate.

Remove response rate timestamp from caching logic because we no longer
add the response rate to the database. All response rates are calculated
on the fly

Update three_b_two scraper to use teacher only numbers

swap over to using https://profiles.doe.mass.edu/statereport/gradesubjectstaffing.aspx as the source of staffing information
2023-04-18 13:59:29 -07:00
rebuilt
b250ebe415 Memoize schools in SurveyItemValues and academic_years in AcademicYear
for performace improvement
2023-03-28 03:38:52 -07:00
rebuilt
c15cb7b483 Change survey data loader spec to use factorybot objects instead of loading seeds. Change databasecleaner to use transaction. Add back babel-preset dependency to fix failing javascript test in production. 2023-03-28 03:06:17 -07:00
rebuilt
db3e4f1178 Set config variable for SFTP_TO_GO server 2023-03-20 13:05:43 -07:00
rebuilt
5e88c8be1b Transition from Lowell to ECP. Update text and links to reflect the new Organization. 2023-03-16 16:25:09 -07:00
rebuilt
4c4ccc01cc Merge branch 'rpp-response-rate' to bring in changes to test files 2023-03-15 15:00:25 -07:00
rebuilt
6b31fa9115 Batch imports for staffing data 2023-03-08 04:51:15 -08:00
rebuilt
d059177f0c load total students and batch importing records 2023-03-08 04:48:25 -08:00
rebuilt
2362d884eb Convert admin data loader from using seeder to using factory 2023-03-07 11:12:52 -08:00
rebuilt
d0219217de Convert response rate loader spec from using the seeder to using the factory 2023-03-07 10:35:38 -08:00
rebuilt
06f9d2f0e9 Scrape enrollment and staffing information. Seed enrollment and staffing information. Update DatabaseCleaner so it cleans up leftover information in the database. Remove old admin csvs from codebase. 2023-03-05 16:15:35 -08:00
rebuilt
c0332955f3 move csv require statement to application.rb 2023-02-27 15:47:20 -08:00
rebuilt
ae6d7151b3 Load student respondent information from enrollment loader 2023-02-27 12:29:16 -08:00
rebuilt
984fedff02 Add academic year column to enrollment file. Add enrollment loader to seeder file 2023-02-22 01:12:09 -08:00
rebuilt
52d2676f80 Populate db with grade level data 2023-02-21 16:58:08 -08:00
rebuilt
69c089d9bb Remove occurances of MCIEA 2023-02-13 17:58:27 -08:00
rebuilt
01851d8527 Make sure blank grades in csv load as nils instead of zeros 2023-02-10 13:44:48 -08:00
rebuilt
3a744db975 Start adding kindergarten 2023-01-17 15:27:15 -08:00
rebuilt
c6b8518cb1 Fix bug with not all survey responses loading when using sftp loader 2023-01-12 17:05:00 -08:00
rebuilt
b1200dca33 update default folder for survey responses 2023-01-12 17:05:00 -08:00
rebuilt
7a159f7ddf Process 1000 rows at a time to limit memory usage in production 2023-01-12 17:05:00 -08:00
rebuilt
79adff5faa Don't print sftptogo_url 2023-01-12 17:05:00 -08:00
rebuilt
03c6bff7bb Load survey responses from sftp folder 2023-01-12 17:04:57 -08:00
rebuilt
1426e7cc63 add proof of concept sftp downloader 2023-01-12 17:02:35 -08:00
rebuilt
eeeb4f5d14 Add Stoklosa survey responses 2023-01-12 17:02:35 -08:00
rebuilt
d5533bb756 Support survey item variants. Survey item id's in the format s-sbel-q1-1 will load into s-sbel-q1 2023-01-12 17:02:05 -08:00
rebuilt
ed421296c7 Fix bug that caused an incorrect score of 1 to be assigned for admin
data item related to parity index of teachers of color to students of
color.  Now if there's not enough data to calculate a score, it produces
'NA' instead of 1. Also added admin data for a-reso-i1 and a-curv-i1
2023-01-06 17:43:48 -08:00
rebuilt
7b9abe3793 Scrape latest data from DESE 2022-12-20 15:26:11 -08:00
rebuilt
850ef9d1cd Make sure demographic loader doesn't break when changing the designation of a Race 2022-12-12 12:03:53 -08:00
rebuilt
53a08d41bf Change label for unknown race 2022-12-10 15:44:56 -08:00
rebuilt
a1b7db9b2e Add ability to load student demographic information for only lowell 2022-12-03 11:33:11 -08:00
rebuilt
d49cf918ac Add a way to seed only lowell schools 2022-12-03 10:23:02 -08:00
rebuilt
fc9e73f656 make sure survey responses for Butler get loaded correctly 2022-12-02 14:11:54 -08:00
rebuilt
9bbba64e9e Add rules to the survey data loader to allow skipping non-lowell schools 2022-12-01 16:15:45 -08:00
Nelson Jovel
43bc55ed32 Query for survey items once instead of on each row 2022-11-28 11:58:58 -08:00