Commit graph

43 commits

Author SHA1 Message Date
rebuilt
fbaeb2769f scrape admin, enrollments, and staffing data 2026-03-05 13:20:21 -08:00
rebuilt
ddd20b2053 scrape admin, enrollments, and staffing data 2026-03-05 12:17:44 -08:00
rebuilt
1bf35012d0 update admin data 2025-12-04 20:41:41 -08:00
rebuilt
044a77cd3d feat: pull down new admin data. Fix problems with some scrapers. For 5D-ii make sure the likert score is 1 if the nurse count is 0. Prevent division by zero returning infinity. For 3A-ii make sure not to perform division by zero. Likert score will be 1 if the number of guidance counselors, psychologists, or paraprofessionals is zero. 2025-03-12 13:46:01 -07:00
nelson
ef9e5b7ebb feat: Add scraper for a-curv-i7. Scrape new admin data for that item 2025-02-13 11:45:30 -08:00
nelson
83b426fbc9 feat: add scraper for 4D-ii 2025-02-06 21:09:56 -08:00
nelson
4c28b10bd4 feat: finish scraper for students of color in advanced courses 2025-02-06 13:08:04 -08:00
nelson
685bafd52c Fix hs only admin data item. Use alternate method to check if a school is a high school 2025-02-06 13:07:05 -08:00
rebuilt
1de5d4d456 chore: fix problem with enrollment scraper 2025-01-27 15:13:25 -08:00
Nelson Jovel
28c6eb7fc0 chore: scrape new admin data; a-ovpe-i1 2025-01-17 18:32:00 -08:00
Nelson Jovel
2333e9c8c7 feat: add admin data item a-curv-i5 with temporary benchmark of 2 2025-01-15 16:32:22 -08:00
Nelson Jovel
33da0859b9 Split academic year into seasons if the academic year's range is
initialized with a season, e.g. "2024-25 Fall". Update scrapers for
admin data, enrollment and staffing to use the new range standard
correctly. Update the loaders for admin data, enrollment and staffing
so that they populate all seasons in a given year. So admin data for
2024-25 gets loaded into "2024-25 Fall" and "2024-25 Spring". Add tests
for the new range format. Set the default cutoff for the start of the Spring season to the last Sunday in February.
2024-04-25 09:21:04 -07:00
5789ebf564 Faster admin data loader + rename School.school_hash 2024-04-22 14:46:37 -04:00
04d61c0b1a Memoize admin data loader 2024-04-18 17:13:03 -04:00
rebuilt
5c7729beeb feat: if admin data value is above 5, round down to 5 2023-07-21 12:14:46 -07:00
rebuilt
cbd5687ff0 feat: Add out of state admin data 2023-07-20 17:06:07 -07:00
rebuilt
4509c157fa Add automated data cleaning. Modify SurveyItemValues class to use regex
instead of hard coded values.  Produce a clean csv and a csv with all
the removed values and columns with reason for removal. Add script for
running cleaning for each project
2023-05-16 13:38:29 -07:00
rebuilt
359e266a6c Remove unused TODOs 2023-04-27 15:47:45 -07:00
rebuilt
3f2a7dff50 Fix problem with dese scraper lumping in 2021-22 data as 2022-23 data.
Deleted unused csvs.  Turned off puts statements in admin loader.
Remove old, now unused admin data loader class.
2023-04-27 15:43:17 -07:00
rebuilt
10cedd2310 Remove unused filename 2023-04-20 17:42:27 -07:00
rebuilt
5b2bae0d92 Fix ThreeATwo scraper 2023-04-19 19:20:54 -07:00
rebuilt
128748addd Update logic for calculating student response rate. Remove references
to survey table.  We no longer check or keep track of the survey type.
Instead we look in the database to see if a survey item has at least 10
responses.  If it does, that survey item was presented to the respondent
and we count it, and all responses when calculating the response rate.

Remove response rate timestamp from caching logic because we no longer
add the response rate to the database. All response rates are calculated
on the fly

Update three_b_two scraper to use teacher only numbers

swap over to using https://profiles.doe.mass.edu/statereport/gradesubjectstaffing.aspx as the source of staffing information
2023-04-18 13:59:29 -07:00
rebuilt
4c4ccc01cc Merge branch 'rpp-response-rate' to bring in changes to test files 2023-03-15 15:00:25 -07:00
rebuilt
06f9d2f0e9 Scrape enrollment and staffing information. Seed enrollment and staffing information. Update DatabaseCleaner so it cleans up leftover information in the database. Remove old admin csvs from codebase. 2023-03-05 16:15:35 -08:00
rebuilt
c0332955f3 move csv require statement to application.rb 2023-02-27 15:47:20 -08:00
rebuilt
69c089d9bb Remove occurrences of MCIEA 2023-02-13 17:58:27 -08:00
rebuilt
ed421296c7 Fix bug that caused an incorrect score of 1 to be assigned for admin
data item related to parity index of teachers of color to students of
color.  Now if there's not enough data to calculate a score, it produces
'NA' instead of 1. Also added admin data for a-reso-i1 and a-curv-i1
2023-01-06 17:43:48 -08:00
rebuilt
7b9abe3793 Scrape latest data from DESE 2022-12-20 15:26:11 -08:00
rebuilt
f4e2e6ca83 Correct headers for 4D so the data can be correctly imported 2022-10-21 17:29:19 -07:00
rebuilt
dd9ed86dcd Merge branch 'scrapers' 2022-09-21 19:54:24 -07:00
rebuilt
f556844b73 Update a-sust-i1 so it's only applicable to high schools and lower
benchmark for a-cure-i1 from 6.7 to 5 percent.
2022-09-21 19:50:33 -07:00
rebuilt
9e82023e48 Make sure we don't divide by 0 2022-09-20 16:44:11 -07:00
rebuilt
1a984d2cf0 Make sure we don't divide by 0 2022-09-20 16:43:19 -07:00
rebuilt
3ee909bbea Update 3B-2 with an absolute floor 2022-09-20 14:45:20 -07:00
rebuilt
1be6e79fd8 Add scrapers for admin data and their corresponding csv files of data.
Make sure specs are off so they don't run in github CI
2022-09-19 19:37:39 -07:00
rebuilt
9e18bf2151 Turn off specs for scrapers. Generate csvs for scraped data 2022-09-13 22:59:04 -07:00
rebuilt
75cea539c5 Change scraped csv file names from word to number format 'one_a_one.csv'
-> '1A_1.csv'
2022-09-13 10:46:18 -07:00
rebuilt
3e28be2d0d Remove 'scraper' from file/class names; add 3B-1 2022-09-12 11:24:57 -07:00
rebuilt
83ef9310a4 Add scraper for 3A-1 and 2C-1. Finishes #183105787 and #183105716 2022-09-07 11:34:42 -07:00
rebuilt
b5bc9c9de4 Add 2A-i admin data. Finishes #183105627 2022-09-05 22:37:53 -07:00
rebuilt
459393381b Load admin data for 1A-iii. Finishes #183105516 2022-09-05 21:01:32 -07:00
rebuilt
f783ce9724 add one-a 2022-09-01 20:00:20 -07:00
rebuilt
048aacd641 Add admin data for 4D. Finishes #182818489 and #182793402 2022-08-30 21:46:23 -07:00