rebuilt
2321897283
fix: start fixing problem with variants not getting added to the cleaned csv
2 years ago
rebuilt
a785c69c44
Add Overall Response Rate
2 years ago
rebuilt
4afa030141
chore: remove precalculated race scores. Calculate race scores on every reload
2 years ago
rebuilt
f035c4d9ad
fix: Filter out responses that don't correspond to the grades the school serves
2 years ago
rebuilt
5f49746bf4
feat: Rename income labels to 'Economically Disadvantaged' and 'Not Economically Disadvantaged'
2 years ago
rebuilt
67e469a66c
feat: add popover to analyze graphs that displays the n-size of the different columns. Make sure to only calculate a score for a race if there are more than 10 respondents to a question.
2 years ago
rebuilt
cec48e55d3
chore: remove outdated admin data loader file. We now use Dese::Loader to load school level data
2 years ago
rebuilt
5c7729beeb
feat: if admin data value is above 5, round down to 5
2 years ago
rebuilt
cbd5687ff0
feat: Add out of state admin data
2 years ago
rebuilt
4f035f6a63
feat: Add income table to the database. Add seeder for income. Add a reference to income from survey item response. Update the loader to import income data from the survey response csv. Refactor analyze controller to extract presenter. Add corresponding specs. Add income graph to analyze page
2 years ago
rebuilt
d72f8d31e0
fix: There was an n+1 problem where we looked up the list of schools for
...
every row. Now we query the list of schools just once per file
2 years ago
rebuilt
e8aa75bf66
feat: update survey_item_response table to indlude recorded date and import recorded date when loading responses
2 years ago
Nelson Jovel
0a2c5e02c5
feat: add ability to merge disaggregation data with raw survey data to
...
produce a cleaned csv with merged income disaggregation columns
3 years ago
rebuilt
e2d24a9bec
Fix: make sure values don't get reordered after copying over row values from survey item variants. This fixes a problem where cleaner would produce a row with likert scores that got shifted to align with the wrong column
3 years ago
rebuilt
ddf9a628d5
Fix: enable correct detection of student surveys types by rejecting any
...
headers ending with '-1' (the variants of standard questions)
3 years ago
rebuilt
896f0d9961
Don't write a file if there's an empty dataset
3 years ago
rebuilt
76b79b99c2
Fix: Parse headers when they are surrounded by quotes. This helps load recent csv files correctly
3 years ago
rebuilt
30285efd69
It's possible for admin data likert score values to be above 5. If that happens, we
...
cap the likert score at 5. This was happening already at the scraper
level but it's also now being done by the admin data loader for safety.
Also make sure to just update admin data instead of deleting and
reloading all values each load. Add tests to confirm this behavior
3 years ago
rebuilt
abe7a8804c
Don't check standard deviation for early education surveys
3 years ago
rebuilt
9aeb5f92af
Missing progress or duration information does not result in a row removed in the cleaning process
3 years ago
rebuilt
e3ae12b425
update response_date to recorded_date
3 years ago
rebuilt
a30921ce06
Add New Jersey enrollment and staffing data
3 years ago
rebuilt
93d087a5de
Use short district name for cleaned csv
3 years ago
rebuilt
8ef8cfce58
Adjust valid duration threshold of short form items
3 years ago
rebuilt
d6b2521883
Fix regression in student loader
3 years ago
rebuilt
4509c157fa
Add automated data cleaning. Modify SurveyItemValues class to use regex
...
instead of hard coded values. Produce a clean csv and a csv with all
the removed values and columns with reason for removal. Add script for
running cleaning for each project
3 years ago
rebuilt
359e266a6c
Remove unused TODOs
3 years ago
rebuilt
3f2a7dff50
Fix problem with dese scraper lumping in 2021-22 data as 2022-23 data.
...
Deleted unused csvs. Turned off puts statements in admin loader.
Remove old, now unused admin data loader class.
3 years ago
rebuilt
4b7d2e6feb
Only return files in sftp directory, not other directories
3 years ago
rebuilt
f68b650d8b
Calculate response rate on the fly instead of looking it up from the db
...
when calculating response rates.
3 years ago
rebuilt
10cedd2310
Remove unused filename
3 years ago
rebuilt
5b2bae0d92
Fix ThreeATwo scraper
3 years ago
rebuilt
128748addd
Update logic for calculating student response rate. Remove references
...
to survey table. We no longer check or keep track of the survey type.
Instead we look in the database to see if a survey item has at least 10
responses. If it does, that survey item was presented to the respondent
and we count it, and all responses when calculating the response rate.
Remove response rate timestamp from caching logic because we no longer
add the response rate to the database. All response rates are calculated
on the fly
Update three_b_two scraper to use teacher only numbers
swap over to using https://profiles.doe.mass.edu/statereport/gradesubjectstaffing.aspx as the source of staffing information
3 years ago
rebuilt
b250ebe415
Memoize schools in SurveyItemValues and academic_years in AcademicYear
...
for performace improvement
3 years ago
rebuilt
c15cb7b483
Change survey data loader spec to use factorybot objects instead of loading seeds. Change databasecleaner to use transaction. Add back babel-preset dependency to fix failing javascript test in production.
3 years ago
rebuilt
db3e4f1178
Set config variable for SFTP_TO_GO server
3 years ago
rebuilt
5e88c8be1b
Transition from Lowell to ECP. Update text and links to reflect the new Organization.
3 years ago
rebuilt
4c4ccc01cc
Merge branch 'rpp-response-rate' to bring in changes to test files
3 years ago
rebuilt
6b31fa9115
Batch imports for staffing data
3 years ago
rebuilt
d059177f0c
load total students and batch importing records
3 years ago
rebuilt
2362d884eb
Convert admin data loader from using seeder to using factory
3 years ago
rebuilt
d0219217de
Convert response rate loader spec from using the seeder to using the factory
3 years ago
rebuilt
06f9d2f0e9
Scrape enrollment and staffing information. Seed enrollment and staffing information. Update DatabaseCleaner so it cleans up leftover information in the database. Remove old admin csvs from codebase.
3 years ago
rebuilt
c0332955f3
move csv require statement to application.rb
3 years ago
rebuilt
ae6d7151b3
Load student respondent information from enrollment loader
3 years ago
rebuilt
984fedff02
Add academic year column to enrollment file. Add enrollment loader to seeder file
3 years ago
rebuilt
52d2676f80
Populate db with grade level data
3 years ago
rebuilt
69c089d9bb
Remove occurances of MCIEA
3 years ago
rebuilt
01851d8527
Make sure blank grades in csv load as nils instead of zeros
3 years ago
rebuilt
3a744db975
Start adding kindergarten
3 years ago
rebuilt
c6b8518cb1
Fix bug with not all survey responses loading when using sftp loader
3 years ago
rebuilt
b1200dca33
update default folder for survey responses
3 years ago
rebuilt
7a159f7ddf
Process 1000 rows at a time to limit memory usage in production
3 years ago
rebuilt
79adff5faa
Don't print sftptogo_url
3 years ago
rebuilt
03c6bff7bb
Load survey responses from sftp folder
3 years ago
rebuilt
1426e7cc63
add proof of concept sftp downloader
3 years ago
rebuilt
eeeb4f5d14
Add Stoklosa survey responses
3 years ago
rebuilt
d5533bb756
Support survey item variants. Survey item id's in the format s-sbel-q1-1 will load into s-sbel-q1
3 years ago
rebuilt
ed421296c7
Fix bug that caused an incorrect score of 1 to be assigned for admin
...
data item related to parity index of teachers of color to students of
color. Now if there's not enough data to calculate a score, it produces
'NA' instead of 1. Also added admin data for a-reso-i1 and a-curv-i1
3 years ago
rebuilt
7b9abe3793
Scrape latest data from DESE
3 years ago
rebuilt
850ef9d1cd
Make sure demographic loader doesn't break when changing the designation of a Race
3 years ago
rebuilt
53a08d41bf
Change label for unknown race
3 years ago
rebuilt
a1b7db9b2e
Add ability to load student demographic information for only lowell
3 years ago
rebuilt
d49cf918ac
Add a way to seed only lowell schools
3 years ago
rebuilt
fc9e73f656
make sure survey responses for Butler get loaded correctly
3 years ago
rebuilt
9bbba64e9e
Add rules to the survey data loader to allow skipping non-lowell schools
3 years ago
Nelson Jovel
43bc55ed32
Query for survey items once instead of on each row
3 years ago
rebuilt
83fa255e5d
Make utility method private
3 years ago
rebuilt
3ca270e3b1
Fix more malformed csv errors
3 years ago
rebuilt
360579852c
Send fewer sql queries when loading survey responses
3 years ago
rebuilt
9ae94a2dae
further reduce batch size to avoid exceeding memory quota
3 years ago
rebuilt
bb472ce6c3
Reduce number of rows to process to avoid exceeding memory limit on heroku
3 years ago
rebuilt
4626d43e3e
Add changes to demographic loader so it the tests don't fail when run in parallel
3 years ago
rebuilt
7d7bee6498
Add gender disagreggation
3 years ago
rebuilt
a1f3dfaa33
Load gender information to survey item responses
3 years ago
rebuilt
8cdd1b6285
Create gender class. Load genders to database from demographics file
3 years ago
rebuilt
f4e2e6ca83
Correct headers for 4D so the data can be correctly imported
3 years ago
rebuilt
f9c4ed8880
make sure grade gets updated when loading survey responses
3 years ago
rebuilt
89a7f27b88
Finishes #183092367 . Adds ui and graphs to show grade level disaggregations
3 years ago
Nelson Jovel
6ed405d16d
Update student csvs
3 years ago
rebuilt
dd9ed86dcd
Merge branch 'scrapers'
3 years ago
rebuilt
f556844b73
Update a-sust-i1 so it's only applicable to high schools and lower
...
benchmark for a-cure-i1 from 6.7 to 5 percent.
3 years ago
rebuilt
9e82023e48
Make sure we don't divide by 0
3 years ago
rebuilt
1a984d2cf0
Make sure we don't divide by 0
3 years ago
rebuilt
3ee909bbea
Update 3B-2 with an absolute floor
3 years ago
rebuilt
1be6e79fd8
Add scrapers for admin data and their corresponding csv files of data.
...
Make sure specs are off so they don't run in github CI
3 years ago
rebuilt
9e18bf2151
Turn off specs for scrapers. Generate csvs for scraped data
3 years ago
rebuilt
75cea539c5
Change scraped csv file names from word to number format 'one_a_one.csv'
...
-> '1A_1.csv''
3 years ago
rebuilt
3e28be2d0d
Remove 'scraper' from file/class names add 3B-1
3 years ago
rebuilt
83ef9310a4
Add scraper for 3A-1 and 2C-1. Finishes #183105787 and #183105716
3 years ago
rebuilt
b5bc9c9de4
Add 2A-i admin data. Finishes #183105627
3 years ago
rebuilt
459393381b
Load admin data for 1A-iii. Finishes #183105516
3 years ago
rebuilt
f783ce9724
add one-a
3 years ago
rebuilt
048aacd641
Add admin data for 4D. Finishes #182818489 and #182793402
3 years ago
rebuilt
043f6de8e7
Prefer using over uniq so the filtering happens in sql
3 years ago
rebuilt
30eae881f3
Load correct column for 2021-22 year
3 years ago
rebuilt
bc63dfab7e
add one_off scripts for each academic year
3 years ago
rebuilt
3fcab58557
Do not erase all students when loading each file
3 years ago
rebuilt
3f493727b3
Load a single year of student demographic data and race scores at a time
3 years ago
rebuilt
4a32ea3332
Reduce memory use in race score loader
3 years ago