Nelson Jovel
f028e6c884
feat: if the filename includes the words 'form' or 'part' add that to the resulting cleaned filename
2 years ago
Nelson Jovel
3f44613085
chore: various fixes for race and gender categorization during cleaning.
...
Also add tests for race and gender categorization
2 years ago
Nelson Jovel
883731bce1
feat: Support SIS categories for race in the form of 'White and Asian'
...
or 'Black, Asian, and white'
2 years ago
Nelson Jovel
0931950eaf
chore: make sure to output race and gender columns during cleaning
2 years ago
Nelson Jovel
3db0f9f757
fix: Print out message to make clean when there are duplicate headers
...
present in the raw survey file
2 years ago
Nelson Jovel
6541b87e9c
feat: add 2023-24 academic year and make sure previous year enrollment
...
and staffing data get loaded when missing
2 years ago
Nelson Jovel
b7e670bb60
Lower threshold for the number of valid student responses from 17 to 11
2 years ago
Nelson Jovel
7dc881f654
chore: refactor code to put logic into models
2 years ago
Nelson Jovel
e325f38c43
Convert gender and race text into qualtrics codes during cleaning. Abide by 'prefer not to disclose' for self reported race. Give priority to self reported data but use SIS information as backup
2 years ago
Nelson Jovel
9efc1f41c6
chore: Add comment about StaffingLoader also cloning enrollment data when it clones staffing data
2 years ago
Nelson Jovel
f6f78bcd58
fix: make sure to grab the 'Gender- Qcode' column
2 years ago
Nelson Jovel
8bebe7db42
chore: since it's now possible for there to be multiple district and dese id columns separated by a dash and a number, be more explicit when we only want to filter out survey item ids that end in a -1
2 years ago
Nelson Jovel
b63c327d33
chore: when searching for dese id, split up pattern so that to be more explicit about the order in which to search out the columns that might have the dese ID we're looking for.
2 years ago
rebuilt
cddea60c8b
feat: reduce number of rows to process at one time to reduce memory use
2 years ago
rebuilt
1a707eb6bc
feat: load student responses in the same pass as loading the survey responses
...
chore: remove student loader since loading students is now done with the survey response loader
2 years ago
rebuilt
e3fbbabce5
feat: We no longer trust the progress number that gets exported from qualtrics. Instead during the cleaning progress, perform a manual count of the number of responses to filter out rows that don't meet the minimum threshold.
2 years ago
rebuilt
83661540b7
chore: upgrade to rails 7.1.
...
upgrade rspec
fix failing tests
upgrade devise
2 years ago
rebuilt
48e795fcfb
feat: add special education disaggregation
2 years ago
rebuilt
060d7aa55a
Add disaggregation by ELL
2 years ago
rebuilt
abea2cb8fa
feat: support multiple columns for race and gender information
2 years ago
rebuilt
463e4c9452
fix: hide scores on analyze page for scores that don't meet the student threshold of 25%
2 years ago
rebuilt
714b90b3eb
fix: ensure cleaner outputs columns for all survey items. Before the fix, if a survey item varient (ending in -1, ie s-tint-q1-1) did not have a matching survey item s-tint-q1, the resulting csv would not include that column
2 years ago
rebuilt
2321897283
fix: start fixing problem with variants not getting added to the cleaned csv
2 years ago
rebuilt
a785c69c44
Add Overall Response Rate
2 years ago
rebuilt
4afa030141
chore: remove precalculated race scores. Calculate race scores on every reload
2 years ago
rebuilt
f035c4d9ad
fix: Filter out responses that don't correspond to the grades the school serves
2 years ago
rebuilt
5f49746bf4
feat: Rename income labels to 'Economically Disadvantaged' and 'Not Economically Disadvantaged'
2 years ago
rebuilt
67e469a66c
feat: add popover to analyze graphs that displays the n-size of the different columns. Make sure to only calculate a score for a race if there are more than 10 respondents to a question.
2 years ago
rebuilt
cec48e55d3
chore: remove outdated admin data loader file. We now use Dese::Loader to load school level data
2 years ago
rebuilt
5c7729beeb
feat: if admin data value is above 5, round down to 5
2 years ago
rebuilt
cbd5687ff0
feat: Add out of state admin data
2 years ago
rebuilt
4f035f6a63
feat: Add income table to the database. Add seeder for income. Add a reference to income from survey item response. Update the loader to import income data from the survey response csv. Refactor analyze controller to extract presenter. Add corresponding specs. Add income graph to analyze page
2 years ago
rebuilt
d72f8d31e0
fix: There was an n+1 problem where we looked up the list of schools for
...
every row. Now we query the list of schools just once per file
2 years ago
rebuilt
e8aa75bf66
feat: update survey_item_response table to indlude recorded date and import recorded date when loading responses
2 years ago
Nelson Jovel
0a2c5e02c5
feat: add ability to merge disaggregation data with raw survey data to
...
produce a cleaned csv with merged income disaggregation columns
3 years ago
rebuilt
e2d24a9bec
Fix: make sure values don't get reordered after copying over row values from survey item variants. This fixes a problem where cleaner would produce a row with likert scores that got shifted to align with the wrong column
3 years ago
rebuilt
ddf9a628d5
Fix: enable correct detection of student surveys types by rejecting any
...
headers ending with '-1' (the variants of standard questions)
3 years ago
rebuilt
896f0d9961
Don't write a file if there's an empty dataset
3 years ago
rebuilt
76b79b99c2
Fix: Parse headers when they are surrounded by quotes. This helps load recent csv files correctly
3 years ago
rebuilt
30285efd69
It's possible for admin data likert score values to be above 5. If that happens, we
...
cap the likert score at 5. This was happening already at the scraper
level but it's also now being done by the admin data loader for safety.
Also make sure to just update admin data instead of deleting and
reloading all values each load. Add tests to confirm this behavior
3 years ago
rebuilt
abe7a8804c
Don't check standard deviation for early education surveys
3 years ago
rebuilt
9aeb5f92af
Missing progress or duration information does not result in a row removed in the cleaning process
3 years ago
rebuilt
e3ae12b425
update response_date to recorded_date
3 years ago
rebuilt
a30921ce06
Add New Jersey enrollment and staffing data
3 years ago
rebuilt
93d087a5de
Use short district name for cleaned csv
3 years ago
rebuilt
8ef8cfce58
Adjust valid duration threshold of short form items
3 years ago
rebuilt
d6b2521883
Fix regression in student loader
3 years ago
rebuilt
4509c157fa
Add automated data cleaning. Modify SurveyItemValues class to use regex
...
instead of hard coded values. Produce a clean csv and a csv with all
the removed values and columns with reason for removal. Add script for
running cleaning for each project
3 years ago
rebuilt
359e266a6c
Remove unused TODOs
3 years ago
rebuilt
3f2a7dff50
Fix problem with dese scraper lumping in 2021-22 data as 2022-23 data.
...
Deleted unused csvs. Turned off puts statements in admin loader.
Remove old, now unused admin data loader class.
3 years ago
rebuilt
4b7d2e6feb
Only return files in sftp directory, not other directories
3 years ago
rebuilt
f68b650d8b
Calculate response rate on the fly instead of looking it up from the db
...
when calculating response rates.
3 years ago
rebuilt
10cedd2310
Remove unused filename
3 years ago
rebuilt
5b2bae0d92
Fix ThreeATwo scraper
3 years ago
rebuilt
128748addd
Update logic for calculating student response rate. Remove references
...
to survey table. We no longer check or keep track of the survey type.
Instead we look in the database to see if a survey item has at least 10
responses. If it does, that survey item was presented to the respondent
and we count it, and all responses when calculating the response rate.
Remove response rate timestamp from caching logic because we no longer
add the response rate to the database. All response rates are calculated
on the fly
Update three_b_two scraper to use teacher only numbers
swap over to using https://profiles.doe.mass.edu/statereport/gradesubjectstaffing.aspx as the source of staffing information
3 years ago
rebuilt
b250ebe415
Memoize schools in SurveyItemValues and academic_years in AcademicYear
...
for performace improvement
3 years ago
rebuilt
c15cb7b483
Change survey data loader spec to use factorybot objects instead of loading seeds. Change databasecleaner to use transaction. Add back babel-preset dependency to fix failing javascript test in production.
3 years ago
rebuilt
db3e4f1178
Set config variable for SFTP_TO_GO server
3 years ago
rebuilt
5e88c8be1b
Transition from Lowell to ECP. Update text and links to reflect the new Organization.
3 years ago
rebuilt
4c4ccc01cc
Merge branch 'rpp-response-rate' to bring in changes to test files
3 years ago
rebuilt
6b31fa9115
Batch imports for staffing data
3 years ago
rebuilt
d059177f0c
load total students and batch importing records
3 years ago
rebuilt
2362d884eb
Convert admin data loader from using seeder to using factory
3 years ago
rebuilt
d0219217de
Convert response rate loader spec from using the seeder to using the factory
3 years ago
rebuilt
06f9d2f0e9
Scrape enrollment and staffing information. Seed enrollment and staffing information. Update DatabaseCleaner so it cleans up leftover information in the database. Remove old admin csvs from codebase.
3 years ago
rebuilt
c0332955f3
move csv require statement to application.rb
3 years ago
rebuilt
ae6d7151b3
Load student respondent information from enrollment loader
3 years ago
rebuilt
984fedff02
Add academic year column to enrollment file. Add enrollment loader to seeder file
3 years ago
rebuilt
52d2676f80
Populate db with grade level data
3 years ago
rebuilt
69c089d9bb
Remove occurances of MCIEA
3 years ago
rebuilt
01851d8527
Make sure blank grades in csv load as nils instead of zeros
3 years ago
rebuilt
3a744db975
Start adding kindergarten
3 years ago
rebuilt
c6b8518cb1
Fix bug with not all survey responses loading when using sftp loader
3 years ago
rebuilt
b1200dca33
update default folder for survey responses
3 years ago
rebuilt
7a159f7ddf
Process 1000 rows at a time to limit memory usage in production
3 years ago
rebuilt
79adff5faa
Don't print sftptogo_url
3 years ago
rebuilt
03c6bff7bb
Load survey responses from sftp folder
3 years ago
rebuilt
1426e7cc63
add proof of concept sftp downloader
3 years ago
rebuilt
eeeb4f5d14
Add Stoklosa survey responses
3 years ago
rebuilt
d5533bb756
Support survey item variants. Survey item id's in the format s-sbel-q1-1 will load into s-sbel-q1
3 years ago
rebuilt
ed421296c7
Fix bug that caused an incorrect score of 1 to be assigned for admin
...
data item related to parity index of teachers of color to students of
color. Now if there's not enough data to calculate a score, it produces
'NA' instead of 1. Also added admin data for a-reso-i1 and a-curv-i1
3 years ago
rebuilt
7b9abe3793
Scrape latest data from DESE
3 years ago
rebuilt
850ef9d1cd
Make sure demographic loader doesn't break when changing the designation of a Race
3 years ago
rebuilt
53a08d41bf
Change label for unknown race
3 years ago
rebuilt
a1b7db9b2e
Add ability to load student demographic information for only lowell
3 years ago
rebuilt
d49cf918ac
Add a way to seed only lowell schools
3 years ago
rebuilt
fc9e73f656
make sure survey responses for Butler get loaded correctly
3 years ago
rebuilt
9bbba64e9e
Add rules to the survey data loader to allow skipping non-lowell schools
3 years ago
Nelson Jovel
43bc55ed32
Query for survey items once instead of on each row
3 years ago
rebuilt
83fa255e5d
Make utility method private
3 years ago
rebuilt
3ca270e3b1
Fix more malformed csv errors
3 years ago
rebuilt
360579852c
Send fewer sql queries when loading survey responses
3 years ago
rebuilt
9ae94a2dae
further reduce batch size to avoid exceeding memory quota
3 years ago
rebuilt
bb472ce6c3
Reduce number of rows to process to avoid exceeding memory limit on heroku
3 years ago
rebuilt
4626d43e3e
Add changes to demographic loader so it the tests don't fail when run in parallel
3 years ago
rebuilt
7d7bee6498
Add gender disagreggation
3 years ago
rebuilt
a1f3dfaa33
Load gender information to survey item responses
3 years ago
rebuilt
8cdd1b6285
Create gender class. Load genders to database from demographics file
3 years ago
rebuilt
f4e2e6ca83
Correct headers for 4D so the data can be correctly imported
3 years ago
rebuilt
f9c4ed8880
make sure grade gets updated when loading survey responses
3 years ago