-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
limiting outcome comparison to a period, filtering out by percentile in the last year #36
Conversation
Added gdp data and the minimal working version
Added population and generalized data loading and euclidean distances to multiple features
@riadas please investigate the workflow failure, seems to be related to dependencies installation. All tests pass locally. |
…into elm-compile-unempl-rate
…ch/cities into nl-add-industry-time-series
adding industry composition as a time series
Yeh, that seems like a bug. Thanks for spotting it! Will get on it once the first version of the explainability stuff is ready. |
@riadas @emackev the bug has been fixed, inspect the first few cells of the similarity notebook to confirm the results are as desired. @emackev once the first issue has been resolved, the second is not a bug, but a feature. you should expect different years for different variables, as you restrict weights to certain years for the outcome variable only and use whatever data is available for other variables. Other variables differ in what years are available. In a discussion about this feature we talked about locations that "have similar outcome variable for years x-y and are similar in other respects", and this implements the idea. If you want comparison years to restrict all variables in similarity search, this can be implemented, but I'm not sure if this is what you want. |
…ch/cities into ru-divergence-search
…ch/cities into ru-explainability-table
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Things are working well enough now on the frontend, so think this is good to go.
Ru explainability table
scraping unemployment data from BLS, script & csv
Resolves #32.
Now you can restrict outcome variable attention in similarity calculations by years, and euclidean kins output by outcome performance percentile in the last year, like so:
f = FipsQuery(42001, "gdp", lag = 0, top =5, time_decay = 1.06,
outcome_comparison_period=(2003, 2019), outcome_percentile_range= (40,100))
A working example is available in
similarity_demo.ipynb
.