Commit
refactor: move experiments into experiments folder.
Also make translation more robust
1 parent
47a5959
commit 529132b
Showing 37 changed files with 131 additions and 63 deletions.
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,3 @@ | ||
# Experiments | ||
|
||
This directory holds code that experiments with LLMs but is not (yet) meant to be deployed to the LLM service. The only code that is deployed is in `/app`. |
21 files renamed without changes.
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,47 @@ | ||
""" | ||
Functions to explore existing topic curations | ||
""" | ||
from topic_source_curation.common import get_datasets | ||
from collections import defaultdict | ||
import django | ||
django.setup() | ||
from sefaria.model import * | ||
|
||
|
||
if __name__ == '__main__': | ||
    bad, good = get_datasets() | ||
    counts = defaultdict(int) | ||
    for example in good: | ||
        for source in example.sources: | ||
            oref = Ref(source.ref) | ||
            index = oref.index | ||
            if index.get_primary_corpus(): | ||
                counts[index.get_primary_corpus()] += 1 | ||
            elif len(getattr(index, 'authors', [])) == 1: | ||
                counts[index.authors[0]] += 1 | ||
            else: | ||
                counts[index.title] += 1 | ||
    for k, v in sorted(counts.items(), key=lambda x: x[1], reverse=True): | ||
        print(k, v) | ||
|
||
""" | ||
quick takeaways: | ||
- sources | ||
- Tanakh, Mishnah, Midrash Rabbah, Bavli, Yerushalmi, MT, SA, Siddurim, Rashi, Zohar | ||
- Pirkei DeRabbi Eliezer, Tanchuma, Ibn Ezra, Ramban, Mekhilta | ||
- Acharonim that give major insights | ||
- maharl | ||
- ramchal | ||
- Shnei Luchot Habrit | ||
- Nachman of Breslov | ||
- levi-yitzchak-of-berditchev | ||
- tzadok-hakohen-of-lublin | ||
- Modern english | ||
- erica brown | ||
- Peninei Halacha | ||
- Eliezer Berkovitz | ||
This is a fairly comprehensive list of everything that comes up 3 times or more | ||
""" | ||
|
||
|
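The script above buckets each source by its primary corpus, falling back to a sole author and then to the work's title. The takeaways keep only buckets seen 3 times or more. Below is a minimal standalone sketch of that rule, assuming nothing about the real Sefaria models: `FakeIndex`, `bucket_for`, `count_buckets`, and the sample rows are hypothetical stand-ins invented for illustration, not part of this commit.

```python
from collections import Counter
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class FakeIndex:
    """Hypothetical stand-in for an index record (not the real Sefaria class)."""
    title: str
    primary_corpus: Optional[str] = None
    authors: List[str] = field(default_factory=list)


def bucket_for(index: FakeIndex) -> str:
    """Pick the counting bucket: corpus first, then a sole author, then the title."""
    if index.primary_corpus:
        return index.primary_corpus
    if len(index.authors) == 1:
        return index.authors[0]
    return index.title


def count_buckets(indexes, min_count=1):
    """Tally buckets and return (bucket, count) pairs, most frequent first."""
    counts = Counter(bucket_for(i) for i in indexes)
    return [(k, v) for k, v in counts.most_common() if v >= min_count]


# Made-up sample data, for illustration only.
sample = [
    FakeIndex("Genesis", primary_corpus="Tanakh"),
    FakeIndex("Exodus", primary_corpus="Tanakh"),
    FakeIndex("Numbers", primary_corpus="Tanakh"),
    FakeIndex("Hypothetical Work A", authors=["some-author"]),
    FakeIndex("Hypothetical Work B", authors=["some-author"]),
    FakeIndex("Hypothetical Work C"),
]

# Keep only buckets that appear at least twice in this tiny sample.
frequent = count_buckets(sample, min_count=2)
for name, n in frequent:
    print(name, n)
```

Using `Counter.most_common()` replaces the manual sort over `counts.items()`, and the `min_count` argument makes the "3 times or more" cutoff explicit instead of being applied by eye.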
96 changes: 48 additions & 48 deletions
...or source curation data set) - Sheet1.csv → ...or source curation data set) - Sheet1.csv
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,49 +1,49 @@ | ||
slug,category,issue,what should it have | ||
avishai,biblical figures,5 first sources are from Shmuel,"assortment of sources that can be found from search such as Midrash, Talmud, Commentary " | ||
birds,nature,has Chullin too many times,"first source should be from Genesis, Chullin should appear once" | ||
songs,art,feels like a search page,"assortment of sources that can be found from search such as Midrash, Talmud, Commentary" | ||
tsaddik,n/a,feels like a search page,"assortment of sources that can be found from search such as Midrash, Talmud, Commentary, Chassidut" | ||
flood,stories,first three sources about the story but not exact psukim ,sources should give an overview of the story | ||
building-of-the-second-temple,n/a,has Chagai too many times ,"assortment of sources that can be found from search such as Midrash, Talmud, Commentary, Chassidut" | ||
garden-of-eden,stories,Bereshit Rabbah and Genesis appear too many times ,"assortment of sources that can be found from search such as Midrash, Talmud, Commentary, Chassidut" | ||
faith,values,Too many Exodus refs , | ||
religion,beliefs,Too many Sefer Haikarim refs, | ||
avodat-hashem,values,Too many deuteronomy refs, | ||
bread,food,feels like a search page, | ||
the-spies,biblical figures,Too many Numbers refs, | ||
canaan,places,has no sources but should, | ||
slaves,social issues,feels like a search page, | ||
wars,social issues,Too many deuteronomy refs, | ||
parah-adumah,ritual objetcs,Too many Numbers refs, | ||
leadership,social issues,Too many Numbers refs, | ||
kings,n/a,Too many deuteronomy refs, | ||
elijah,biblical figures,Kings appear too many times, | ||
prophecy,beliefs,feels like a search page, | ||
women,social issues,feels like a search page, | ||
daughters-of-zelophehad,biblical figures,Too many Numbers refs, | ||
night,nature,doesn't have sources from Torah, | ||
exile,history,Doesn't start with Abraham and other known texts, | ||
music,art,feels like a search page, | ||
miriam,biblical figures,Too many Exodus and Numbers refs, | ||
rabbis,n/a,"only one source, should have more ", | ||
money,social issues,Too many Exodus refs, | ||
high-priest,biblical figures,feels like a search page, | ||
free-will,philosophy,Too many deuteronomy refs, | ||
minhag,Halachic Principles,feels like a search page, | ||
bal-tashchit,values,feels like a search page, | ||
shekhinah,beliefs,Too many Exodus refs, | ||
parah-adumah,ritual objetcs,Too many Numbers refs, | ||
love,values,feels like a search page, | ||
golems,supernatural,mostly talmudic sources , | ||
revelation,beliefs,Too many Exodus refs, | ||
yetzer-hara,beliefs,feels like a search page, | ||
birth,life cycle,feels like a search page, | ||
visiting-the-sick,values,mostly talmudic sources , | ||
mitzvot,beliefs,feels like a search page, | ||
leadership,social issues,Too many Numbers refs, | ||
shalom-bayit,values,can have more sources , | ||
wine,food,feels like a search page, | ||
angels,supernatural,feels like a search page, | ||
shivah-asar-betammuz,jewish calendar,first three sources give the same information , | ||
korach,biblical figures,Too many Numbers refs, | ||
slug,category,issue,what should it have | ||
avishai,biblical figures,5 first sources are from Shmuel,"assortment of sources that can be found from search such as Midrash, Talmud, Commentary " | ||
birds,nature,has Chullin too many times,"first source should be from Genesis, Chullin should appear once" | ||
songs,art,feels like a search page,"assortment of sources that can be found from search such as Midrash, Talmud, Commentary" | ||
tsaddik,n/a,feels like a search page,"assortment of sources that can be found from search such as Midrash, Talmud, Commentary, Chassidut" | ||
flood,stories,first three sources about the story but not exact psukim ,sources should give an overview of the story | ||
building-of-the-second-temple,n/a,has Chagai too many times ,"assortment of sources that can be found from search such as Midrash, Talmud, Commentary, Chassidut" | ||
garden-of-eden,stories,Bereshit Rabbah and Genesis appear too many times ,"assortment of sources that can be found from search such as Midrash, Talmud, Commentary, Chassidut" | ||
faith,values,Too many Exodus refs , | ||
religion,beliefs,Too many Sefer Haikarim refs, | ||
avodat-hashem,values,Too many deuteronomy refs, | ||
bread,food,feels like a search page, | ||
the-spies,biblical figures,Too many Numbers refs, | ||
canaan,places,has no sources but should, | ||
slaves,social issues,feels like a search page, | ||
wars,social issues,Too many deuteronomy refs, | ||
parah-adumah,ritual objetcs,Too many Numbers refs, | ||
leadership,social issues,Too many Numbers refs, | ||
kings,n/a,Too many deuteronomy refs, | ||
elijah,biblical figures,Kings appear too many times, | ||
prophecy,beliefs,feels like a search page, | ||
women,social issues,feels like a search page, | ||
daughters-of-zelophehad,biblical figures,Too many Numbers refs, | ||
night,nature,doesn't have sources from Torah, | ||
exile,history,Doesn't start with Abraham and other known texts, | ||
music,art,feels like a search page, | ||
miriam,biblical figures,Too many Exodus and Numbers refs, | ||
rabbis,n/a,"only one source, should have more ", | ||
money,social issues,Too many Exodus refs, | ||
high-priest,biblical figures,feels like a search page, | ||
free-will,philosophy,Too many deuteronomy refs, | ||
minhag,Halachic Principles,feels like a search page, | ||
bal-tashchit,values,feels like a search page, | ||
shekhinah,beliefs,Too many Exodus refs, | ||
parah-adumah,ritual objetcs,Too many Numbers refs, | ||
love,values,feels like a search page, | ||
golems,supernatural,mostly talmudic sources , | ||
revelation,beliefs,Too many Exodus refs, | ||
yetzer-hara,beliefs,feels like a search page, | ||
birth,life cycle,feels like a search page, | ||
visiting-the-sick,values,mostly talmudic sources , | ||
mitzvot,beliefs,feels like a search page, | ||
leadership,social issues,Too many Numbers refs, | ||
shalom-bayit,values,can have more sources , | ||
wine,food,feels like a search page, | ||
angels,supernatural,feels like a search page, | ||
shivah-asar-betammuz,jewish calendar,first three sources give the same information , | ||
korach,biblical figures,Too many Numbers refs, | ||
medicine,health,feels like a search page, |
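Some slugs in the sheet above appear more than once (`parah-adumah` and `leadership`), and a handful of issue labels repeat across most rows. A small sanity check along these lines can surface both. This is only a sketch: `summarize` is a hypothetical helper, and the inline `SAMPLE` text copies a few rows from the sheet instead of assuming the real file path.

```python
import csv
import io
from collections import Counter

# A few rows copied from the sheet; the real file location is not assumed here.
SAMPLE = """slug,category,issue,what should it have
parah-adumah,ritual objetcs,Too many Numbers refs,
leadership,social issues,Too many Numbers refs,
prophecy,beliefs,feels like a search page,
parah-adumah,ritual objetcs,Too many Numbers refs,
leadership,social issues,Too many Numbers refs,
wine,food,feels like a search page,
"""


def summarize(csv_text):
    """Return (sorted duplicate slugs, Counter of issue labels) for CSV text."""
    rows = list(csv.DictReader(io.StringIO(csv_text)))
    slug_counts = Counter(row["slug"].strip() for row in rows)
    # Strip whitespace because some issue cells carry trailing spaces.
    issue_counts = Counter(row["issue"].strip() for row in rows)
    duplicates = sorted(s for s, n in slug_counts.items() if n > 1)
    return duplicates, issue_counts


duplicates, issue_counts = summarize(SAMPLE)
print("duplicate slugs:", duplicates)
for issue, n in issue_counts.most_common():
    print(n, issue)
```

Stripping whitespace before counting matters for this sheet, since labels such as "Too many Exodus refs " would otherwise be tallied separately from their unpadded form.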
7 files renamed without changes.