{"payload":{"feedbackUrl":"https://github.com/orgs/community/discussions/53140","repo":{"id":665909034,"defaultBranch":"main","name":"speakleash-examples","ownerLogin":"speakleash","currentUserCanPush":false,"isFork":false,"isEmpty":false,"createdAt":"2023-07-13T09:14:21.000Z","ownerAvatar":"https://avatars.githubusercontent.com/u/116965386?v=4","public":true,"private":false,"isOrgOwned":true},"refInfo":{"name":"","listCacheKey":"v0:1709391697.0","currentOid":""},"activityList":{"items":[{"before":"ae894eef42ab9ab01b85073bfc279e16ff8cbb5d","after":"ace0e1aaffa331703af2211316434eb07807c6b1","ref":"refs/heads/main","pushedAt":"2024-03-02T16:05:16.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"IgorTest19","name":null,"path":"/IgorTest19","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/58894110?s=80&v=4"},"commit":{"message":"update examples (#29)","shortMessageHtmlLink":"update examples (#29)"}},{"before":null,"after":"a6114d8d30bcf2968d64a9da008a1a6e5d457186","ref":"refs/heads/modification/updating_examples","pushedAt":"2024-03-02T15:01:37.000Z","pushType":"branch_creation","commitsCount":0,"pusher":{"login":"IgorTest19","name":null,"path":"/IgorTest19","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/58894110?s=80&v=4"},"commit":{"message":"update examples","shortMessageHtmlLink":"update examples"}},{"before":"7eb5f8ea3da29197095c0bcb662ab63504aba6d4","after":"ae894eef42ab9ab01b85073bfc279e16ff8cbb5d","ref":"refs/heads/main","pushedAt":"2023-10-19T20:01:47.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"IgorTest19","name":null,"path":"/IgorTest19","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/58894110?s=80&v=4"},"commit":{"message":"readme extended with 7 and 8 examples (#27)\n\n* readme extended with 7 and 8 examples\r\n\r\n* fix typo","shortMessageHtmlLink":"readme extended with 7 and 8 examples (#27)"}},{"before":"c1e2d56c64fe3b522205b8080ddc63e25107b46f","after":"6ea74cafcc5c0034c09144da49521ddc1fac9480","ref":"refs/heads/readme_upgrade_with_7_8_examples","pushedAt":"2023-10-19T19:57:42.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"IgorTest19","name":null,"path":"/IgorTest19","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/58894110?s=80&v=4"},"commit":{"message":"fix typo","shortMessageHtmlLink":"fix typo"}},{"before":null,"after":"c1e2d56c64fe3b522205b8080ddc63e25107b46f","ref":"refs/heads/readme_upgrade_with_7_8_examples","pushedAt":"2023-10-19T19:54:09.000Z","pushType":"branch_creation","commitsCount":0,"pusher":{"login":"IgorTest19","name":null,"path":"/IgorTest19","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/58894110?s=80&v=4"},"commit":{"message":"readme extended with 7 and 8 examples","shortMessageHtmlLink":"readme extended with 7 and 8 examples"}},{"before":"c74ab0fc3bb7720ed84b70d0bf26c7b46d34c8d5","after":"7eb5f8ea3da29197095c0bcb662ab63504aba6d4","ref":"refs/heads/main","pushedAt":"2023-10-19T19:48:28.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"IgorTest19","name":null,"path":"/IgorTest19","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/58894110?s=80&v=4"},"commit":{"message":"8 speakleash data visualization (#25)\n\n* Added examples - pandas/polars + dataset statistics visualization\r\n\r\n* Added README.md for example 1-1 and 6\r\n\r\n* renamed folder and changed readme files\r\n\r\n* modified example_8 readme file\r\n\r\n---------\r\n\r\nCo-authored-by: Szymon Baczyński ","shortMessageHtmlLink":"8 speakleash data visualization (#25)"}},{"before":"00bc45a3ee097f59b70c7cc7da0b832df741f583","after":"fc64dd7b1d90f8b045657ec9d055db79bf4fc4a6","ref":"refs/heads/8-speakleash-data-visualization","pushedAt":"2023-10-19T19:48:11.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"IgorTest19","name":null,"path":"/IgorTest19","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/58894110?s=80&v=4"},"commit":{"message":"modified example_8 readme file","shortMessageHtmlLink":"modified example_8 readme file"}},{"before":"3516556d023c80ce296a869d993dcabbea2d9c5b","after":"00bc45a3ee097f59b70c7cc7da0b832df741f583","ref":"refs/heads/8-speakleash-data-visualization","pushedAt":"2023-10-19T19:44:55.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"IgorTest19","name":null,"path":"/IgorTest19","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/58894110?s=80&v=4"},"commit":{"message":"renamed folder and changed readme files","shortMessageHtmlLink":"renamed folder and changed readme files"}},{"before":"a5e9443dcc6d6a19c13c782c111d4064caec94e7","after":"c74ab0fc3bb7720ed84b70d0bf26c7b46d34c8d5","ref":"refs/heads/main","pushedAt":"2023-10-18T21:51:27.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"IgorTest19","name":null,"path":"/IgorTest19","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/58894110?s=80&v=4"},"commit":{"message":"readme files modifications (#26)\n\n* readme files modifications\r\n\r\n* fix","shortMessageHtmlLink":"readme files modifications (#26)"}},{"before":"462f5daecbb1a01c0fa37b37ecbd100860c08fb3","after":"87a6f25d027cc4892093357814f480c5adc25e1f","ref":"refs/heads/readme_files_modifications","pushedAt":"2023-10-18T21:49:55.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"IgorTest19","name":null,"path":"/IgorTest19","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/58894110?s=80&v=4"},"commit":{"message":"fix","shortMessageHtmlLink":"fix"}},{"before":null,"after":"462f5daecbb1a01c0fa37b37ecbd100860c08fb3","ref":"refs/heads/readme_files_modifications","pushedAt":"2023-10-18T21:45:23.000Z","pushType":"branch_creation","commitsCount":0,"pusher":{"login":"IgorTest19","name":null,"path":"/IgorTest19","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/58894110?s=80&v=4"},"commit":{"message":"readme files modifications","shortMessageHtmlLink":"readme files modifications"}},{"before":"71b9f7590ad28ea2051b4fcf4763e15fdd0b6fbe","after":"a5e9443dcc6d6a19c13c782c111d4064caec94e7","ref":"refs/heads/main","pushedAt":"2023-10-18T21:35:17.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"IgorTest19","name":null,"path":"/IgorTest19","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/58894110?s=80&v=4"},"commit":{"message":"Feature/df generator/speak leash to df (#13)\n\n* Added example of creating pandas' dataframe\r\n\r\n* removed unnecessary comment line\r\n\r\n* Added example of creating pandas' dataframe\r\n\r\n* .html chart generator added\r\n\r\n* Docstring added, constants uppercased,\r\n\r\n* changed names, added descriptions\r\n\r\n* Added requirements and readme files and removed charts script\r\n\r\n* readme.md fix\r\n\r\n* readme.md fix\r\n\r\n---------\r\n\r\nCo-authored-by: Przemyslaw Boruta \r\nCo-authored-by: Krzysztof Kaczor \r\nCo-authored-by: IgorTest19 ","shortMessageHtmlLink":"Feature/df generator/speak leash to df (#13)"}},{"before":"2e7d1bddacab4a09525ca29bb5f2d9f50eb4c5e8","after":"3516556d023c80ce296a869d993dcabbea2d9c5b","ref":"refs/heads/8-speakleash-data-visualization","pushedAt":"2023-10-03T14:16:01.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"Samox1","name":"Szymon Baczyński","path":"/Samox1","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/19156114?s=80&v=4"},"commit":{"message":"Added README.md for example 1-1 and 6","shortMessageHtmlLink":"Added README.md for example 1-1 and 6"}},{"before":"6c89ad88e48261b944dbd1a8f57df485f10e1b2d","after":"71b9f7590ad28ea2051b4fcf4763e15fdd0b6fbe","ref":"refs/heads/main","pushedAt":"2023-10-01T20:24:45.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"IgorTest19","name":null,"path":"/IgorTest19","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/58894110?s=80&v=4"},"commit":{"message":"First slow version (#19)\n\n* First slow version\r\n\r\n* Create new files - 2 versions (mod. SpaCy + NLTK)\r\n\r\n* Changed default dataset (problems with 'form_symfonik' )\r\n\r\n* Text length limit (problem with memory allocation)\r\n\r\n* Add comments + dataset change + unified WordCloud visualization settings\r\n\r\n* Rename folder + add README.md + add info about installation (spaCy)\r\n\r\n* Changed to include suggestions & deleting an unnecessary folder\r\n\r\n---------\r\n\r\nCo-authored-by: Sebastian Kondracki \r\nCo-authored-by: Szymon Baczyński ","shortMessageHtmlLink":"First slow version (#19)"}},{"before":null,"after":"6c89ad88e48261b944dbd1a8f57df485f10e1b2d","ref":"refs/heads/9-incorrect-format-detection","pushedAt":"2023-10-01T14:40:05.000Z","pushType":"branch_creation","commitsCount":0,"pusher":{"login":"adamjot","name":"Adam Jędrusyna","path":"/adamjot","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/66069215?s=80&v=4"},"commit":{"message":"Fix encoding (#24)\n\nFixed lack of encoding in save_quality_docs method","shortMessageHtmlLink":"Fix encoding (#24)"}},{"before":"7dadcee37527afda5dc43fd2430dea48014606ed","after":"2e7d1bddacab4a09525ca29bb5f2d9f50eb4c5e8","ref":"refs/heads/8-speakleash-data-visualization","pushedAt":"2023-09-28T11:48:11.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"Samox1","name":"Szymon Baczyński","path":"/Samox1","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/19156114?s=80&v=4"},"commit":{"message":"Added examples - pandas/polars + dataset statistics visualization","shortMessageHtmlLink":"Added examples - pandas/polars + dataset statistics visualization"}},{"before":"8472826c2b63209619e8ddaa1f9e3d2ac78bc91d","after":"ad0293d07d981032f3ed6438d0c3eae844824e15","ref":"refs/heads/6-speakleash-and-word-clouds","pushedAt":"2023-09-20T09:22:58.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"Samox1","name":"Szymon Baczyński","path":"/Samox1","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/19156114?s=80&v=4"},"commit":{"message":"Changed to include suggestions & deleting an unnecessary folder","shortMessageHtmlLink":"Changed to include suggestions & deleting an unnecessary folder"}},{"before":"135d8ba534460dc12952ff2765cad5274b66198b","after":"6c89ad88e48261b944dbd1a8f57df485f10e1b2d","ref":"refs/heads/main","pushedAt":"2023-09-14T22:03:59.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"IgorTest19","name":null,"path":"/IgorTest19","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/58894110?s=80&v=4"},"commit":{"message":"Fix encoding (#24)\n\nFixed lack of encoding in save_quality_docs method","shortMessageHtmlLink":"Fix encoding (#24)"}},{"before":null,"after":"5d6bda990bb6442ae287d41aca99d3fcfff37941","ref":"refs/heads/IgorTest19-patch-2","pushedAt":"2023-09-14T21:31:03.000Z","pushType":"branch_creation","commitsCount":0,"pusher":{"login":"IgorTest19","name":null,"path":"/IgorTest19","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/58894110?s=80&v=4"},"commit":{"message":"Fix encoding\n\nFixed lack of encoding in save_quality_docs method","shortMessageHtmlLink":"Fix encoding"}},{"before":null,"after":"135d8ba534460dc12952ff2765cad5274b66198b","ref":"refs/heads/examples-readme-refactor-ideas","pushedAt":"2023-09-06T23:31:12.000Z","pushType":"branch_creation","commitsCount":0,"pusher":{"login":"Samox1","name":"Szymon Baczyński","path":"/Samox1","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/19156114?s=80&v=4"},"commit":{"message":"Update README.md (#23)","shortMessageHtmlLink":"Update README.md (#23)"}},{"before":"6649c778eac7aff270af3aa27fa61eae54af2e78","after":"8472826c2b63209619e8ddaa1f9e3d2ac78bc91d","ref":"refs/heads/6-speakleash-and-word-clouds","pushedAt":"2023-09-06T11:05:05.000Z","pushType":"push","commitsCount":2,"pusher":{"login":"Samox1","name":"Szymon Baczyński","path":"/Samox1","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/19156114?s=80&v=4"},"commit":{"message":"Rename folder + add README.md + add info about installation (spaCy)","shortMessageHtmlLink":"Rename folder + add README.md + add info about installation (spaCy)"}},{"before":"72a122d9898f1804e9c995a3db748c6eedfa5fa8","after":"135d8ba534460dc12952ff2765cad5274b66198b","ref":"refs/heads/main","pushedAt":"2023-09-03T20:32:22.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"IgorTest19","name":null,"path":"/IgorTest19","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/58894110?s=80&v=4"},"commit":{"message":"Update README.md (#23)","shortMessageHtmlLink":"Update README.md (#23)"}},{"before":null,"after":"e1ee370192c3c31363a7a1385a0cc2781c7a1135","ref":"refs/heads/IgorTest19-patch-1","pushedAt":"2023-09-03T19:54:37.000Z","pushType":"branch_creation","commitsCount":0,"pusher":{"login":"IgorTest19","name":null,"path":"/IgorTest19","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/58894110?s=80&v=4"},"commit":{"message":"Update README.md","shortMessageHtmlLink":"Update README.md"}},{"before":"7dadcee37527afda5dc43fd2430dea48014606ed","after":"72a122d9898f1804e9c995a3db748c6eedfa5fa8","ref":"refs/heads/main","pushedAt":"2023-09-03T19:52:19.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"IgorTest19","name":null,"path":"/IgorTest19","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/58894110?s=80&v=4"},"commit":{"message":"Add extraction of high-quality docs to text files (#17)\n\n* Add extraction of high-quality docs to text files\r\n\r\nThis commit introduces a new function save_high_quality_docs() in the extraction_to_txt_file.py. It saves high-quality documents from the Speakleash dataset to a specified folder, creating the necessary directories if they don't exist. The .gitignore file has been updated to exclude the output directory from the repository. This function helps in segregating high quality documents for further analysis.\r\n\r\n* Update extraction_to_txt_file script for versatility\r\n\r\nThe extraction_to_txt_file script has been revised for added versatility. A new module has been added providing functions to create necessary directories and save documents from the 'speakleash' dataset to a specific folder. The 'save_high_quality_docs' method was updated to 'save_quality_docs' to allow for selection of document quality. New usage information was also added for guidance on the script modification, importing and using the module's functions, running the script directly, and the output from the script execution. A 'get_data' function was also added to manage the creation of necessary directories. This update provides improved script functionality and versatile document retrieval.\r\n\r\n* Update docstring position in extraction_to_txt_file methods\r\n\r\nThe commit relocates the docstrings to be directly under the method definition, as per Python's standard practice, in the 'extraction_to_txt_file' script. These provide information about the functions 'get_data' and 'save_quality_docs'. This revision ensures that the method's purpose is clarified to the developers in the correct position as a part of Python's best practices. This doesn't affect functionality but improves script readability.\r\nRemoved .gitignore file from repo\r\n\r\n* Update docstring position in extraction_to_txt_file methods\r\n\r\nThe commit relocates the docstrings to be directly under the method definition, as per Python's standard practice, in the 'extraction_to_txt_file' script. These provide information about the functions 'get_data' and 'save_quality_docs'. This revision ensures that the method's purpose is clarified to the developers in the correct position as a part of Python's best practices. This doesn't affect functionality but improves script readability.\r\nRemoved .gitignore file from repo\r\n\r\n* Commit message:\r\n Relocate imports in extraction_to_txt_file.py\r\n\r\nThis commit moves import statements (os and speakleash) to after module-level docstring in extraction_to_txt_file.py.\r\n\r\n* Add README.md for \"extraction_to_files\" module, refactor related script, and move files to a separate folder.\r\n\r\n* Modified the readme.md and main files as suggested.\r\n\r\n* Update README.md\r\n\r\n---------\r\n\r\nCo-authored-by: marekgarbowski \r\nCo-authored-by: IgorTest19 <58894110+IgorTest19@users.noreply.github.com>","shortMessageHtmlLink":"Add extraction of high-quality docs to text files (#17)"}},{"before":"123600dee1d2be8dd9522bbd8c7eb43100d10fbe","after":"5ca9ad23f1fb8510deb46bfd3f5156c007529433","ref":"refs/heads/5-extraction-of-texts-quality=high-and-given-category-mg","pushedAt":"2023-09-03T19:51:46.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"IgorTest19","name":null,"path":"/IgorTest19","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/58894110?s=80&v=4"},"commit":{"message":"Update README.md","shortMessageHtmlLink":"Update README.md"}},{"before":"49ca0b70158ecfb4a13214963945129fb04c44b9","after":"123600dee1d2be8dd9522bbd8c7eb43100d10fbe","ref":"refs/heads/5-extraction-of-texts-quality=high-and-given-category-mg","pushedAt":"2023-09-03T19:48:24.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"MarekGarbowski","name":"Marek Garbowski","path":"/MarekGarbowski","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/71227920?s=80&v=4"},"commit":{"message":"Modified the readme.md and main files as suggested.","shortMessageHtmlLink":"Modified the readme.md and main files as suggested."}},{"before":"9e008d01dea38eb90f2cf85dded672321b266ad1","after":"49ca0b70158ecfb4a13214963945129fb04c44b9","ref":"refs/heads/5-extraction-of-texts-quality=high-and-given-category-mg","pushedAt":"2023-09-03T16:58:22.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"MarekGarbowski","name":"Marek Garbowski","path":"/MarekGarbowski","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/71227920?s=80&v=4"},"commit":{"message":"Add README.md for \"extraction_to_files\" module, refactor related script, and move files to a separate folder.","shortMessageHtmlLink":"Add README.md for \"extraction_to_files\" module, refactor related scri…"}},{"before":"3413507b572b619cfd6ba3892642d1d04dfb455a","after":"9e008d01dea38eb90f2cf85dded672321b266ad1","ref":"refs/heads/5-extraction-of-texts-quality=high-and-given-category-mg","pushedAt":"2023-09-03T11:26:00.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"MarekGarbowski","name":"Marek Garbowski","path":"/MarekGarbowski","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/71227920?s=80&v=4"},"commit":{"message":"Commit message:\n Relocate imports in extraction_to_txt_file.py\n\nThis commit moves import statements (os and speakleash) to after module-level docstring in extraction_to_txt_file.py.","shortMessageHtmlLink":"Commit message:"}},{"before":"98cf92c991810e0a339e98c859f44cbfbac68bdb","after":"3413507b572b619cfd6ba3892642d1d04dfb455a","ref":"refs/heads/5-extraction-of-texts-quality=high-and-given-category-mg","pushedAt":"2023-09-03T11:03:00.000Z","pushType":"push","commitsCount":2,"pusher":{"login":"MarekGarbowski","name":"Marek Garbowski","path":"/MarekGarbowski","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/71227920?s=80&v=4"},"commit":{"message":"Update docstring position in extraction_to_txt_file methods\n\nThe commit relocates the docstrings to be directly under the method definition, as per Python's standard practice, in the 'extraction_to_txt_file' script. These provide information about the functions 'get_data' and 'save_quality_docs'. This revision ensures that the method's purpose is clarified to the developers in the correct position as a part of Python's best practices. This doesn't affect functionality but improves script readability.\nRemoved .gitignore file from repo","shortMessageHtmlLink":"Update docstring position in extraction_to_txt_file methods"}},{"before":"f78573f9a3e08e637a93232ab9f038350a9e6e5e","after":"98cf92c991810e0a339e98c859f44cbfbac68bdb","ref":"refs/heads/5-extraction-of-texts-quality=high-and-given-category-mg","pushedAt":"2023-08-27T19:35:26.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"MarekGarbowski","name":"Marek Garbowski","path":"/MarekGarbowski","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/71227920?s=80&v=4"},"commit":{"message":"Update extraction_to_txt_file script for versatility\n\nThe extraction_to_txt_file script has been revised for added versatility. A new module has been added providing functions to create necessary directories and save documents from the 'speakleash' dataset to a specific folder. The 'save_high_quality_docs' method was updated to 'save_quality_docs' to allow for selection of document quality. New usage information was also added for guidance on the script modification, importing and using the module's functions, running the script directly, and the output from the script execution. A 'get_data' function was also added to manage the creation of necessary directories. This update provides improved script functionality and versatile document retrieval.","shortMessageHtmlLink":"Update extraction_to_txt_file script for versatility"}}],"hasNextPage":true,"hasPreviousPage":false,"activityType":"all","actor":null,"timePeriod":"all","sort":"DESC","perPage":30,"cursor":"Y3Vyc29yOnYyOpK7MjAyNC0wMy0wMlQxNjowNToxNi4wMDAwMDBazwAAAAQKlGTr","startCursor":"Y3Vyc29yOnYyOpK7MjAyNC0wMy0wMlQxNjowNToxNi4wMDAwMDBazwAAAAQKlGTr","endCursor":"Y3Vyc29yOnYyOpK7MjAyMy0wOC0yN1QxOTozNToyNi4wMDAwMDBazwAAAANzhZrv"}},"title":"Activity · speakleash/speakleash-examples"}