Skip to content

Commit

Permalink
Update documentation
Browse files Browse the repository at this point in the history
  • Loading branch information
yutongo committed Dec 1, 2023
1 parent e98c336 commit 9836de6
Show file tree
Hide file tree
Showing 10 changed files with 1,255 additions and 7 deletions.
580 changes: 580 additions & 0 deletions Course Project/Possible Extra Topics.html

Large diffs are not rendered by default.

616 changes: 616 additions & 0 deletions Course Project/Project Instructions.html

Large diffs are not rendered by default.

1 change: 1 addition & 0 deletions _sources/Course Project/Possible Extra Topics.ipynb
Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
{"cells":[{"cell_type":"markdown","metadata":{"cell_id":"6b95e587565340cf9414c2f317e691b9","deepnote_cell_type":"markdown"},"source":"# Possible extra topics\n\nOne of the rubric items for the course project is to include something \"extra\" that wasn't covered in Math 10. Here are a few possibilities. It's even better if you find your own extra topic; it can be anything in Python that interests you.\n","block_group":"160748ef54654f7aa0f03fdcaa8fdf4f"},{"cell_type":"markdown","metadata":{"cell_id":"a71c4fda99cc49ef82d84e716a3501e7","deepnote_cell_type":"markdown"},"source":"## K-Nearest Neighbors\nAn understandble supervised machine learning model is [K-Nearest Neighbors](https://www.ibm.com/topics/knn#:~:text=The%20k%2Dnearest%20neighbors%20algorithm%2C%20also%20known%20as%20KNN%20or,of%20an%20individual%20data%20point.). It can be used for classification or regression, but is typically used for classification. This topic also provides a good example of the potential for overfitting (when a small number of neighbors is used). There is some information about this topic in the course notes by Christopher Davis from [Winter 2022](https://christopherdavisuci.github.io/UCI-Math-10-W22/Week6/Week6-Wednesday.html).","block_group":"c101529d9245489fbd70233b1d32d541"},{"cell_type":"markdown","metadata":{"cell_id":"f31cf1e7f88a4d919e883523b5e6d129","deepnote_cell_type":"markdown"},"source":"## Neural Networks\nThese are a fundamental (maybe the most fundamental) area of modern Machine Learning. If you want to try learning about them, that would be a great extra topic. This [3Blue1Brown video](https://www.youtube.com/watch?v=aircAruvnKk) is a great introduction. For the interactive visualization and exploration of neural networks, you can refer [here](https://playground.tensorflow.org/#activation=tanh&batchSize=10&dataset=circle&regDataset=reg-plane&learningRate=NaN&regularizationRate=0&noise=0&networkShape=4,2&seed=0.45692&showTestData=false&discretize=false&percTrainData=50&x=true&y=true&xTimesY=false&xSquared=false&ySquared=false&cosX=false&sinX=false&cosY=false&sinY=false&collectStats=false&problem=classification&initZero=false&hideText=false).","block_group":"ea8df04b4d914f3e947372dc81dc8bca"},{"cell_type":"markdown","metadata":{"cell_id":"d92efcb28582429691defce8675b317f","deepnote_cell_type":"markdown"},"source":"## Choosing parameters\n[scikit-learn user guide](https://scikit-learn.org/stable/tutorial/statistical_inference/model_selection.html). A Machine Learning topic I would like to understand better is how to choose parameters (for example, the number of clusters when doing clustering, or the depth of a decision tree). That link provides some guidance, but it is a big topic and there are many different approaches.","block_group":"4a02bc2c27d14631ac9dadaecacdc8bf"},{"cell_type":"markdown","metadata":{"cell_id":"6c927fc3d0f14a4bbf44757a6592e977","deepnote_cell_type":"markdown"},"source":"## pandas styler\n\n<img src='https://ybaki97.github.io/Math10-SSII23/_images/panda-sty.png' width=\"400\" height=\"400\">\n\nSee these examples in the [pandas documentation](https://pandas.pydata.org/pandas-docs/stable/user_guide/style.html#Styler-Functions). This provides a way to highlight certain cells in a pandas DataFrame, and is good practice using `apply` and `applymap`.","block_group":"5f40f167046c426da9944d57efca168d"},{"cell_type":"markdown","metadata":{"cell_id":"3216f428f82d4aca9fa03a59f653bbf3","deepnote_cell_type":"markdown"},"source":"## Kaggle\n\nA general way to get ideas is to browse [Kaggle](https://www.kaggle.com/). Go to a competition or dataset you find interesting, and then click on the *Code* tab near the top. You will reach a page like this one about [Fashion-MNIST](https://www.kaggle.com/zalando-research/fashionmnist/code). Any one of these notebooks is likely to contain many possibilities for extra topics.\n","block_group":"2336fd12e9d84416a1233188aad2c981"},{"cell_type":"markdown","metadata":{"cell_id":"c029d04fee474799b13afc5b267b2ae8","deepnote_cell_type":"markdown"},"source":"## Big Data(sets)\n\nDeepnote does not allow files bigger than 100mb to be uploaded. Many real-world datasets are bigger than this. Deepnote does definitely work with larger datasets. If you end up using a larger dataset, describe how you made it work in Deepnote. Some general guidelines are listed in the [Deepnote documentation](https://docs.deepnote.com/importing-and-exporing/importing-data-to-deepnote#uploading-files-to-deepnote).","block_group":"1229edf49892475999647aa5b098edb7"},{"cell_type":"markdown","metadata":{"cell_id":"9014efe75b6f4726ba99579619c327bb","deepnote_cell_type":"markdown"},"source":"## Different Python libraries\n\nIf you want to use a Python library that isn't by default installed in Deepnote, you can install it yourself within Deepnote, using a line of code like the following, which installs the `vega_datasets` library. Notice the exclamation point at the beginning (which probably won't appear in the documentation you find for the library).\n```\n!pip install vega_datasets\n```","block_group":"ecae296145ff43e997ba98e27a80204a"},{"cell_type":"markdown","metadata":{"cell_id":"6395a6d6c40148e9a5d23c81a9b9546b","deepnote_cell_type":"markdown"},"source":"## Other libraries\nHere are a few other libraries that you might find interesting. (Most of these are already installed in Deepnote.)\n* [sympy](https://www.sympy.org/en/index.html) for symbolic computation, like what you did in Math 9 using Mathematica.\n* [Pillow](https://pillow.readthedocs.io/en/stable/index.html) for image processing.\n* [re](https://docs.python.org/3/library/re.html) for advanced string methods using regular expressions.\n* [Seaborn](https://seaborn.pydata.org/) and [Plotly](https://plotly.com/python/plotly-express/). We introduced these plotting libraries briefly together with Altair early in the quarter, and we have used Seaborn frequently for importing datasets. Their syntax is similar to Altair.\n* [ipywidgets](https://ipywidgets.readthedocs.io/en/latest/) provides a way to add interactivity to a Jupyter notebook, but last I checked, not all of it works in Deepnote.","block_group":"9a558345bac34ac9afca15ff9a90e610"},{"cell_type":"markdown","metadata":{"cell_id":"4ebaa6d27c80475597ef3a01f9ecc7c5","deepnote_cell_type":"markdown"},"source":"## ChatGPT\nYou could get help writing your project from ChatGPT, documenting along the way how the process is working. That would be interesting, just keep it clear what is your work and what is provided by ChatGPT.","block_group":"27238cc53dc54521a4bca0cad05547c5"},{"cell_type":"markdown","source":"<a style='text-decoration:none;line-height:16px;display:flex;color:#5B5B62;padding:10px;justify-content:end;' href='https://deepnote.com?utm_source=created-in-deepnote-cell&projectId=bfd880e7-cf64-49b0-90ad-d84e99c71d5c' target=\"_blank\">\n<img alt='Created in deepnote.com' style='display:inline;max-height:16px;margin:0px;margin-right:7.5px;' src='' > </img>\nCreated in <span style='font-weight:600;margin-left:4px;'>Deepnote</span></a>","metadata":{"created_in_deepnote_cell":true,"deepnote_cell_type":"markdown"}}],"nbformat":4,"nbformat_minor":0,"metadata":{"deepnote":{},"orig_nbformat":2,"deepnote_notebook_id":"f10e80edae7447ed9aa6ee2b1c1a0441","deepnote_execution_queue":[]}}
Loading

0 comments on commit 9836de6

Please sign in to comment.