CDS: NLP Research Team

Team Lead: Kenta Takatsu (CS '19)
Advisor: Prof. Thorsten Joachims

About Us

We are a student-led research team from Cornell Data Science (CDS), working on Natural Language Processing projects under Prof. Thorsten Joachims. This semester, we are participating in the Yelp Dataset Challenge to provide analytic insights from raw review texts. Our final products are research papers which makes use of machine learning algorithms and statistical validations. You can visit the subteam sections to see our individual work.

Achivements

This past semester, we had a wide range of research topics, from recommendation system to deep style transfer. In general, we took the approach called Natural Language Processing -- an interaction between machine learning and text analysis.

All researches demonstrated remarkable results; an implementation of recommendation system that beats industry standard algorithm, an accurate analytic tool to assess business trends, a classifier to identify locally popular users, and a writing style transfer with deep learning.

Subteams

Extracting Rating Dimensions with Text Reviews

Members: Xuwen Shen (STAT '18), Xinzhe Yang (CS '20)
In order to give insights to overall ratings and then create a new personalized recommendation system based on the rating that account for his or her preferences, we were hoping to extract hidden information in reviews including an individual user’s preference and a business’s properties (scores for each feature of the business). Finally, we created a model combining the topics and overall ratings to get a personalized ratings for a specific user.
Topic Modeling as a Trend-Aware Performance Metric

Members: Kenta Takatsu (CS '19), Caroline Chang (CS '20)
We are developing a stream-lined star-prediction system to better assess business performance using different types of classifiers, which accounts for the temporal trends in user review topics and the strength/weakness of business characteristics in latent space.
Local Experts in Yelp

Members: Brandon Kates (BTRY '19), Brian Cheang (CS '20)
The objective of the project is to build and combine two models (Local Expert Identifier / Topical Expert Identifier) for the purpose of identifying 'experts' among yelp users.
Neural Style Transfer For Text

Members: Luca Leeser (INFO '18), Yuji Akimoto (ORIE '19), Ryan Butler (CS '19), Cameron Ibrahim (ORIE '20)
We are seeking to modify the neural style transfer algorithm proposed by Gatys et. al. to make it applicable to text. Our goal is to devise an algorithm that is able to transfer the writing style of one review onto the content of another.

Final Submissions

You can visit our final papers from the following links:

How to get the code

The code uses git submodules, so to properly intialize those you need the --recurse-submodules option. Additionally, using --depth 1 will avoid cloning the history, making the clone faster.

git clone --recurse-submodules --depth 1 https://github.com/CornellDataScience/Yelp-FA17.git

Name		Name	Last commit message	Last commit date
Latest commit History 611 Commits
dl_style_transfer		dl_style_transfer
img		img
latent_variable		latent_variable
local-experts		local-experts
topic_over_time		topic_over_time
.gitattributes		.gitattributes
.gitignore		.gitignore
.gitmodules		.gitmodules
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CDS: NLP Research Team

About Us

Achivements

Subteams

Extracting Rating Dimensions with Text Reviews

Topic Modeling as a Trend-Aware Performance Metric

Local Experts in Yelp

Neural Style Transfer For Text

Final Submissions

How to get the code

About

Releases

Packages

Contributors 12

Languages

CornellDataScience/NLP_Research-FA17

Folders and files

Latest commit

History

Repository files navigation

CDS: NLP Research Team

About Us

Achivements

Subteams

Extracting Rating Dimensions with Text Reviews

Topic Modeling as a Trend-Aware Performance Metric

Local Experts in Yelp

Neural Style Transfer For Text

Final Submissions

How to get the code

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 12

Languages

Packages