A web scraper tool for extracting Names from a Webpage using JSoup and Apache OpenNLP libraries.
To run this project, you will need to have the following software installed on your system:
- Java 8 or higher
- Maven
To build and run this project, follow these steps:
- Clone the repository to your local machine:
git clone https://https://github.com/Zeesky-code/Name-Finder.git
- Build the project using Maven:
mvn package
- Run the project:
java -jar target/bim207hw.jar <url>
To use this project, you will need to provide the URL of the webpage that you want to extract names from. The program will then output a list of the names that it was able to extract.
JSoup - A Java library for parsing HTML Apache OpenNLP - A machine learning-based toolkit for processing natural language text
If you'd like to contribute to this project, please fork the repository and make your changes in a separate branch. Once you've tested your changes and are ready to submit them, create a pull request and we'll review your contribution.
Feel free to check out my article on this project: https://zeeskylaw.hashnode.dev/extracting-names-from-a-webpage-using-jsoup-and-apache-opennlp-in-java