Skip to content

A web scraper tool to extract names from a web page - Public ESTU Coursework

Notifications You must be signed in to change notification settings

Zeesky-code/Name-Finder

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Name-Finder

A web scraper tool for extracting Names from a Webpage using JSoup and Apache OpenNLP libraries.

Prerequisites

To run this project, you will need to have the following software installed on your system:

  • Java 8 or higher
  • Maven

Installing

To build and run this project, follow these steps:

  • Clone the repository to your local machine:

git clone https://https://github.com/Zeesky-code/Name-Finder.git

  • Build the project using Maven:

mvn package

  • Run the project:

java -jar target/bim207hw.jar <url>

Usage

To use this project, you will need to provide the URL of the webpage that you want to extract names from. The program will then output a list of the names that it was able to extract.

Built With

JSoup - A Java library for parsing HTML Apache OpenNLP - A machine learning-based toolkit for processing natural language text

Contributing

If you'd like to contribute to this project, please fork the repository and make your changes in a separate branch. Once you've tested your changes and are ready to submit them, create a pull request and we'll review your contribution.

Feel free to check out my article on this project: https://zeeskylaw.hashnode.dev/extracting-names-from-a-webpage-using-jsoup-and-apache-opennlp-in-java

About

A web scraper tool to extract names from a web page - Public ESTU Coursework

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages