The NLP Research Visualizer is a research assistance tool, aimed to offer a streamlined and efficient way to identify trending topics in scientific and academic research. This tool provides visual analytics of key trends and hot topics in specific research domains by parsing through multiple academic databases.
Before you can use this software, you need to install Python on your computer. If you already have Python installed, you can skip to "Step 1: Download the Project".
- Download the latest version of Python from the official website: https://www.python.org/downloads/windows/
- Run the installer. In the first screen of the installation wizard, check the box that says "Add Python to PATH", then click "Install Now".
To utilize X-Force NLP Visualizer, the first step involves obtaining a copy of the project files. This can be achieved in two ways: by cloning the project repository or by downloading and extracting a .zip file of the project. Here are the steps for both methods:
Option 1: Clone the Repository:
If you have git
installed on your system, you can clone the repository directly. Cloning a repository creates a local copy of the project on your machine, maintaining the repository structure. To clone the repository, open your terminal or command line interface, navigate to the directory where you want to place the project, and run the following command:
git clone https://github.com/kkakdugee/x-force.git
Option 2: Download the ZIP File:
If you don't have git
installed or if you prefer to download the files manually, you can do so by clicking the Code
button on the repository page and then clicking Download ZIP
. After downloading the ZIP file, extract its contents into your desired directory. Remember, the directory where you extract or clone the project files will be referred to as the 'root directory' for the project.
Locate proxies.txt
in the project root folder and input your proxies. Here is an example below:
Note: The first line is your HTTP proxy and the second line is your HTTPS proxy.
Locate .env
in the project root folder and input your proxies. Here is an example below:
Locate the batch file in the root directory of the project (it should be named run.bat
). Run this file by double-clicking it. This executable file will set your proxies, verify your Python installation, handle the installation of necessary dependencies automatically, and start the NLP Research Visualizer software tool.
This section will guide you through the process of using X-Force NLP Visualizer's features. It provides a detailed walkthrough of updating & analyzing the database, as well as graph configurations and feedback/support tickets.
- Database: Allows you to load, configure, and update a database.
- Analyzing the Database: Enables you to analyze the trends in your research domain based on the updated database.
- Graph Configurations: Provides various configuration options to customize your trend graphs.
- Feedback/Help: Offers a feedback system to help improve your user experience and aid in troubleshooting.
-
Database Preview: A preview of the current state of your database is displayed.
-
Selecting & Creating a Database: This tab allows you to load a pre-existing Database, or create a new Database.
- Updating the Database: This tab allows you to add entries by query to your selected Database.
- Filtering the Database: This tab allows you to filter your selected Database.
The "Analyze Database" functionality offers six types of visualizations, each designed to help you better understand the distribution and frequency of specific keywords within your databases. These visualizations include:
-
Database Summary: The "Database Summary" generates a graph showing the distribution of articles among the Scopus, arXiv, and both databases.
-
Keyword Frequency: The "Keyword Frequency" visualization generates a graph depicting the frequency of your filtered queries from the selected database.
-
Publish Frequency: The "Publish Frequency" visualization creates a graph showing the publishing frequency of your filtered queries from the selected database.
-
Text Frequency: The "Text Frequency" visualization generates a graph showing the text frequency (e.g., character/word count of title/abstract) of filtered queries from the selected database.
-
Network Co-occurrence: The "Network Co-occurrence" visualization generates a graph that displays the relationships between different keywords or terms based on their co-occurrence within the same documents.
-
Bubble Map: The "Bubble Map" visualization generates a graph where data is represented as bubbles or circles. The size of the bubbles typically corresponds to the frequency or importance of a keyword, indicating the density of that term within the dataset.
The "Graph Configurations" functionality allows you to personalize the aesthetics of your visualization graphs. Here's a rundown of the different customization options:
-
Name Your Configuration: Specify a custom name for your configuration to easily identify it later.
-
Font Settings:
- Font Type: Select your preferred font type from 'Arial', 'Times New Roman', or 'Calibri'.
- Font Size: Use the slider to choose the size of your font, with a range from 8 to 20.
-
Figure Size:
- Figure Width: Use the slider to adjust the width of your figure, with a range from 3.5 to 10.0.
- Figure Height: Use the slider to adjust the height of your figure, with a range from 2.75 to 10.0.
After customizing, click the 'Save Configuration' button. This will save your configuration for later use.
The "Load Configuration" option allows you to upload a pre-saved configuration file (.rc/.matplotlibrc
format). You can also view the details of your current configuration.
-
Upload Configuration: Upload a previously saved configuration file.
-
Current Configuration: The system will display whether you're using the default configuration or a custom one. You can also view the specific details of your current configuration.
The Feedback/Help section contains a simple form for you to fill out.
-
Name: Enter your name.
-
Email: Enter your email address where we can reach you for follow-up communications.
-
Select Type: Use the dropdown menu to select the nature of your communication. You can choose between "Help" if you need assistance with the tool, or "Feedback" if you want to provide comments or suggestions for improvement.
-
Describe your suggestion/needed assistance: In this text area, provide a detailed description of your feedback or the help you require. If you're reporting an issue, try to include as much information about what you were doing when the problem occurred and any error messages you saw.
-
GitHub Token: Enter your GitHub Personal Access Token.
Click the "Submit" button to send us your feedback or help request. The tool automatically creates an issue in our GitHub repository for each submitted feedback or help request, allowing us to track and respond to your needs effectively.
Note: A GitHub Personal Access Token can be created here. Under Repository access, select Only select repositories, and select this repository (x-force). For permissions, please only enable access for Issues. For the access level, select Read and write.