Spring Boot Batch Processing: Dumping CSV Data into Database

This Spring Boot application facilitates batch processing to import data from a CSV file into a database. The process involves reading data from a CSV file, processing it, and persisting it into the database.

Features

CSV Data Import

Source Data: The application reads data from a CSV file named customers.csv.
File Location: The CSV file is located in the src/main/resources directory of the project.
Data Structure: The CSV file contains customer information such as ID, first name, last name, email, gender, contact number, country, and date of birth.

Chunk Processing

Chunk Size: Data processing is performed in chunks of 10 records.
Efficiency: Chunk processing enhances the application's efficiency, particularly when dealing with large datasets.
Transaction Management: Each chunk of data is processed within a transaction to ensure data consistency.

Parallel Processing

Task Executor: The application utilizes a customized task executor to enable parallel processing of chunks.
Concurrency: The task executor is configured with a concurrency limit of 10 threads.
Improved Performance: Parallel processing enhances throughput and performance, allowing for faster data processing.

Database Persistence

Data Persistence: Processed data is persisted into a database using Spring Data JPA and Hibernate.
Repository: The CustomerRepository interface, extending JpaRepository, is used to interact with the database and perform CRUD operations on customer entities.
Automatic Schema Generation: Hibernate automatically generates database schema based on the entity mappings defined in the application.

Spring Data JPA

JPA Repository: Spring Data JPA provides a powerful repository abstraction layer on top of JPA, simplifying database access and reducing boilerplate code.
Entity-Repository Association: Each entity in the application is associated with a corresponding repository interface, allowing for easy database operations.

Prerequisites

Java Development Kit (JDK): Ensure that JDK 8 or higher is installed on your system.
Apache Maven: Maven is required to build and run the Spring Boot application.
Database: Install and configure MySQL or another compatible database server. Ensure that the database server is running locally.

Usage

Clone Repository: Clone the repository to your local machine using the following command:
```
git clone <https://github.com/Paras2322/SpringBatch-CSV-to-DB.git>
```
Navigate to Project Directory: Change to the project directory using the following command:
```
cd spring-boot-batch-processing
```
Place CSV File: Ensure that the customers.csv file containing the data to be imported is placed in the src/main/resources directory of the project.
Run Application: Execute the application using Maven with the following command:
```
mvn spring-boot:run
```
Test : Through Postman Run
Monitor Progress: Monitor the console logs for batch processing status and any errors encountered during execution.

Customized Task Executor

In this application, the task executor is customized to allow parallel processing of chunks. The following configuration is used to define the task executor bean:

@Bean
public TaskExecutor taskExecutor(){
    SimpleAsyncTaskExecutor asyncTaskExecutor = new SimpleAsyncTaskExecutor();
    asyncTaskExecutor.setConcurrencyLimit(10);
    return asyncTaskExecutor;
}

Contributions

Contributions to this project are welcome! If you have any suggestions, enhancements, or bug fixes, feel free to open an issue or submit a pull request.

Additional Information

Official Documentation: Refer to the official documentation for more information on Spring Batch framework.
Customization: Customize batch processing logic, chunk size, and task executor configuration as per your specific requirements.
Maintenance: Regularly monitor batch job execution and database performance for optimal operation of the application.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.mvn/wrapper		.mvn/wrapper
src		src
.gitignore		.gitignore
README.md		README.md
mvnw		mvnw
mvnw.cmd		mvnw.cmd
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Spring Boot Batch Processing: Dumping CSV Data into Database

Features

CSV Data Import

Chunk Processing

Parallel Processing

Database Persistence

Spring Data JPA

Prerequisites

Usage

Customized Task Executor

Contributions

Additional Information

About

Releases

Packages

Languages

Subhashchandra-Birajdar/SpringBatch_CSV_to_DB

Folders and files

Latest commit

History

Repository files navigation

Spring Boot Batch Processing: Dumping CSV Data into Database

Features

CSV Data Import

Chunk Processing

Parallel Processing

Database Persistence

Spring Data JPA

Prerequisites

Usage

Customized Task Executor

Contributions

Additional Information

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages