Getting Started

Prerequisites

Install Docker

Follow the official Docker installation guide for your operating system.

Install awslocal

This package provides the awslocal command, which is a thin wrapper around the AWS command line interface for use with LocalStack.

pip install "awscli-local[ver2]"

Install LocalStack

# macOS
brew install localstack/tap/localstack-cli

# PyPI (macOS, Windows, Linux)
python3 -m pip install localstack

Running the Application

Option 1: Using Docker Compose

# Build and run the containers
docker-compose build --no-cache
docker-compose up -d

# Note: You might need to manually restart Celery due to a known bug

Option 2: Running Services Individually

# Start Redis
docker run -d --name redis_container -p 6379:6379 redis/redis-stack-server

# Start PostgreSQL
docker run -d --name postgres-container \
    -e POSTGRES_DB=document_classifier \
    -e POSTGRES_USER=postgres \
    -e POSTGRES_PASSWORD=postgres \
    -p 5432:5432 postgres

# Additional services:
# - LocalStack: the CLI pulls and runs its Docker image; start it with: localstack start -d
# - Flask: can be started via the VSCode debugger
# - Celery: start a worker with: celery -A src.tasks worker --pool=solo --loglevel=info
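
Before starting Flask and Celery, it can help to confirm that Redis and PostgreSQL are accepting connections on the ports published by the docker run commands above. The following is a minimal sketch using only the Python standard library. The helper name `port_open` and the use of `localhost` are assumptions for illustration and are not part of the project.

```python
import socket


def port_open(host: str, port: int, timeout: float = 1.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unresolvable hosts.
        return False


if __name__ == "__main__":
    # Ports taken from the docker run commands above.
    services = {"Redis": 6379, "PostgreSQL": 5432}
    for name, port in services.items():
        state = "reachable" if port_open("localhost", port) else "not reachable"
        print(f"{name} on localhost:{port}: {state}")
```

An open port only shows that the container is listening; it does not prove that the database inside has finished initializing.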

Usage Notes

Development Recommendations

For testing, running the services individually is recommended over docker-compose:
- Changes and logs are easier to see
- Potential LocalStack issues under Docker are avoided
- The project has been tested and works when run locally

Running the Application

1. Start the services using either method above.
2. Access the application:
   - Docker Compose: localhost:8000
   - VSCode debugger: custom port (e.g., Flask default 5000)
3. Send requests to /classify_file with a file payload.
4. Monitor progress:
   - Watch the Celery terminal output
   - Poll the /task_status/{task_id} endpoint
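
The steps above can be sketched as a small client that uploads a file and then polls for the result. This is a hedged illustration, not the project's own client. It assumes the upload is a POST read from a form field named `file`, that the reply is JSON with a `task_id` key, and that the status reply is JSON with a `status` key holding a Celery state name. None of these details are confirmed by this README, so adjust them to match the actual endpoints.

```python
import json
import os
import time
import urllib.request
import uuid

BASE_URL = "http://localhost:8000"  # Docker Compose port from the steps above


def build_multipart(field: str, filename: str, data: bytes) -> tuple[bytes, str]:
    """Encode one file as multipart/form-data; return (body, content type)."""
    boundary = uuid.uuid4().hex
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="{field}"; filename="{filename}"\r\n'
        "Content-Type: application/octet-stream\r\n\r\n"
    ).encode()
    tail = f"\r\n--{boundary}--\r\n".encode()
    return head + data + tail, f"multipart/form-data; boundary={boundary}"


def submit_file(path: str) -> str:
    """POST the file to /classify_file and return the task id from the reply."""
    with open(path, "rb") as fh:
        # ASSUMPTION: the endpoint reads the upload from a form field named "file".
        body, content_type = build_multipart("file", os.path.basename(path), fh.read())
    req = urllib.request.Request(
        f"{BASE_URL}/classify_file",
        data=body,
        headers={"Content-Type": content_type},
        method="POST",
    )
    with urllib.request.urlopen(req) as resp:
        # ASSUMPTION: the JSON reply carries the id under "task_id".
        return json.load(resp)["task_id"]


def fetch_status(task_id: str) -> dict:
    """GET /task_status/{task_id} and decode the JSON reply."""
    with urllib.request.urlopen(f"{BASE_URL}/task_status/{task_id}") as resp:
        return json.load(resp)


def poll(task_id, fetch=fetch_status, interval=2.0, max_attempts=30,
         pending=("PENDING", "STARTED", "RETRY")):
    """Call fetch(task_id) until the status leaves the pending set."""
    for _ in range(max_attempts):
        reply = fetch(task_id)
        # ASSUMPTION: the reply has a "status" key with a Celery state name.
        if reply.get("status") not in pending:
            return reply
        time.sleep(interval)
    raise TimeoutError(f"task {task_id} still pending after {max_attempts} polls")
```

With the services running, a call such as `poll(submit_file("files/example.pdf"))` would upload the file and block until the task finishes; the file name here is hypothetical. When running Flask from the VSCode debugger, change `BASE_URL` to the port in use.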

Architecture Notes

The project uses Celery+Redis for the following reasons:

- Requests are accepted immediately and stored in the Redis broker
- Workers process them asynchronously
- The /classify_file endpoint does not block
- Results of long-running tasks can be checked at regular intervals
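
The accept-then-process pattern described above can be illustrated without Celery or Redis. The sketch below is a deliberately simplified stand-in built on the Python standard library: a `queue.Queue` plays the role of the Redis broker, a thread plays the role of a Celery worker, and a dict plays the role of the result backend. All names (`submit`, `task_status`, `classify`, `start_worker`) are hypothetical and do not come from the project's source.

```python
import queue
import threading
import time
import uuid

broker = queue.Queue()          # stands in for the Redis broker
results = {}                    # stands in for the result backend
results_lock = threading.Lock()


def classify(payload: bytes) -> str:
    """Placeholder for the slow classification step."""
    time.sleep(0.1)
    return f"classified {len(payload)} bytes"


def submit(payload: bytes) -> str:
    """Accept a request immediately: record it, enqueue it, return a task id."""
    task_id = uuid.uuid4().hex
    with results_lock:
        results[task_id] = {"status": "PENDING"}
    broker.put((task_id, payload))
    return task_id


def task_status(task_id: str) -> dict:
    """Return a copy of the current state of a task."""
    with results_lock:
        return dict(results.get(task_id, {"status": "UNKNOWN"}))


def _worker_loop() -> None:
    """Process queued tasks one at a time, like a single worker process."""
    while True:
        task_id, payload = broker.get()
        outcome = classify(payload)
        with results_lock:
            results[task_id] = {"status": "SUCCESS", "result": outcome}
        broker.task_done()


def start_worker() -> threading.Thread:
    """Start a background worker thread and return it."""
    thread = threading.Thread(target=_worker_loop, daemon=True)
    thread.start()
    return thread


if __name__ == "__main__":
    start_worker()
    tid = submit(b"example document")  # returns without waiting for classify()
    print(task_status(tid))            # the task is normally still pending here
    broker.join()                      # wait until the worker has finished
    print(task_status(tid))
```

The point of the sketch is that `submit` returns before `classify` runs, which is what keeps the endpoint responsive. In the real project the broker and workers live in separate processes, so the queue survives a restart of the web server.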

Potential Improvements

Reduce task duration by:
- Allocating more resources (CPU/GPU)
- Using models trained for specific document types

These options can be discussed based on company needs.

Note: This solution demonstrates asynchronous processing capabilities while maintaining system responsiveness. Thank you for the opportunity!

onurbaskin/join-the-siege