New REST Endpoints for Content Import Job Management #30550

jgambarios · 2024-11-01T17:05:38Z

Parent Issue

Task

We need to create REST endpoints to interact with the new com.dotcms.jobs.business.processor.impl.ImportContentletsProcessor to manage content import operations through a job queue system.

Objectives

Implement REST endpoints for content import job management
Provide real-time monitoring capabilities
Enable comprehensive validation and error handling

REST Endpoints Specification

Create Import Job

Endpoint: POST /content/import
Purpose: Creates and enqueues a new content import job
Returns: Job identifier

List Import Jobs

Endpoint: GET /content/import
Parameters:
- page: Page number
- size: Items per page
- status: Filter by job status
Purpose: Lists all enqueued jobs with pagination

Validate Import

Endpoint: POST /content/import/validate
Purpose: Performs import validation without actual import
Parameters: Same as content import endpoint

Get Job Status

Endpoint: GET /content/import/{jobId}
Parameters: jobId
Returns: Job state, progress percentage, executing node, etc

Cancel Job

Endpoint: POST /content/import/{jobId}/cancel
Parameters: jobId
Purpose: Cancels a running import job

Monitor Job

Endpoint: GET /content/import/{jobId}/monitor
Parameters: jobId
Type: Server-Sent Events (SSE)
Purpose: Real-time job status monitoring

Technical Requirements

Import Features:

CSV file upload support
Content type specification
Content relationships handling
Multilingual content support
Error handling
Comprehensive data validation

Error Handling:

Detailed error reporting
Validation-only mode

Proposed Objective

Core Features

Proposed Priority

Priority 2 - Important

Acceptance Criteria

All endpoints return appropriate HTTP status codes
Job queue properly manages concurrent imports
SSE endpoint provides real-time updates
Error messages are clear and actionable
Import validation provides comprehensive checks
Job cancellation effectively stops processing
Multilingual content is properly handled
Content relationships are maintained

Tasks

Give feedback

[Content Import Job Management] Implement the Create Import Job REST endpoint #30669

Doc : Needs Doc Merged QA : Passed Internal Release : 24.12.05 Team : Scout Type : Task
[Content Import Job Management] Implement the Content Import Validate Job REST endpoint #30771

Doc : Needs Doc Merged OKR : Core Features Priority : 2 High QA : Passed Internal Release : 24.12.05 Team : Scout Type : Task
[Content Import Job Management] Implement the getJobStatus REST endpoint #30791

Doc : Needs Doc Merged QA : Passed Internal Release : 24.12.10 Team : Scout Type : Task
Rest API: Implement missing endpoints for Content Import Job Resource #30874

Doc : Needs Doc Merged QA : Passed Internal Release : 24.12.10 Team : Scout Type : Task
Options

The text was updated successfully, but these errors were encountered:

fmontes · 2024-11-04T18:30:36Z

Thanks for the detailed user story! I have a few comments to ensure we align with the comprehensive goals outlined in our Import/Export API documentation:

Multilingual and Relationship Handling: Ensure that we have clear implementation details on how to handle multilingual content and relationships.
Dry-Run Feature: It looks like the POST /content/import/validate endpoint, which acts as a dry-run mode, is not explicitly mentioned.
Binary Content Consideration: Double-check that binary content handling is covered. Users need to reference existing binary files via paths in the CSV, so make sure this is clear in both implementation and documentation.
Performance Expectations: Can we confirm that the system can handle importing at least 100,000 items within one hour? is that a realistic goal?

jgambarios · 2024-11-04T19:10:53Z

@fmontes

1. Multilingual and Relationship Handling: The current import process already handles multilingual and relationships, so, this is covered.

2. Dry-Run Feature: Correct, the dry-run (or "Preview" in our current import) feature will run under the POST /content/import/validate endpoint, we can always change the name of the endpoint for clarity if we want.

3. Binary Content Consideration: Same as number 1, as we are "sharing" the current import logic in our new import processor, the binary case is handled, like you said, the binary must already exist in dotCMS and it is referenced by path.

4. Performance Expectations: This is something we need to investigate, during my local testing I imported 30,000 contentlets in like 6mins, but, this is locally and using a very simple content type, so I would need to run tests to provide more realistic numbers.

jgambarios added Triage Type : Task labels Nov 1, 2024

jgambarios added this to dotCMS - Product Planning Nov 1, 2024

github-project-automation bot moved this to New in dotCMS - Product Planning Nov 1, 2024

jgambarios added the Team : Scout label Nov 1, 2024

jgambarios mentioned this issue Nov 1, 2024

Implementation of the REST Endpoint and Job Queue Abstraction #29474

Open

nollymar moved this from New to Next 1-3 Sprints in dotCMS - Product Planning Nov 1, 2024

nollymar added Epic and removed Triage Type : Task labels Nov 1, 2024

valentinogiardino mentioned this issue Nov 15, 2024

[Content Import Job Management] Implement the Create Import Job REST endpoint #30669

Closed

6 tasks

This was referenced Nov 26, 2024

[Content Import Job Management] Implement the Content Import Validate Job REST endpoint #30771

Closed

[Content Import Job Management] Implement the getJobStatus REST endpoint #30791

Closed

valentinogiardino mentioned this issue Dec 6, 2024

Rest API: Implement missing endpoints for Content Import Job Resource #30874

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

New REST Endpoints for Content Import Job Management #30550

New REST Endpoints for Content Import Job Management #30550

jgambarios commented Nov 1, 2024 •

edited by valentinogiardino

Loading

Tasks

fmontes commented Nov 4, 2024

jgambarios commented Nov 4, 2024 •

edited

Loading

New REST Endpoints for Content Import Job Management #30550

New REST Endpoints for Content Import Job Management #30550

Comments

jgambarios commented Nov 1, 2024 • edited by valentinogiardino Loading

Parent Issue

Task

Objectives

REST Endpoints Specification

Technical Requirements

Proposed Objective

Proposed Priority

Acceptance Criteria

Tasks

fmontes commented Nov 4, 2024

jgambarios commented Nov 4, 2024 • edited Loading

jgambarios commented Nov 1, 2024 •

edited by valentinogiardino

Loading

jgambarios commented Nov 4, 2024 •

edited

Loading