Skip to content

feat: extend adelete_document method to include remove_from_storage o…#225

Merged
a-klos merged 5 commits intomainfrom
fix/doc-upload
Feb 3, 2026
Merged

feat: extend adelete_document method to include remove_from_storage o…#225
a-klos merged 5 commits intomainfrom
fix/doc-upload

Conversation

@a-klos
Copy link
Member

@a-klos a-klos commented Feb 2, 2026

This pull request introduces several improvements and refactors to the admin-api-lib and extractor-api-lib libraries, focusing on better modularization through re-exports, enhanced document deletion logic, and improved development and testing workflows. The most significant changes are grouped below.

1. Codebase Modularization and Re-exports

  • The FileService, S3Service, and S3Settings classes in admin-api-lib and extractor-api-lib now re-export their implementations from rag-core-lib, removing local duplicate definitions and ensuring consistency across libraries. [1] [2] [3]

2. Document Deletion API Enhancements

  • The DocumentDeleter interface and its default implementation now accept a new remove_from_storage argument, allowing callers to control whether the file is deleted from storage in addition to the key-value store. The logic for file deletion was refactored to handle storage keys more robustly and to provide clearer error handling and logging. [1] [2] [3] [4] [5] [6]

3. Upstream Usage and Test Updates

  • All usages of adelete_document in file and source uploaders, as well as related tests, have been updated to use the new remove_from_storage parameter, ensuring correct behavior and test coverage for the enhanced deletion API. [1] [2] [3] [4]

4. Docker and Build System Improvements

  • The Docker build configuration for the extractor service and the libs/Dockerfile have been updated to include and install rag-core-lib as a dependency where needed, improving development workflows and ensuring all necessary dependencies are available for testing and running the services. [1] [2] [3]

These changes collectively improve code maintainability, modularity, and the flexibility of document management operations across the codebase.

** issue:
#224

- Added abstract class `FileService` for handling file I/O operations.
- Created concrete implementation `S3Service` for interacting with S3 storage.
- Introduced `S3Settings` class for managing S3 configuration.
- Updated `poetry.lock` files for dependencies related to AWS SDK (boto3, botocore, jmespath, s3transfer).
- Added file service interfaces in `rag_core_lib`.
@a-klos a-klos merged commit 799727e into main Feb 3, 2026
13 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants