1.3.0 rc #115
base: main
Conversation
…cation in WebSecurityConfig
…ted before alias creation
…rategy for direct collections
…esolution tracking
…oth incremental and full modes
- Updated JobService to use REQUIRES_NEW transaction propagation for deleting ignored jobs, ensuring fresh entity retrieval and preventing issues with the calling transaction.
- Removed token limitation from AI connection model and related DTOs, transitioning to project-level configuration for token limits.
- Adjusted AIConnectionDTO tests to reflect the removal of token limitation.
- Enhanced Bitbucket, GitHub, and GitLab AI client services to check token limits before analysis, throwing DiffTooLargeException when limits are exceeded.
- Updated command processors to utilize project-level token limits instead of AI connection-specific limits.
- Modified webhook processing to handle diff size issues gracefully, posting informative messages to VCS when analysis is skipped due to large diffs.
- Cleaned up integration tests to remove references to token limitation in AI connection creation and updates.
…sis processing. Project PR analysis max analysis token limit implementation
…Exception in webhook processors
… entities from async contexts
…ies without re-fetching in async contexts
… lazy loading of associations
…cy across transaction contexts
…oading of associations
…ansaction management in async context
…mproved job management
…ervice for direct deletion
…nt in webhook processing
…n for RAG context
- Added AST-based code splitter using Tree-sitter for accurate code parsing.
- Introduced TreeSitterParser for dynamic language loading and caching.
- Created scoring configuration for RAG query result reranking with configurable boost factors and priority patterns.
- Refactored RAGQueryService to utilize the new scoring configuration for enhanced result ranking.
- Improved metadata extraction and handling for better context in scoring.
…rove code referencing in prompts
… target branch results
…emental updates in RAG operations
…ssue reconciliation process
…model configurations
- Removed deprecated `_get_collection_name` method from RAGIndexManager.
- Updated imports in `models/__init__.py` to exclude DocumentMetadata.
- Changed default model in RAGConfig from "openai/text-embedding-3-small" to "qwen/qwen3-embedding-8b".
- Deleted DocumentMetadata class from RAGConfig.
- Removed WebhookIntegration service and its related methods.
- Cleaned up unused utility function `is_code_file` from utils.
- Updated tests to reflect the removal of DocumentMetadata and WebhookIntegration.
- Increased the queue capacity of the webhook executor to handle bursts of incoming requests.
- Updated the rejected execution handler to throw exceptions instead of blocking the caller, allowing for better error handling in webhook endpoints.
- Improved error logging in the WebhookAsyncProcessor for better traceability of job failures.
- Introduced an EntityManager to detach jobs from the persistence context, preventing overwriting of job statuses during async processing.
- Implemented a deduplication mechanism in the WebhookDeduplicationService to avoid processing duplicate commits within a defined time window.
- Enhanced the response parsing logic to ensure proper logging and handling of resolved issues.
- Streamlined the handling of PR and MR analysis in GitHub and GitLab webhook handlers, ensuring proper lock management and error handling.
- Added memory-efficient streaming methods for preserving and copying points in the BranchManager.
- Improved regex patterns in metadata extraction for various programming languages to account for leading whitespace.
- Updated scoring configuration to use word-boundary matching for file path patterns, enhancing accuracy in priority assignment.
- Replaced @PreAuthorize annotations with @IsWorkspaceMember and @HasOwnerOrAdminRights in JobController, AllowedCommandUserController, ProjectController, QualityGateController, and VCS controllers (BitbucketCloud, GitHub, GitLab).
- Improved readability and maintainability of security checks by utilizing custom annotations.
- Removed redundant security checks in various endpoints to streamline access control.
- Deleted unused _extract_archive function in web_server.py to clean up codebase.
- Updated import paths for IssueDTO in prompt_builder.py.
- Modified analysis_summary prompt in prompt_constants.py for clarity.
- Enhanced Dockerfile with CPU threading optimizations.
- Improved environment validation in main.py to support Ollama and OpenRouter embedding providers.
- Added new endpoints for parsing files and batch parsing in api.py, including AST metadata extraction.
- Introduced embedding_factory.py to manage embedding model creation for Ollama and OpenRouter.
- Implemented Ollama embedding wrapper in ollama_embedding.py for local model support.
- Updated index_manager to check vector dimensions before copying branches.
- Refactored RAGConfig to validate embedding provider configurations and auto-detect embedding dimensions.
- Adjusted query_service to utilize the new embedding factory for model instantiation.
…action management in JobService and WebhookAsyncProcessor
…s for CRUD operations. Cloud version preparations
…ersion for existing records; update database migration scripts for version handling and remove token limitation from ai_connection
Epic/ca 7 pr review process flow
/codecrow analyze
Important: Review skipped. Too many files: this PR contains 197 files, which is 47 over the limit of 150.
…ling
- Introduced MAX_CONCURRENT_REVIEWS to limit simultaneous review requests.
- Implemented asyncio.Semaphore to manage concurrent reviews.
- Added a timeout of 600 seconds for review processing to prevent long-running requests.
- Refactored LLM reranker initialization to be per-request, improving resource management.
- Ensured MCP sessions are closed after review processing to release resources.
- Enhanced error handling for timeouts and exceptions during review processing.
refactor: Simplify context builder and remove unused components
- Removed legacy context budget management and model context limits.
- Streamlined context builder utilities for RAG metrics and caching.
- Updated context fetching logic to align with new architecture.
fix: Update prompt templates for clarity and accuracy
- Revised Stage 2 cross-file prompt to focus on relevant aspects.
- Changed references from "Database Migrations" to "Migration Files" for consistency.
feat: Implement service-to-service authentication middleware
- Added ServiceSecretMiddleware to validate internal requests with a shared secret.
- Configured middleware to skip authentication for public endpoints.
enhance: Improve collection management with payload indexing
- Added functionality to create payload indexes for efficient filtering on common fields in Qdrant collections.
fix: Adjust query service to handle path prefix mismatches
- Updated fallback logic in RAGQueryService to improve handling of filename matches during queries.
feat: Enhance ReviewService with concurrency control and timeout hand…
/codecrow analyze
- Added DiffFingerprintUtil to compute a stable fingerprint for code changes in pull requests (an illustrative sketch of such a fingerprint utility follows the commit list below).
- Enhanced PullRequestAnalysisProcessor to utilize commit hash and diff fingerprint caches for reusing analysis results.
- Updated CodeAnalysis model to include a diff_fingerprint field for storage.
- Modified CodeAnalysisService to support retrieval and cloning of analyses based on diff fingerprints and commit hashes.
- Added database migrations to introduce the diff_fingerprint column and create necessary indexes.
- Improved error handling and logging in various components, including file existence checks in Bitbucket Cloud.
- Refactored tests to accommodate new functionality and ensure coverage for caching mechanisms.
…or header validation
…ysis-binding feat: Implement diff fingerprint caching for pull request analysis
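The diff-fingerprint commit above describes computing a stable fingerprint for a PR's code changes. A minimal sketch of that idea is shown below; it is illustrative only, and the class name, normalization rules, and hashing choice are assumptions rather than the project's actual DiffFingerprintUtil:
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.HexFormat;

// Hypothetical sketch, not the real DiffFingerprintUtil.
public final class DiffFingerprintSketch {

    // Keep only added/removed lines so that context shifts and hunk-header
    // line numbers do not change the fingerprint.
    static String normalize(String rawDiff) {
        StringBuilder normalized = new StringBuilder();
        for (String line : rawDiff.split("\n")) {
            boolean added = line.startsWith("+") && !line.startsWith("+++");
            boolean removed = line.startsWith("-") && !line.startsWith("---");
            if (added || removed) {
                normalized.append(line.trim()).append('\n');
            }
        }
        return normalized.toString();
    }

    // SHA-256 over the normalized change content yields a compact, stable key
    // that can be reused to look up previously stored analyses.
    static String fingerprint(String rawDiff) {
        try {
            MessageDigest digest = MessageDigest.getInstance("SHA-256");
            byte[] hash = digest.digest(normalize(rawDiff).getBytes(StandardCharsets.UTF_8));
            return HexFormat.of().formatHex(hash);
        } catch (NoSuchAlgorithmException e) {
            throw new IllegalStateException("SHA-256 not available", e);
        }
    }
}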
| Status | FAIL |
| Risk Level | HIGH |
| Review Coverage | 23 files analyzed in depth |
| Confidence | HIGH |
Executive Summary
This PR (1.3.0 rc) introduces significant architectural changes to the analysis engine, including a new diff fingerprinting mechanism for issue deduplication and a major refactoring of the webhook processing pipeline. While these changes aim to improve system scalability and data integrity, the current implementation introduces several high-risk regressions across security, database migration stability, and core RAG extraction logic.
Recommendation
Decision: FAIL
The PR is not suitable for merge in its current state due to critical blockers, including a "fail-open" authentication vulnerability in the Python middleware and conflicting database migrations that will cause deployment failures. Additionally, the RAG pipeline contains runtime incompatibilities with the Tree-sitter API that must be resolved to maintain core functionality.
Issues Overview
| Severity | Count | Description |
|---|---|---|
| 🔴 High | 7 | Critical issues requiring immediate attention |
| 🟡 Medium | 27 | Issues that should be addressed |
| 🔵 Low | 14 | Minor issues and improvements |
| ℹ️ Info | 3 | Informational notes and suggestions |
Analysis completed on 2026-02-10 10:20:23 | View Full Report | Pull Request
📋 Detailed Issues (51)
🔴 High Severity Issues
Id on Platform: 1943
Category: 🔒 Security
File: .../api/api.py:30
Issue: The 'startswith' check on resolved paths is vulnerable to prefix attacks. For example, if '_ALLOWED_REPO_ROOT' is '/tmp', a path like '/tmp-secret/repo' would pass the validation because '/tmp-secret/repo'.startswith('/tmp') is true, even though it is outside the intended root.
💡 Suggested Fix
Use 'os.path.commonpath' to verify that the resolved path is actually within the allowed root directory.
--- a/python-ecosystem/rag-pipeline/src/rag_pipeline/api/api.py
+++ b/python-ecosystem/rag-pipeline/src/rag_pipeline/api/api.py
@@ -25,7 +25,8 @@
def _validate_repo_path(path: str) -> str:
"""Validate that a repo path is within the allowed root and contains no traversal."""
+ root = os.path.realpath(_ALLOWED_REPO_ROOT)
resolved = os.path.realpath(path)
- if not resolved.startswith(os.path.realpath(_ALLOWED_REPO_ROOT)):
+ if os.path.commonpath([root, resolved]) != root:
raise ValueError(
f"Path must be under {_ALLOWED_REPO_ROOT}, got: {path}"
)
Id on Platform: 1950
Category: ⚡ Performance
File: .../core/ollama_embedding.py:240
Issue: The implementation uses a synchronous httpx.Client for all requests, and the async methods (_aget_text_embeddings, etc.) simply wrap these sync calls. In an asynchronous RAG pipeline, this blocks the event loop for the duration of the embedding request, which can take several seconds, effectively negating the benefits of async and causing the entire service to hang.
💡 Suggested Fix
Initialize and use an httpx.AsyncClient for the aget* methods to ensure non-blocking I/O.
--- a/python-ecosystem/rag-pipeline/src/rag_pipeline/core/ollama_embedding.py
+++ b/python-ecosystem/rag-pipeline/src/rag_pipeline/core/ollama_embedding.py
@@ -68,6 +68,10 @@
base_url=base_url.rstrip('/'),
timeout=timeout
))
+ object.__setattr__(self, '_aclient', httpx.AsyncClient(
+ base_url=base_url.rstrip('/'),
+ timeout=timeout
+ ))
# Test connection
self._test_connection()
Id on Platform: 1959
Category: 🐛 Bug Risk
File: .../ai/AiAnalysisRequestImpl.java:1
Issue: The deduplication logic in 'withAllPrAnalysesData' fails to merge resolved status when the input list 'allPrAnalyses' is sorted DESC (newest first), as specified in the Javadoc. In DESC order, the first issue seen for a fingerprint is the newest. Subsequent (older) issues will have a lower 'currentVersion' than the 'existingVersion' in the map, causing the 'if (currentVersion > existingVersion)' block to be skipped. Consequently, if an older version was resolved but the newest is open, the resolved status is never propagated.
💡 Suggested Fix
Sort the input list in ascending order (oldest first) before processing, or adjust the logic to handle DESC order by checking if the older (current) issue is resolved when the existing (newer) one is not.
--- a/java-ecosystem/libs/analysis-engine/src/main/java/org/rostilos/codecrow/analysisengine/dto/request/ai/AiAnalysisRequestImpl.java
+++ b/java-ecosystem/libs/analysis-engine/src/main/java/org/rostilos/codecrow/analysisengine/dto/request/ai/AiAnalysisRequestImpl.java
@@ -310,7 +310,8 @@
}
// Convert all issues to DTOs
- List<AiRequestPreviousIssueDTO> allIssues = allPrAnalyses.stream()
+ List<AiRequestPreviousIssueDTO> allIssues = allPrAnalyses.stream().sorted(java.util.Comparator.comparingInt(CodeAnalysis::getPrVersion))
.flatMap(analysis -> analysis.getIssues().stream())
.map(AiRequestPreviousIssueDTO::fromEntity)
.toList();
Id on Platform: 1967
Category: 🐛 Bug Risk
File: .../index_manager/indexer.py:4
Issue: In 'index_repository', the loop processes batches and tracks 'failed_chunks'. However, if some chunks fail to index, the process continues and eventually performs an atomic swap, promoting an incomplete index to the production alias. This violates the integrity of the 'atomic swap' strategy which should ensure a complete and valid index is ready before swapping.
💡 Suggested Fix
Check the 'failed_chunks' count after the indexing loop. If failures occurred, raise an exception to trigger the cleanup logic and prevent the alias swap.
--- a/python-ecosystem/rag-pipeline/src/rag_pipeline/core/index_manager/indexer.py
+++ b/python-ecosystem/rag-pipeline/src/rag_pipeline/core/index_manager/indexer.py
@@ -215,6 +215,9 @@
f"Streaming indexing complete: {document_count} files, "
f"{successful_chunks}/{chunk_count} chunks indexed ({failed_chunks} failed)"
)
+
+ if failed_chunks > 0:
+ raise Exception(f"Indexing failed for {failed_chunks} chunks. Aborting swap.")
# Verify and perform atomic swap
temp_info = self.point_ops.client.get_collection(temp_collection_name)
Id on Platform: 1974
Category: 🐛 Bug Risk
File: .../splitter/query_runner.py:2
Issue: The code attempts to import and use 'QueryCursor', which was removed in 'tree-sitter' version 0.22.0. Additionally, it attempts to unpack matches as tuples, but in the new API, 'query.matches()' returns 'QueryMatch' objects. This will cause an 'ImportError' or 'TypeError' at runtime.
💡 Suggested Fix
Update the query execution logic to use the 0.22+ API: call 'query.matches(node)' directly and access the 'captures' attribute on the resulting match objects.
--- a/python-ecosystem/rag-pipeline/src/rag_pipeline/core/splitter/query_runner.py
+++ b/python-ecosystem/rag-pipeline/src/rag_pipeline/core/splitter/query_runner.py
@@ -193,11 +193,7 @@
- try:
- # Use QueryCursor.matches() for pattern-grouped results
- # Each match is (pattern_id, {capture_name: [nodes]})
- from tree_sitter import QueryCursor
- cursor = QueryCursor(query)
- raw_matches = list(cursor.matches(tree.root_node))
- except Exception as e:
- logger.warning(f"Query execution failed for {lang_name}: {e}")
- return []
+ try:
+ raw_matches = query.matches(tree.root_node)
+ except Exception as e:
+ logger.warning(f"Query execution failed for {lang_name}: {e}")
+ return []
results: List[QueryMatch] = []
- for pattern_id, captures_dict in raw_matches:
+ for match_obj in raw_matches:
+ captures_dict = match_obj.captures
Id on Platform: 1975
Category: 🔒 Security
File: .../service/WorkspaceService.java:1
Issue: The current logic prevents an ADMIN from promoting someone to ADMIN or OWNER, but it does not prevent an ADMIN from demoting an existing OWNER or another ADMIN. Since the check only looks at 'newRole', an ADMIN could change an OWNER's role to MEMBER.
💡 Suggested Fix
Update the security check to ensure that users with the ADMIN role cannot modify the roles of users who are currently ADMINs or OWNERs.
--- a/java-ecosystem/services/web-server/src/main/java/org/rostilos/codecrow/webserver/workspace/service/WorkspaceService.java
+++ b/java-ecosystem/services/web-server/src/main/java/org/rostilos/codecrow/webserver/workspace/service/WorkspaceService.java
@@ -123,10 +123,13 @@
throw new SecurityException("The current user does not have editing privileges.");
}
WorkspaceMember workspaceMemberActor = optionalWorkspaceMemberActor.get();
- if (workspaceMemberActor.getRole() == EWorkspaceRole.ADMIN
- && (newRole == EWorkspaceRole.ADMIN || newRole == EWorkspaceRole.OWNER)) {
- throw new SecurityException("The current user does not have editing privileges.");
- }
WorkspaceMember workspaceMember = workspaceMemberRepository
.findByWorkspaceIdAndUserId(workspaceId, target.getId())
.orElseThrow(() -> new NoSuchElementException("The user is not present in the workspace."));
+
+ if (workspaceMemberActor.getRole() == EWorkspaceRole.ADMIN) {
+ if (newRole == EWorkspaceRole.ADMIN || newRole == EWorkspaceRole.OWNER ||
+ workspaceMember.getRole() == EWorkspaceRole.ADMIN || workspaceMember.getRole() == EWorkspaceRole.OWNER) {
+ throw new SecurityException("The current user does not have editing privileges.");
+ }
+ }
Id on Platform: 1977
Category: 🐛 Bug Risk
File: .../1.3.0/V1.3.0__remove_token_limitation_from_ai_connection.sql:1
Issue: This migration script is identical to V1.4.0__remove_token_limitation_from_ai_connection.sql which is also included in this PR. Having two migrations performing the same schema change with different version numbers will cause conflicts in the schema history table and potentially break the deployment.
💡 Suggested Fix
Remove the duplicate migration file. Ensure the change is only present in the correct versioned directory (likely 1.4.0 based on the project's current progression).
No suggested fix provided
🟡 Medium Severity Issues
Id on Platform: 1934
Category: 🔒 Security
File: .../api/middleware.py:37
Issue: The secret comparison uses a standard equality operator ('=='), which is vulnerable to timing attacks. Additionally, the middleware 'fails open' if the SERVICE_SECRET environment variable is not set, allowing all requests to bypass authentication.
💡 Suggested Fix
Use 'secrets.compare_digest' for constant-time comparison to prevent timing attacks. Also, consider requiring the secret to be present in production environments to avoid accidental 'fail-open' scenarios.
--- a/python-ecosystem/mcp-client/api/middleware.py
+++ b/python-ecosystem/mcp-client/api/middleware.py
@@ -6,6 +6,7 @@
"""
import os
import logging
+import secrets
from starlette.middleware.base import BaseHTTPMiddleware
@@ -34,7 +35,7 @@
return await call_next(request)
provided = request.headers.get("x-service-secret", "")
- if provided != self.secret:
+ if not secrets.compare_digest(provided, self.secret):
logger.warning(
f"Unauthorized request to {request.url.path} from {request.client.host if request.client else 'unknown'}"
)
Id on Platform: 1935
Category: 🔒 Security
File: .../api/middleware.py:37
Issue: Vulnerable to timing attacks due to standard string comparison and 'fails open' if SERVICE_SECRET is missing.
💡 Suggested Fix
Use 'secrets.compare_digest' for secret validation.
--- a/python-ecosystem/rag-pipeline/src/rag_pipeline/api/middleware.py
+++ b/python-ecosystem/rag-pipeline/src/rag_pipeline/api/middleware.py
@@ -6,6 +6,7 @@
"""
import os
import logging
+import secrets
from starlette.middleware.base import BaseHTTPMiddleware
@@ -34,7 +35,7 @@
return await call_next(request)
provided = request.headers.get("x-service-secret", "")
- if provided != self.secret:
+ if not secrets.compare_digest(provided, self.secret):
logger.warning(
f"Unauthorized request to {request.url.path} from {request.client.host if request.client else 'unknown'}"
)
Id on Platform: 1936
Category: 🛡️ Error Handling
File: .../command/command_service.py
Issue: In 'process_summarize', the MCP client is instantiated, but the 'finally' block that ensures sessions are closed only covers the '_execute_summarize' call. If a timeout or exception occurs during '_fetch_rag_context_for_summarize' or prompt building, 'client.close_all_sessions()' will never be called, potentially leaking JVM subprocesses.
💡 Suggested Fix
Wrap the entire lifecycle of the MCP client in a try...finally block.
--- a/python-ecosystem/mcp-client/service/command/command_service.py
+++ b/python-ecosystem/mcp-client/service/command/command_service.py
@@ -84,13 +84,13 @@
})
client = self._create_mcp_client(config)
- llm = self._create_llm(request)
-
- # Fetch RAG context
- rag_context = await self._fetch_rag_context_for_summarize(request, event_callback)
-
- # Build prompt
- prompt = self._build_summarize_prompt(request, rag_context)
-
- self._emit_event(event_callback, {
- "type": "status",
- "state": "generating",
- "message": "Generating PR summary with AI"
- })
-
- # Execute with MCP agent
try:
+ llm = self._create_llm(request)
+ rag_context = await self._fetch_rag_context_for_summarize(request, event_callback)
+ prompt = self._build_summarize_prompt(request, rag_context)
+ self._emit_event(event_callback, {
+ "type": "status",
+ "state": "generating",
+ "message": "Generating PR summary with AI"
+ })
result = await self._execute_summarize(
Id on Platform: 1937
Category: 🛡️ Error Handling
File: .../command/command_service.py
Issue: In 'process_ask', the MCP client is instantiated, but the 'finally' block that ensures sessions are closed only covers the '_execute_ask' call. If a timeout or exception occurs during '_fetch_rag_context_for_ask' or prompt building, 'client.close_all_sessions()' will never be called.
💡 Suggested Fix
Wrap the entire lifecycle of the MCP client in a try...finally block.
--- a/python-ecosystem/mcp-client/service/command/command_service.py
+++ b/python-ecosystem/mcp-client/service/command/command_service.py
@@ -184,12 +184,12 @@
client = self._create_mcp_client(config)
- llm = self._create_llm(request)
-
- # Fetch RAG context for the question
- rag_context = await self._fetch_rag_context_for_ask(request, event_callback)
-
- # Build prompt with Platform MCP tools if available
- prompt = self._build_ask_prompt(request, rag_context, has_platform_mcp=include_platform)
-
- self._emit_event(event_callback, {
- "type": "status",
- "state": "generating",
- "message": "Generating answer with AI"
- })
-
- # Execute with MCP agent
try:
+ llm = self._create_llm(request)
+ rag_context = await self._fetch_rag_context_for_ask(request, event_callback)
+ prompt = self._build_ask_prompt(request, rag_context, has_platform_mcp=include_platform)
+ self._emit_event(event_callback, {
+ "type": "status",
+ "state": "generating",
+ "message": "Generating answer with AI"
+ })
result = await self._execute_ask(
Id on Platform: 1938
Category: 🛡️ Error Handling
File: .../review/review_service.py
Issue: The MCP client is created, but the cleanup logic ('close_all_sessions') is only triggered if the code reaches the inner 'try...finally' block. If a timeout occurs during 'self._fetch_rag_context' or orchestrator initialization, the sessions will remain open, leading to leaked JVM processes.
💡 Suggested Fix
Move the 'try...finally' block to immediately follow the MCP client creation to ensure cleanup regardless of where a timeout or error occurs within the request lifecycle.
--- a/python-ecosystem/mcp-client/service/review/review_service.py
+++ b/python-ecosystem/mcp-client/service/review/review_service.py
@@ -158,16 +158,16 @@
client = self._create_mcp_client(config)
-
- # Create LLM instance
- llm = self._create_llm(request)
-
- # Create a per-request reranker (not shared across concurrent requests)
- llm_reranker = LLMReranker(llm_client=llm)
-
- # Fetch RAG context if enabled
- rag_context = await self._fetch_rag_context(request, event_callback, llm_reranker=llm_reranker)
-
- # Build processed_diff if rawDiff is available to optimize Stage 1
- processed_diff = None
- if has_raw_diff:
- processed_diff = self._process_raw_diff(request.rawDiff)
-
- # Initialize Multi-Stage Orchestrator
- orchestrator = MultiStageReviewOrchestrator(
- llm=llm,
- mcp_client=client,
- project_id=request.projectId,
- event_callback=event_callback
- )
-
try:
+ llm = self._create_llm(request)
+ llm_reranker = LLMReranker(llm_client=llm)
+ rag_context = await self._fetch_rag_context(request, event_callback, llm_reranker=llm_reranker)
+ processed_diff = None
+ if has_raw_diff:
+ processed_diff = self._process_raw_diff(request.rawDiff)
+ orchestrator = MultiStageReviewOrchestrator(
+ llm=llm,
+ mcp_client=client,
+ project_id=request.projectId,
+ event_callback=event_callback
+ )
# Check for Branch Analysis / Reconciliation mode
Id on Platform: 1940
Category: 🐛 Bug Risk
File: .../client/RagPipelineClient.java:214
Issue: The 'branch' parameter in 'deleteIndex' is appended to the URL without encoding. Branch names frequently contain special characters like slashes (e.g., 'feature/fix'), which will break the URL structure. In contrast, 'deleteBranch' correctly encodes the branch name.
💡 Suggested Fix
Use URLEncoder to encode the branch name before formatting the URL, similar to the implementation in 'deleteBranch'.
--- a/java-ecosystem/libs/rag-engine/src/main/java/org/rostilos/codecrow/ragengine/client/RagPipelineClient.java
+++ b/java-ecosystem/libs/rag-engine/src/main/java/org/rostilos/codecrow/ragengine/client/RagPipelineClient.java
@@ -211,7 +211,8 @@
return;
}
- String url = String.format("%s/index/%s/%s/%s", ragApiUrl, workspace, project, branch);
+ String encodedBranch = java.net.URLEncoder.encode(branch, java.nio.charset.StandardCharsets.UTF_8);
+ String url = String.format("%s/index/%s/%s/%s", ragApiUrl, workspace, project, encodedBranch);
Request.Builder builder = new Request.Builder()
.url(url)
.delete();
Id on Platform: 1941
Category: 🛡️ Error Handling
File: .../actions/CheckFileExistsInBranchAction.java
Issue: The retry logic within the 'catch (IOException e)' block lacks a backoff delay (Thread.sleep). If an IOException is thrown that contains '429' in its message (e.g., by a custom interceptor or a specific client implementation), the code will immediately retry the request without waiting, which is counter-productive when dealing with rate limits.
💡 Suggested Fix
Add a Thread.sleep(backoffMs) call before 'continue' in the catch block to ensure the rate limit is respected during retries.
--- a/java-ecosystem/libs/vcs-client/src/main/java/org/rostilos/codecrow/vcsclient/bitbucket/cloud/actions/CheckFileExistsInBranchAction.java
+++ b/java-ecosystem/libs/vcs-client/src/main/java/org/rostilos/codecrow/vcsclient/bitbucket/cloud/actions/CheckFileExistsInBranchAction.java
@@ -94,6 +94,12 @@
} catch (IOException e) {
if (attempt < MAX_RETRIES && e.getMessage() != null && e.getMessage().contains("429")) {
+ try {
+ Thread.sleep(backoffMs);
+ } catch (InterruptedException ie) {
+ Thread.currentThread().interrupt();
+ throw new IOException("Interrupted during retry backoff", ie);
+ }
attempt++;
backoffMs *= 2;
continue;
Id on Platform: 1942
Category: 🐛 Bug Risk
File: .../github/GitHubClient.java:966
Issue: The GraphQL 'expression' string is constructed by only escaping double quotes. If a file path contains a backslash (\), it will act as an escape character in the GraphQL query string, potentially leading to malformed queries or incorrect path resolution (e.g., '\n' being interpreted as a newline).
💡 Suggested Fix
Escape backslashes in the expression string before inserting it into the GraphQL query.
--- a/java-ecosystem/libs/vcs-client/src/main/java/org/rostilos/codecrow/vcsclient/github/GitHubClient.java
+++ b/java-ecosystem/libs/vcs-client/src/main/java/org/rostilos/codecrow/vcsclient/github/GitHubClient.java
@@ -964,3 +964,3 @@
String expression = branchOrCommit + ":" + path;
queryBuilder.append(alias).append(": object(expression: \"")
- .append(expression.replace("\"", "\\\""))
+ .append(expression.replace("\\", "\\\\").replace("\"", "\\\""))
.append("\") { ... on Blob { text byteSize } } ");Id on Platform: 1945
Category: 🐛 Bug Risk
File: .../queries/rust.scm:12
Issue: The queries for impl_item use (type_identifier) for the type and trait fields. In Rust, these fields can contain complex types such as generic types (Vec<T>), scoped types (std::collections::HashMap), or references (&str). Using (type_identifier) restricts matches to simple, non-generic, local types only, causing the RAG pipeline to miss a significant portion of implementation logic.
💡 Suggested Fix
Use the wildcard (_) or remove the node type restriction to allow matching any valid type node within the impl block.
--- a/python-ecosystem/rag-pipeline/src/rag_pipeline/core/splitter/queries/rust.scm
+++ b/python-ecosystem/rag-pipeline/src/rag_pipeline/core/splitter/queries/rust.scm
@@ -21,8 +21,8 @@
; Implementation blocks (simple type impl)
(impl_item
- type: (type_identifier) @name) @definition.impl
+ type: (_) @name) @definition.impl
; Implementation blocks for trait (impl Trait for Type)
(impl_item
- trait: (type_identifier) @trait_name
- type: (type_identifier) @name) @definition.impl
+ trait: (_) @trait_name
+ type: (_) @name) @definition.impl
Id on Platform: 1949
Category: ⚡ Performance
File: .../core/ollama_embedding.py:73
Issue: Calling _test_connection() in the constructor performs a blocking network request. This can cause significant delays or timeouts during application startup or object instantiation if the Ollama server is slow or unreachable.
💡 Suggested Fix
Remove the connection test from the constructor. Connection health should be checked lazily or via a dedicated health check method.
--- a/python-ecosystem/rag-pipeline/src/rag_pipeline/core/ollama_embedding.py
+++ b/python-ecosystem/rag-pipeline/src/rag_pipeline/core/ollama_embedding.py
@@ -70,8 +70,6 @@
timeout=timeout
))
- # Test connection
- self._test_connection()
logger.info(f"Ollama embeddings initialized successfully")
def _test_connection(self):
Id on Platform: 1951
Category: 🐛 Bug Risk
File: .../model/dtos.py:21
Issue: The 'line' field is defined as Optional[int], but the corresponding 'CodeReviewIssue' in 'output_schemas.py' defines it as a string to support ranges (e.g., '42-45'). If a previous issue containing a range is passed back to the client, Pydantic validation will fail for IssueDTO.
💡 Suggested Fix
Change the type of 'line' to Optional[str] to maintain consistency with the output schema and support line ranges.
--- a/python-ecosystem/mcp-client/model/dtos.py
+++ b/python-ecosystem/mcp-client/model/dtos.py
@@ -19,3 +19,3 @@
suggestedFixDiff: Optional[str] = None # Diff for suggested fix (from Java)
file: Optional[str] = None
- line: Optional[int] = None
+ line: Optional[str] = None
branch: Optional[str] = None
Id on Platform: 1954
Category: 🐛 Bug Risk
File: .../splitter/languages.py:1
Issue: Inconsistency in language support: 'Language.COBOL' is included in 'AST_SUPPORTED_LANGUAGES' but missing from 'LANGUAGE_TO_TREESITTER'. Additionally, several languages (Kotlin, Scala, Lua, Perl, Swift, Haskell, Cobol) are missing from 'TREESITTER_MODULES', which will cause 'TreeSitterParser.get_language' to return None even when 'is_ast_supported' returns True.
💡 Suggested Fix
Ensure all languages in 'AST_SUPPORTED_LANGUAGES' have corresponding entries in 'LANGUAGE_TO_TREESITTER' and 'TREESITTER_MODULES'.
--- a/python-ecosystem/rag-pipeline/src/rag_pipeline/core/splitter/languages.py
+++ b/python-ecosystem/rag-pipeline/src/rag_pipeline/core/splitter/languages.py
@@ -101,4 +101,5 @@
Language.SWIFT: 'swift',
Language.HASKELL: 'haskell',
+ Language.COBOL: 'cobol',
}
Id on Platform: 1957
Category: ⚡ Performance
File: .../vcsclient/VcsClientProvider.java:213
Issue: The 'refreshToken' method is marked with '@Transactional' and performs network I/O (OAuth token refresh). Holding a database transaction open during network calls is a performance anti-pattern that can lead to connection pool exhaustion and deadlocks, especially with the 30-second timeout. If the network is slow, the DB connection is held unnecessarily.
💡 Suggested Fix
Refactor the method to perform the network call outside of the transaction. Only the final step of updating the database with the new tokens should be wrapped in a transaction.
No suggested fix provided
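No diff accompanies this finding, so here is a minimal sketch of the suggested split under stated assumptions (TokenStore and OAuthHttpClient are invented collaborators, not the project's API): the slow OAuth call runs with no transaction open, and only the short database write is transactional.
import org.springframework.stereotype.Service;
import org.springframework.transaction.annotation.Transactional;

// Hypothetical sketch; TokenStore and OAuthHttpClient are assumed collaborators.
@Service
class TokenRefreshFlow {

    private final TokenStore tokenStore;       // short transactional write, separate bean
    private final OAuthHttpClient oauthClient; // plain HTTP client, no transaction involved

    TokenRefreshFlow(TokenStore tokenStore, OAuthHttpClient oauthClient) {
        this.tokenStore = tokenStore;
        this.oauthClient = oauthClient;
    }

    // Deliberately NOT @Transactional: the slow OAuth call holds no DB connection.
    public String refreshToken(long connectionId, String refreshToken) {
        String newAccessToken = oauthClient.refresh(refreshToken);
        tokenStore.saveAccessToken(connectionId, newAccessToken);
        return newAccessToken;
    }

    interface OAuthHttpClient {
        String refresh(String refreshToken);
    }

    @Service
    static class TokenStore {
        // Only this short write runs in a transaction; using a separate bean
        // avoids self-invocation, so the @Transactional proxy is actually applied.
        @Transactional
        public void saveAccessToken(long connectionId, String accessToken) {
            // e.g. repository.updateAccessToken(connectionId, accessToken);
        }
    }
}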
Id on Platform: 1960
Category: 🧹 Code Quality
File: .../ai/AiAnalysisRequestImpl.java:364
Issue: The line grouping logic 'issue.line() / 3' does not implement the '±3 tolerance' mentioned in the Javadoc. This creates fixed buckets (e.g., lines 1-2 in bucket 0, lines 3-5 in bucket 1). Adjacent lines like 2 and 3 will fall into different buckets and fail to match, while lines 3 and 5 will match despite being further apart. This makes issue tracking across commits jittery.
💡 Suggested Fix
Consider using a larger bucket or a more flexible matching algorithm if true tolerance is required. At minimum, update the Javadoc to reflect that it uses fixed buckets of size 3.
--- a/java-ecosystem/libs/analysis-engine/src/main/java/org/rostilos/codecrow/analysisengine/dto/request/ai/AiAnalysisRequestImpl.java
+++ b/java-ecosystem/libs/analysis-engine/src/main/java/org/rostilos/codecrow/analysisengine/dto/request/ai/AiAnalysisRequestImpl.java
@@ -361,7 +361,7 @@
private String computeIssueFingerprint(AiRequestPreviousIssueDTO issue) {
String file = issue.file() != null ? issue.file() : "";
- // Normalize line to nearest multiple of 3 for tolerance
- int lineGroup = issue.line() != null ? (issue.line() / 3) : 0;
+ // Normalize line to buckets of 5 for better tolerance across small shifts
+ int lineGroup = issue.line() != null ? (issue.line() / 5) : 0;
String severity = issue.severity() != null ? issue.severity() : "";
Id on Platform: 1963
Category: 🐛 Bug Risk
File: .../webhookhandler/GitHubPullRequestWebhookHandler.java:198
Issue: Calling postAnalysisResults with a null codeAnalysis parameter is likely to cause a NullPointerException in the reporting service. As seen in GitHubReportingService.java, the implementation attempts to generate an analysis summary from the provided analysis object without checking for null.
💡 Suggested Fix
Avoid passing null to postAnalysisResults. Instead, consider adding a dedicated error reporting method to the VcsReportingService or passing a minimal CodeAnalysis object that represents the error state.
No suggested fix provided
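No diff is provided here either; a non-authoritative sketch of the second suggestion builds a minimal error-state analysis instead of passing null (the setter names and AnalysisStatus enum below are assumptions about the CodeAnalysis API, not confirmed by the report):
// Hypothetical sketch; setter names and AnalysisStatus are assumptions.
private CodeAnalysis buildErrorAnalysis(Exception e) {
    CodeAnalysis errorAnalysis = new CodeAnalysis();
    errorAnalysis.setStatus(AnalysisStatus.FAILED);
    errorAnalysis.setSummary("Analysis failed: " + e.getMessage());
    return errorAnalysis;
}
// At the call site, replace the null argument:
// postAnalysisResults(project, prId, buildErrorAnalysis(e));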
Id on Platform: 1964
Category: 🧹 Code Quality
File: .../webhookhandler/GitLabMergeRequestWebhookHandler.java:189
Issue: The GitLab handler is inconsistent with the GitHub handler as it lacks specific catch blocks for DiffTooLargeException and AnalysisLockedException. This results in these recoverable exceptions being logged as errors and prevents them from being re-thrown for proper handling by the WebhookAsyncProcessor.
💡 Suggested Fix
Add a specific catch block for DiffTooLargeException and AnalysisLockedException that releases the lock and re-throws the exception, matching the behavior in GitHubPullRequestWebhookHandler. Ensure the exceptions are imported.
--- a/java-ecosystem/services/pipeline-agent/src/main/java/org/rostilos/codecrow/pipelineagent/gitlab/webhookhandler/GitLabMergeRequestWebhookHandler.java
+++ b/java-ecosystem/services/pipeline-agent/src/main/java/org/rostilos/codecrow/pipelineagent/gitlab/webhookhandler/GitLabMergeRequestWebhookHandler.java
@@ -187,6 +187,13 @@
return WebhookResult.success("MR analysis completed", result);
+ } catch (DiffTooLargeException | AnalysisLockedException e) {
+ if (acquiredLockKey != null) {
+ analysisLockService.releaseLock(acquiredLockKey);
+ }
+ log.warn("MR analysis failed with recoverable exception for project {}: {}", project.getId(), e.getMessage());
+ throw e;
} catch (Exception e) {
log.error("MR analysis failed for project {}", project.getId(), e);
// Release the lock since processor won't take ownership
Id on Platform: 1965
Category: 🐛 Bug Risk
File: .../webhookhandler/GitLabMergeRequestWebhookHandler.java:197
Issue: Calling postAnalysisResults with a null codeAnalysis parameter is likely to cause a NullPointerException in the reporting service, similar to the issue in the GitHub handler.
💡 Suggested Fix
Avoid passing null to postAnalysisResults. Use a dedicated error reporting mechanism or a fallback analysis object.
No suggested fix provided
Id on Platform: 1966
Category: ⚡ Performance
File: .../index_manager/branch_manager.py:1
Issue: The 'get_indexed_branches' method performs a full scroll of the collection to identify unique branch names. In collections with millions of points, this will result in thousands of network requests and significant latency, as it iterates through every point's payload.
💡 Suggested Fix
Increase the scroll limit to reduce round trips and consider caching the branch list or storing it in a dedicated metadata collection/point to avoid full scans.
--- a/python-ecosystem/rag-pipeline/src/rag_pipeline/core/index_manager/branch_manager.py
+++ b/python-ecosystem/rag-pipeline/src/rag_pipeline/core/index_manager/branch_manager.py
@@ -75,7 +75,7 @@
try:
branches: Set[str] = set()
offset = None
- limit = 100
+ limit = 1000
while True:
results = self.client.scroll(
Id on Platform: 1969
Category: 🛡️ Error Handling
File: .../index_manager/point_operations.py:1
Issue: The 'upsert_points' method catches all exceptions during batch upserts and simply logs them, continuing with the next batch. While this provides resilience, it leads to silent data loss in the index if the caller does not explicitly check the failure count.
💡 Suggested Fix
Consider re-raising the exception or ensuring the caller (like RepositoryIndexer) strictly validates the success of all batches before proceeding with alias swaps.
No suggested fix provided
Id on Platform: 1971
Category: 🧹 Code Quality
File: .../orchestrator/context_helpers.py:1
Issue: The regex patterns for symbol extraction are too restrictive. The CamelCase pattern [A-Z][a-z]+[A-Z][a-zA-Z]* requires at least two uppercase letters with lowercase in between (e.g., 'MyClass' matches, but 'User' or 'Data' do not). Similarly, the snake_case pattern requires at least one underscore, missing single-word variables like 'result' or 'items'.
💡 Suggested Fix
Update the regex patterns to capture single-word capitalized identifiers and single-word lowercase identifiers (with length filters).
--- a/python-ecosystem/mcp-client/service/review/orchestrator/context_helpers.py
+++ b/python-ecosystem/mcp-client/service/review/orchestrator/context_helpers.py
@@ -40,8 +40,8 @@
- camel_case = re.findall(r'\b([A-Z][a-z]+[A-Z][a-zA-Z]*)\b', diff_content)
- symbols.update(camel_case)
-
- # Match snake_case identifiers (variables, functions)
- snake_case = re.findall(r'\b([a-z][a-z0-9]*(?:_[a-z0-9]+)+)\b', diff_content)
+ # Match Capitalized identifiers (Classes, Components)
+ capitalized = re.findall(r'\b([A-Z][a-zA-Z0-9]+)\b', diff_content)
+ symbols.update(capitalized)
+
+ # Match lowercase identifiers (variables, functions)
+ identifiers = re.findall(r'\b([a-z][a-z0-9_]+)\b', diff_content)
+ symbols.update(s for s in identifiers if len(s) > 4)
Id on Platform: 1972
Category: 🐛 Bug Risk
File: .../orchestrator/json_utils.py:10
Issue: The use of text.find('{') and text.rfind('}') to extract JSON is fragile. If the LLM output contains multiple JSON objects (e.g., 'Config: { ... } Result: { ... }'), this logic will extract everything from the first '{' to the last '}', resulting in invalid JSON.
💡 Suggested Fix
Implement a balanced brace extraction algorithm to find the first complete JSON object, similar to the _extract_json_object method seen in python-ecosystem/mcp-client/service/command_service.py.
--- a/python-ecosystem/mcp-client/service/review/orchestrator/json_utils.py
+++ b/python-ecosystem/mcp-client/service/review/orchestrator/json_utils.py
@@ -130,2 +130,19 @@
- obj_start = text.find("{")
- obj_end = text.rfind("}")
+ def extract_balanced(t, start_char='{', end_char='}'):
+ start = t.find(start_char)
+ if start == -1: return -1, -1
+ depth, in_str, escape = 0, False, False
+ for i in range(start, len(t)):
+ char = t[i]
+ if escape: escape = False; continue
+ if char == '\\': escape = True; continue
+ if char == '"': in_str = not in_str; continue
+ if in_str: continue
+ if char == start_char: depth += 1
+ elif char == end_char:
+ depth -= 1
+ if depth == 0: return start, i
+ return -1, -1
+
+ obj_start, obj_end = extract_balanced(text, '{', '}')
Id on Platform: 1973
Category: 🐛 Bug Risk
File: .../splitter/metadata.py:120
Issue: In 'extract_signature', the code uses 'next()' with a generator to find the index of the current line in the 'lines' list. If the same line content (e.g., a common decorator or method name like 'def __init__') appears multiple times in the first 15 lines, 'idx' will always point to the first occurrence, potentially causing the signature to be built from the wrong context.
💡 Suggested Fix
Use 'enumerate' in the outer loop to get the current line index directly.
--- a/python-ecosystem/rag-pipeline/src/rag_pipeline/core/splitter/metadata.py
+++ b/python-ecosystem/rag-pipeline/src/rag_pipeline/core/splitter/metadata.py
@@ -112,2 +112,2 @@
- for line in lines[:15]:
- line = line.strip()
+ for idx, line in enumerate(lines[:15]):
+ line = line.strip()
@@ -125,3 +125,2 @@
- if ')' not in sig and ':' not in sig:
- idx = next((i for i, l in enumerate(lines) if l.strip() == line), -1)
- if idx >= 0:
- for next_line in lines[idx+1:idx+5]:
+ if ')' not in sig and ':' not in sig:
+ for next_line in lines[idx+1:idx+5]:
Id on Platform: 1976
Category: ⚡ Performance
File: .../service/JobService.java:383
Issue: Applying REQUIRES_NEW to all logging methods (info, warn, error, debug) will cause a new database transaction to be opened and committed for every single log entry. Under high load or within loops, this can lead to connection pool exhaustion and significant performance degradation.
💡 Suggested Fix
Consider using a non-blocking asynchronous logging mechanism or only applying REQUIRES_NEW to critical status changes and error logs, while keeping standard progress logs within the parent transaction or using a buffered approach.
No suggested fix provided
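As a rough sketch of the buffered approach suggested above (BufferedJobLogger and JobLogRepository are assumed names, not the project's real types), progress lines can be queued in memory and flushed in one transaction, while only error entries open their own REQUIRES_NEW transaction:
import java.util.Queue;
import java.util.concurrent.ConcurrentLinkedQueue;
import org.springframework.stereotype.Service;
import org.springframework.transaction.annotation.Propagation;
import org.springframework.transaction.annotation.Transactional;

// Hypothetical sketch; JobLogRepository and its append method are assumptions.
// Simplified to a single job: a real version would key the buffer per job id.
@Service
class BufferedJobLogger {

    interface JobLogRepository {
        void append(long jobId, String level, String message);
    }

    private final Queue<String> buffer = new ConcurrentLinkedQueue<>();
    private final JobLogRepository repository;

    BufferedJobLogger(JobLogRepository repository) {
        this.repository = repository;
    }

    // Progress lines: no database transaction per call.
    public void info(long jobId, String message) {
        buffer.add(message);
    }

    // Critical entries still commit immediately in their own transaction.
    @Transactional(propagation = Propagation.REQUIRES_NEW)
    public void error(long jobId, String message) {
        repository.append(jobId, "ERROR", message);
    }

    // One transaction flushes the whole batch of buffered progress lines.
    @Transactional
    public void flush(long jobId) {
        String line;
        while ((line = buffer.poll()) != null) {
            repository.append(jobId, "INFO", line);
        }
    }
}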
Id on Platform: 1978
Category: 🐛 Bug Risk
File: .../service/PrFileEnrichmentService.java:1
Issue: The OkHttpClient is initialized in the constructor before @Value fields are injected. Consequently, the 'requestTimeoutSeconds' property is ignored, and the client always uses the hardcoded 60s timeout. Additionally, creating a new ObjectMapper instance manually bypasses Spring's auto-configuration (e.g., custom modules, naming strategies).
💡 Suggested Fix
Use constructor injection for the ObjectMapper and the timeout property to ensure they are correctly initialized and managed by Spring.
--- a/java-ecosystem/libs/analysis-engine/src/main/java/org/rostilos/codecrow/analysisengine/service/PrFileEnrichmentService.java
+++ b/java-ecosystem/libs/analysis-engine/src/main/java/org/rostilos/codecrow/analysisengine/service/PrFileEnrichmentService.java
@@ -42,16 +42,14 @@
@Value("${pr.enrichment.rag-pipeline-url:http://localhost:8006}")
private String ragPipelineUrl;
- @Value("${pr.enrichment.request-timeout-seconds:60}")
- private int requestTimeoutSeconds;
-
private final ObjectMapper objectMapper;
private final OkHttpClient httpClient;
- public PrFileEnrichmentService() {
- this.objectMapper = new ObjectMapper();
+ public PrFileEnrichmentService(
+ ObjectMapper objectMapper,
+ @Value("${pr.enrichment.request-timeout-seconds:60}") int requestTimeoutSeconds
+ ) {
+ this.objectMapper = objectMapper;
this.httpClient = new OkHttpClient.Builder()
.connectTimeout(30, TimeUnit.SECONDS)
- .readTimeout(60, TimeUnit.SECONDS)
+ .readTimeout(requestTimeoutSeconds, TimeUnit.SECONDS)
.writeTimeout(30, TimeUnit.SECONDS)
.build();
}
Id on Platform: 1979
Category: 🐛 Bug Risk
File: .../enrichment/FileContentDto.java:21
Issue: String.getBytes() uses the platform's default charset. This can cause inconsistent file size reporting if the analysis engine runs on systems with different default encodings (e.g., UTF-16 vs UTF-8). Since VCS content is typically UTF-8, it should be explicitly specified.
💡 Suggested Fix
Specify StandardCharsets.UTF_8 when calling getBytes().
--- a/java-ecosystem/libs/analysis-engine/src/main/java/org/rostilos/codecrow/analysisengine/dto/request/ai/enrichment/FileContentDto.java
+++ b/java-ecosystem/libs/analysis-engine/src/main/java/org/rostilos/codecrow/analysisengine/dto/request/ai/enrichment/FileContentDto.java
@@ -19,3 +19,3 @@
path,
content,
- content != null ? content.getBytes().length : 0,
+ content != null ? content.getBytes(java.nio.charset.StandardCharsets.UTF_8).length : 0,
false,
Id on Platform: 1983
Category: 🧹 Code Quality
File: .../routers/commands.py:1
Issue: The utility functions '_wants_streaming', '_json_event', and especially the complex '_drain_queue_until_final' logic are duplicated exactly in 'python-ecosystem/mcp-client/api/routers/review.py'.
💡 Suggested Fix
Extract shared streaming utilities and NDJSON helpers into a common utility module (e.g., 'utils/streaming.py') to improve maintainability and reduce code duplication.
No suggested fix provided
Id on Platform: 1984
Category: 🧹 Code Quality
File: .../routers/review.py:1
Issue: The utility functions '_wants_streaming', '_json_event', and '_drain_queue_until_final' are duplicated from 'python-ecosystem/mcp-client/api/routers/commands.py'.
💡 Suggested Fix
Extract shared streaming utilities into a common utility module.
No suggested fix provided
🔵 Low Severity Issues
Id on Platform: 1939
Category: ⚡ Performance
File: .../models/config.py:107
Issue: The 'chunk_size' has been increased from 800 to 8000 characters. While this keeps semantic units intact, it significantly increases the token count per retrieved context. If 'top_k' is high (e.g., 8 as seen in 'command_service.py'), this could lead to very large prompts that might exceed context limits or increase latency.
💡 Suggested Fix
Monitor token usage and consider if a slightly smaller chunk size (e.g., 4000) provides a better balance between semantic completeness and prompt efficiency.
No suggested fix provided
Id on Platform: 1944
Category: 🧹 Code Quality
File: .../model/multi_stage.py:84
Issue: The alias 'check_pass' is identical to the field name, making it redundant.
💡 Suggested Fix
Remove the redundant alias from the Field definition.
--- a/python-ecosystem/mcp-client/model/multi_stage.py
+++ b/python-ecosystem/mcp-client/model/multi_stage.py
@@ -83,3 +83,3 @@
class ImmutabilityCheck(BaseModel):
rule: str
- check_pass: bool = Field(alias="check_pass")
+ check_pass: bool
evidence: str
Id on Platform: 1946
Category: ✨ Best Practices
File: .../controller/ProjectController.java:592
Issue: The 'maxAnalysisTokenLimit' field in 'UpdateAnalysisSettingsRequest' lacks validation. A user could potentially submit a negative value or an excessively high value if not constrained by the service layer.
💡 Suggested Fix
Add validation annotations such as '@Min(0)' to the record field.
--- a/java-ecosystem/services/web-server/src/main/java/org/rostilos/codecrow/webserver/project/controller/ProjectController.java
+++ b/java-ecosystem/services/web-server/src/main/java/org/rostilos/codecrow/webserver/project/controller/ProjectController.java
@@ -591,7 +591,7 @@
Boolean prAnalysisEnabled,
Boolean branchAnalysisEnabled,
String installationMethod,
- Integer maxAnalysisTokenLimit
+ @jakarta.validation.constraints.Min(0) Integer maxAnalysisTokenLimit
) {}
Id on Platform: 1947
Category: 🐛 Bug Risk
File: .../queries/typescript.scm:58
Issue: The predicate (#not-match? @value "^arrow_function") matches against the literal source text of the node. Since an arrow function's source (e.g., '(x) => x') never starts with the string 'arrow_function', this filter does nothing. Consequently, arrow functions assigned to variables will be tagged as both @definition.function and @definition.variable.
💡 Suggested Fix
Remove the ineffective regex filter. To distinguish between functions and variables in lexical declarations, consider using more specific patterns or handling the overlap in the post-processing logic.
--- a/python-ecosystem/rag-pipeline/src/rag_pipeline/core/splitter/queries/typescript.scm
+++ b/python-ecosystem/rag-pipeline/src/rag_pipeline/core/splitter/queries/typescript.scm
@@ -55,7 +55,6 @@
(lexical_declaration
(variable_declarator
name: (identifier) @name
- value: (_) @value) @declarator
- (#not-match? @value "^arrow_function")) @definition.variable
+ value: (_) @value) @declarator) @definition.variable
; Ambient declarations
Id on Platform: 1948
Category: ✨ Best Practices
File: .../core/embedding_factory.py:35
Issue: Hardcoding 'timeout=120.0' (and 60.0 for OpenRouter) overrides the default logic in OllamaEmbedding/OpenRouterEmbedding which checks for environment variables like OLLAMA_TIMEOUT. This makes the system less configurable.
💡 Suggested Fix
Pass None for the timeout parameter to allow the embedding classes to use their internal default logic (which includes environment variable lookups).
--- a/python-ecosystem/rag-pipeline/src/rag_pipeline/core/embedding_factory.py
+++ b/python-ecosystem/rag-pipeline/src/rag_pipeline/core/embedding_factory.py
@@ -32,7 +32,7 @@
return OllamaEmbedding(
model=config.ollama_model,
base_url=config.ollama_base_url,
- timeout=120.0,
+ timeout=None,
expected_dim=config.embedding_dim
)
@@ -42,7 +42,7 @@
api_key=config.openrouter_api_key,
model=config.openrouter_model,
api_base=config.openrouter_base_url,
- timeout=60.0,
+ timeout=None,
max_retries=3,
expected_dim=config.embedding_dim
)
Id on Platform: 1952
Category: 🧹 Code Quality
File: .../model/output_schemas.py:1
Issue: The fields 'resolutionExplanation' and 'resolvedInCommit' are inconsistent with 'resolvedDescription' and 'resolvedByCommit' used in 'IssueDTO' (dtos.py). Aligning these names improves maintainability and reduces mapping errors during issue reconciliation.
💡 Suggested Fix
Rename the resolution tracking fields to match the naming convention used in IssueDTO.
--- a/python-ecosystem/mcp-client/model/output_schemas.py
+++ b/python-ecosystem/mcp-client/model/output_schemas.py
@@ -23,4 +23,4 @@
isResolved: bool = Field(default=False, description="Whether this issue from previous analysis is resolved")
# Resolution tracking fields
- resolutionExplanation: Optional[str] = Field(default=None, description="Explanation of how the issue was resolved (separate from original reason)")
- resolvedInCommit: Optional[str] = Field(default=None, description="Commit hash where the issue was resolved")
+ resolvedDescription: Optional[str] = Field(default=None, description="Explanation of how the issue was resolved (separate from original reason)")
+ resolvedByCommit: Optional[str] = Field(default=None, description="Commit hash where the issue was resolved")
Id on Platform: 1955
Category: ⚡ Performance
File: .../service/RagOperationsServiceImpl.java:1
Issue: When a branch has no differences compared to the base branch (e.g., a newly created branch), the method returns true without persisting a 'RagBranchIndex' record. Because 'ensureBranchIndexUpToDate' only skips indexing if this record exists, it will re-trigger the VCS diff fetch on every subsequent analysis request for this branch until a change is actually committed.
💡 Suggested Fix
Persist a RagBranchIndex record even when the diff is empty. This marks the branch as 'up-to-date' at the current commit hash, preventing redundant VCS calls in future checks.
--- a/java-ecosystem/libs/rag-engine/src/main/java/org/rostilos/codecrow/ragengine/service/RagOperationsServiceImpl.java
+++ b/java-ecosystem/libs/rag-engine/src/main/java/org/rostilos/codecrow/ragengine/service/RagOperationsServiceImpl.java
@@ -445,13 +445,22 @@
String rawDiff = vcsClient.getBranchDiff(workspaceSlug, repoSlug, baseBranch, targetBranch);
+ // Get latest commit hash on target branch
+ String targetCommit = vcsClient.getLatestCommitHash(workspaceSlug, repoSlug, targetBranch);
+
if (rawDiff == null || rawDiff.isEmpty()) {
log.info("No diff between '{}' and '{}' - branch has same content as base, using main index",
baseBranch, targetBranch);
+ // Persist record to avoid redundant diff checks
+ RagBranchIndex branchIndex = new RagBranchIndex();
+ branchIndex.setProjectId(project.getId());
+ branchIndex.setBranchName(targetBranch);
+ branchIndex.setCommitHash(targetCommit);
+ branchIndex.setUpdatedAt(OffsetDateTime.now());
+ ragBranchIndexRepository.save(branchIndex);
eventConsumer.accept(Map.of(
"type", "info",
"message", String.format("No changes between %s and %s - using main branch index", baseBranch, targetBranch)
));
return true;
}
-
- // Get latest commit hash on target branch
- String targetCommit = vcsClient.getLatestCommitHash(workspaceSlug, repoSlug, targetBranch);
Id on Platform: 1956
Category: 🐛 Bug Risk
File: .../vcsclient/VcsClient.java:247
Issue: The check 'content.length() <= maxFileSizeBytes' compares the number of UTF-16 characters to a byte limit. For non-ASCII characters, the byte size will exceed the character count, potentially allowing files larger than the intended limit. Furthermore, the file is fully fetched into memory before the check, which doesn't prevent memory pressure for large files.
💡 Suggested Fix
Use a more accurate byte-size check if the limit is strictly in bytes, and consider checking file size via metadata before fetching the full content if the provider supports it.
--- a/java-ecosystem/libs/vcs-client/src/main/java/org/rostilos/codecrow/vcsclient/VcsClient.java
+++ b/java-ecosystem/libs/vcs-client/src/main/java/org/rostilos/codecrow/vcsclient/VcsClient.java
@@ -238,5 +238,5 @@
String content = getFileContent(workspaceId, repoIdOrSlug, path, branchOrCommit);
- if (content != null && content.length() <= maxFileSizeBytes) {
+ if (content != null && content.getBytes(java.nio.charset.StandardCharsets.UTF_8).length <= maxFileSizeBytes) {
results.put(path, content);
}
Id on Platform: 1958
Category: ⚡ Performance
File: .../vcsclient/VcsClientProvider.java:419
Issue: A new 'OkHttpClient' instance is created for every GitLab token refresh call. This is inefficient as it prevents connection pooling and requires setting up new thread pools and connection structures for every request.
💡 Suggested Fix
Reuse a single 'OkHttpClient' instance (e.g., as a class field or a bean) instead of creating a new one in every method call.
No suggested fix provided
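A minimal sketch of the suggested reuse, assuming nothing about VcsClientProvider beyond what the finding states: one shared OkHttpClient is built once, and per-call variants derive from it so the connection pool and dispatcher are reused instead of rebuilt per request.
import java.util.concurrent.TimeUnit;
import okhttp3.OkHttpClient;

// Hypothetical sketch; field placement and naming are assumptions.
class HttpClientHolder {

    // A single shared instance keeps the connection pool and dispatcher warm.
    static final OkHttpClient SHARED = new OkHttpClient.Builder()
            .connectTimeout(30, TimeUnit.SECONDS)
            .readTimeout(30, TimeUnit.SECONDS)
            .build();

    // Per-call variations (e.g. a longer read timeout for token refresh) can
    // derive from the shared client without duplicating its pool.
    static OkHttpClient withReadTimeout(int seconds) {
        return SHARED.newBuilder().readTimeout(seconds, TimeUnit.SECONDS).build();
    }
}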
Id on Platform: 1961
Category: 🧹 Code Quality
File: .../webhookhandler/BitbucketCloudPullRequestWebhookHandler.java:1
Issue: The pull request ID is parsed from a string to a long multiple times within the same method. This is inefficient and redundant.
💡 Suggested Fix
Parse the pull request ID once at the beginning of the method and reuse the variable.
--- a/java-ecosystem/services/pipeline-agent/src/main/java/org/rostilos/codecrow/pipelineagent/bitbucket/webhookhandler/BitbucketCloudPullRequestWebhookHandler.java
+++ b/java-ecosystem/services/pipeline-agent/src/main/java/org/rostilos/codecrow/pipelineagent/bitbucket/webhookhandler/BitbucketCloudPullRequestWebhookHandler.java
@@ -102,8 +102,9 @@
private WebhookResult handlePullRequestEvent(WebhookPayload payload, Project project, Consumer<Map<String, Object>> eventConsumer) {
String placeholderCommentId = null;
String acquiredLockKey = null;
+ long prId = Long.parseLong(payload.pullRequestId());
try {
// Try to acquire lock atomically BEFORE posting placeholder
// This prevents race condition where multiple webhooks could post duplicate placeholders
String sourceBranch = payload.sourceBranch();
Optional<String> earlyLock = analysisLockService.acquireLock(
project, sourceBranch, AnalysisLockType.PR_ANALYSIS,
- payload.commitHash(), Long.parseLong(payload.pullRequestId()));
+ payload.commitHash(), prId);
if (earlyLock.isEmpty()) {
log.info("PR analysis already in progress for project={}, branch={}, PR={} - skipping duplicate webhook",
- project.getId(), sourceBranch, payload.pullRequestId());
+ project.getId(), sourceBranch, prId);
return WebhookResult.ignored("PR analysis already in progress for this branch");
}
@@ -120,7 +121,7 @@
// Lock acquired - placeholder posting is now protected from race conditions
// Post placeholder comment immediately to show analysis has started
- placeholderCommentId = postPlaceholderComment(project, Long.parseLong(payload.pullRequestId()));
+ placeholderCommentId = postPlaceholderComment(project, prId);
// Create PR analysis request
PrProcessRequest request = new PrProcessRequest();
@@ -128,7 +129,7 @@
request.sourceBranchName = sourceBranch;
request.targetBranchName = payload.targetBranch();
request.commitHash = payload.commitHash();
- request.pullRequestId = Long.parseLong(payload.pullRequestId());
+ request.pullRequestId = prId;
request.placeholderCommentId = placeholderCommentId;
request.prAuthorId = payload.prAuthorId();
request.prAuthorUsername = payload.prAuthorUsername();
Id on Platform: 1962
Category: 🧹 Code Quality
File: .../webhookhandler/GitHubPullRequestWebhookHandler.java:123
Issue: The pull request ID is parsed from a String to a Long multiple times (lines 132, 147, 155, and 196). This is redundant and slightly inefficient.
💡 Suggested Fix
Parse the pull request ID once at the beginning of the method and store it in a variable.
--- a/java-ecosystem/services/pipeline-agent/src/main/java/org/rostilos/codecrow/pipelineagent/github/webhookhandler/GitHubPullRequestWebhookHandler.java
+++ b/java-ecosystem/services/pipeline-agent/src/main/java/org/rostilos/codecrow/pipelineagent/github/webhookhandler/GitHubPullRequestWebhookHandler.java
@@ -122,12 +122,13 @@
) {
String placeholderCommentId = null;
String acquiredLockKey = null;
+ Long prId = Long.parseLong(payload.pullRequestId());
try {
// Try to acquire lock atomically BEFORE posting placeholder
// This prevents race condition where multiple webhooks could post duplicate placeholders
String sourceBranch = payload.sourceBranch();
Optional<String> earlyLock = analysisLockService.acquireLock(
project, sourceBranch, AnalysisLockType.PR_ANALYSIS,
- payload.commitHash(), Long.parseLong(payload.pullRequestId()));
+ payload.commitHash(), prId);
if (earlyLock.isEmpty()) {
log.info("PR analysis already in progress for project={}, branch={}, PR={} - skipping duplicate webhook",
- project.getId(), sourceBranch, payload.pullRequestId());
+ project.getId(), sourceBranch, prId);
return WebhookResult.ignored("PR analysis already in progress for this branch");
}
acquiredLockKey = earlyLock.get();
// Lock acquired - placeholder posting is now protected from race conditions
// Post placeholder comment immediately to show analysis has started
- placeholderCommentId = postPlaceholderComment(project, Long.parseLong(payload.pullRequestId()));
+ placeholderCommentId = postPlaceholderComment(project, prId);
PrProcessRequest request = new PrProcessRequest();
request.project = project;
- request.pullRequestId = Long.parseLong(payload.pullRequestId());
+ request.pullRequestId = prId;
request.sourceBranchName = sourceBranch;
request.targetBranchName = payload.targetBranch();
Id on Platform: 1980
Category: 🎨 Style
File: .../index_manager/collection_manager.py:15
Issue: The imports 'TextIndexParams' and 'TokenizerType' are added but not used anywhere in the file.
💡 Suggested Fix
Remove the unused imports to keep the code clean.
--- a/python-ecosystem/rag-pipeline/src/rag_pipeline/core/index_manager/collection_manager.py
+++ b/python-ecosystem/rag-pipeline/src/rag_pipeline/core/index_manager/collection_manager.py
@@ -12,3 +12,3 @@
from qdrant_client.models import (
- Distance, VectorParams,
- CreateAlias, DeleteAlias, CreateAliasOperation, DeleteAliasOperation,
- PayloadSchemaType, TextIndexParams, TokenizerType
+ Distance, VectorParams, CreateAlias, DeleteAlias,
+ CreateAliasOperation, DeleteAliasOperation, PayloadSchemaType
)
Id on Platform: 1981
Category: ⚡ Performance
File: .../orchestrator/orchestrator.py:74
Issue: In _index_pr_files, the code falls back to f.content if f.full_content is missing. In many diff processing contexts, f.content represents the raw diff string (with + and - markers). Indexing diff hunks into a RAG system instead of clean code significantly degrades embedding quality and semantic search accuracy.
💡 Suggested Fix
Ensure that only full file content is indexed, or strip diff markers before indexing if only the diff is available.
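The analysis attached no fix diff here (as noted below). A minimal sketch of the second option, assuming a hypothetical PR-file object exposing the full_content and content attributes referenced in the issue, could strip diff markers before indexing:
def _text_for_indexing(pr_file):
    """Return clean source text for RAG indexing, never raw diff hunks.

    Hypothetical helper; `full_content` / `content` mirror the attributes
    named in the finding and may differ in the real model.
    """
    if getattr(pr_file, "full_content", None):
        return pr_file.full_content  # prefer the full file snapshot
    diff_text = getattr(pr_file, "content", "") or ""
    kept = []
    for line in diff_text.splitlines():
        # Drop diff metadata and deleted lines entirely.
        if line.startswith(("@@", "+++", "---", "diff ", "index ", "-")):
            continue
        # Keep added and context lines, with the leading diff marker removed.
        kept.append(line[1:] if line.startswith(("+", " ")) else line)
    return "\n".join(kept)
If only diffs are ever available for some providers, an alternative is to skip indexing those files entirely rather than embed hunk text.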
No suggested fix provided
Id on Platform: 1982
Category: 🧹 Code Quality
File: .../orchestrator/reconciliation.py:43
Issue: The line grouping logic int(line) // 3 is unstable for issue tracking. An issue moving from line 5 to line 6 will change its fingerprint (group 1 to group 2), while an issue moving from line 3 to line 4 will not. This makes issue persistence tracking inconsistent across small code shifts.
💡 Suggested Fix
Consider using a more robust fuzzy matching for lines or rely more heavily on the issue reason/category for deduplication if the line number is within a certain threshold.
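No fix was attached here either (noted below). One hedged sketch, using hypothetical issue dictionaries with file, category, reason, and line keys, replaces the int(line) // 3 bucket with a line-distance threshold over otherwise-stable fields:
LINE_TOLERANCE = 5  # assumption: findings within 5 lines are treated as the same issue

def same_issue(existing: dict, candidate: dict, tolerance: int = LINE_TOLERANCE) -> bool:
    """Fuzzy-match two findings without bucketing line numbers."""
    return (
        existing["file"] == candidate["file"]
        and existing["category"] == candidate["category"]
        and existing["reason"] == candidate["reason"]
        and abs(int(existing["line"]) - int(candidate["line"])) <= tolerance
    )
Matching on the stable fields first keeps the fingerprint consistent even when surrounding edits shift the code by a few lines.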
No suggested fix provided
ℹ️ Informational Notes
Id on Platform: 1953
Category: 🎨 Style
File: .../splitter/languages.py:72
Issue: The extension '.xml' is mapped to 'Language.HTML'. While similar, LangChain's 'Language' enum contains a specific 'XML' member which would be more appropriate.
💡 Suggested Fix
Change the mapping for '.xml' to 'Language.XML'.
--- a/python-ecosystem/rag-pipeline/src/rag_pipeline/core/splitter/languages.py
+++ b/python-ecosystem/rag-pipeline/src/rag_pipeline/core/splitter/languages.py
@@ -73,1 +73,1 @@
- '.xml': Language.HTML,
+ '.xml': Language.XML,
Id on Platform: 1968
Category: 🧹 Code Quality
File: .../index_manager/manager.py:254
Issue: The call to 'list_all_indices' passes 'self._collection_manager.alias_exists' as an argument, but the receiving method in 'StatsManager' does not use this parameter.
💡 Suggested Fix
Remove the unused argument from the call to align with the suggested cleanup in 'StatsManager'.
--- a/python-ecosystem/rag-pipeline/src/rag_pipeline/core/index_manager/manager.py
+++ b/python-ecosystem/rag-pipeline/src/rag_pipeline/core/index_manager/manager.py
@@ -239,3 +239,1 @@
def list_indices(self) -> List[IndexStats]:
"""List all project indices with branch breakdown."""
- return self._stats_manager.list_all_indices(
- self._collection_manager.alias_exists
- )
+ return self._stats_manager.list_all_indices()Id on Platform: 1970
Category: 🧹 Code Quality
File: .../index_manager/stats_manager.py:102
Issue: The parameter 'alias_checker' is defined in 'list_all_indices' but never used within the function body.
💡 Suggested Fix
Remove the unused parameter from the function signature.
--- a/python-ecosystem/rag-pipeline/src/rag_pipeline/core/index_manager/stats_manager.py
+++ b/python-ecosystem/rag-pipeline/src/rag_pipeline/core/index_manager/stats_manager.py
@@ -100,1 +100,1 @@
- def list_all_indices(self, alias_checker) -> List[IndexStats]:
+ def list_all_indices(self) -> List[IndexStats]:
Files Affected
- .../splitter/languages.py: 2 issues
- .../command/command_service.py: 2 issues
- .../vcsclient/VcsClientProvider.java: 2 issues
- .../core/ollama_embedding.py: 2 issues
- .../webhookhandler/GitLabMergeRequestWebhookHandler.java: 2 issues
- .../webhookhandler/GitHubPullRequestWebhookHandler.java: 2 issues
- .../ai/AiAnalysisRequestImpl.java: 2 issues
- .../service/WorkspaceService.java: 1 issue
- .../queries/rust.scm: 1 issue
- .../routers/commands.py: 1 issue