Feature: Add glob pattern support, fixes #49 by Houston56 · Pull Request #55 · tedivm/paracelsus

Houston56 · 2026-01-18T10:01:47Z

Summary

Hi! This PR adds glob pattern support to --import-module, addressing #49. Users can now use glob patterns to discover and import modules instead of listing each path.

Changes

Core Features

Glob pattern support in --import-module:
- * - matches any string at one level (e.g., example.*.models)
- ? - matches a single character (e.g., example.fo?.models)
- ** - recursive wildcard matching zero or more levels (e.g., example.**.api.*.models)
- [abc] - character class (e.g., example.api.v[12].models)
- [a-z] - character range (e.g., example.api.v[0-9].models)
- [!a] - negated character class (e.g., example.api.v[!1].models)
Wildcard import support: Patterns ending with :* perform from module import * for each matching module
- Example: "src.domains.*.models:*" finds all matching modules and imports each with from <module> import *
Glob pattern support for base class path: The base_class_path argument now also supports glob patterns for finding multiple base classes
- Example: "project*.example.base:Base" will find and merge metadata from all matching base classes
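The pattern semantics listed above can be sketched with the standard library alone. This is an illustrative matcher, not the PR's `Pattern` class: the helper names `matches` and `split_import_spec` are hypothetical, and it assumes each dot-separated segment is matched with `fnmatch.fnmatchcase`, with `**` handled separately as "zero or more whole levels".

```python
from fnmatch import fnmatchcase
from typing import List, Tuple


def split_import_spec(spec: str) -> Tuple[str, bool]:
    """Split "pkg.*.models:*" into the module pattern and a star-import flag."""
    module_pattern, sep, attr = spec.partition(":")
    return module_pattern, sep == ":" and attr == "*"


def _match(pats: List[str], names: List[str]) -> bool:
    if not pats:
        return not names
    head, rest = pats[0], pats[1:]
    if head == "**":
        # ** may absorb zero or more leading levels of the module name.
        return any(_match(rest, names[i:]) for i in range(len(names) + 1))
    if not names:
        return False
    # A single segment never contains a dot, so "*" stays within one level.
    return fnmatchcase(names[0], head) and _match(rest, names[1:])


def matches(pattern: str, module_name: str) -> bool:
    """Return True if the dotted module name satisfies the dotted glob pattern."""
    return _match(pattern.split("."), module_name.split("."))
```

Under these assumptions, `matches("example.**.api.*.models", "example.api.v1.models")` is true because `**` can match zero levels, while `matches("example.*.models", "example.a.b.models")` is false because `*` covers exactly one level.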

Implementation Details

Pattern validation (paracelsus/models/pattern.py):
- New Pattern class with grammar validation
- Prevents ambiguous patterns where ** is followed by *
- Validates pattern syntax before processing
Module finder (paracelsus/finders.py):
- New ModuleFinder class using BFS traversal
- Supports namespace packages (PEP 420)
- Prevents infinite loops with symlinks and redundant paths
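A breadth-first walk with those three properties might look roughly like the sketch below. It is not the PR's `ModuleFinder`; the function name `find_modules` is hypothetical, and it assumes that any directory with an identifier-like name counts as a (possibly namespace) package and that loops are avoided by remembering resolved directory paths.

```python
from collections import deque
from pathlib import Path
from typing import Deque, List, Set, Tuple


def find_modules(root: Path) -> List[str]:
    """Breadth-first walk returning dotted names of modules found under root."""
    root = root.resolve()
    visited: Set[Path] = set()
    queue: Deque[Tuple[Path, Tuple[str, ...]]] = deque([(root, ())])
    found: List[str] = []
    while queue:
        directory, prefix = queue.popleft()
        real = directory.resolve()
        if real in visited:
            continue  # symlink loop or a path already reached another way
        visited.add(real)
        for entry in sorted(directory.iterdir()):
            if entry.is_dir():
                # No __init__.py required, so PEP 420 namespace packages are walked too.
                if entry.name.isidentifier() and entry.name != "__pycache__":
                    queue.append((entry, prefix + (entry.name,)))
            elif entry.suffix == ".py" and entry.stem.isidentifier():
                if entry.stem == "__init__":
                    if prefix:
                        found.append(".".join(prefix))  # the package itself
                else:
                    found.append(".".join(prefix + (entry.stem,)))
    return found
```

Tracking resolved paths rather than the paths as written is what stops a symlink such as `pkg/loop -> pkg` from being expanded indefinitely.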
Graph building refactor (paracelsus/graph.py):
- New function: get_graph_metadata() - separates graph building logic from serialization
- New function: serialize_metadata() - handles metadata serialization
- Refactored: get_graph_string() - now a convenience wrapper combining get_graph_metadata() and serialize_metadata() for backward compatibility
- New function: _find_base_classes_by_pattern() - finds multiple base classes by glob pattern
- New function: _merge_metadata() - merges MetaData from multiple base classes with conflict resolution using path-based prefixes
- New function: to_module_name() - converts filesystem paths to Python module names
- New function: consume_import_tasks() - threaded module import worker
- Updated import logic: Replaced simple loop with pattern-aware ModuleFinder and threaded import queue
- Updated base class logic: Added support for glob patterns in base_class_path with automatic metadata merging
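To illustrate the path-to-module conversion mentioned above, here is a minimal sketch. The PR's actual `to_module_name()` signature is not shown in this conversation, so the two-argument form below (file path plus project root) is an assumption made for the example.

```python
from pathlib import Path


def to_module_name(path: Path, root: Path) -> str:
    """Convert a file path under root into a dotted Python module name."""
    relative = path.relative_to(root)
    parts = list(relative.with_suffix("").parts)
    # A package's __init__.py is imported under the package's own name.
    if parts and parts[-1] == "__init__":
        parts.pop()
    return ".".join(parts)
```

For example, `to_module_name(Path("proj/src/domains/users/models.py"), Path("proj"))` gives `"src.domains.users.models"`.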

Examples

# Basic usage from Issue #49
paracelsus inject docs/database.md src.infra.orm:Base \
  --import-module "src.domains.*.models:*"

# Recursive lookup
paracelsus graph example.base:Base \
  --import-module "example.**.api.*.models"

# Multiple patterns
paracelsus graph example.base:Base \
  --import-module "example.domain.*.models" \
  --import-module "example.api.v[0-9].models"

# Base class pattern (namespace packages)
paracelsus graph "project*.example.base:Base" \
  --import-module "project*.example.*.models"

Testing

Tests covering all pattern types
Tests for nested package structures
Tests for namespace packages (PEP 420)
Integration tests with SQLAlchemy models
Validation error tests for invalid patterns

Related Issues

Closes #49

TheLazzziest · 2026-01-18T11:03:41Z

paracelsus/graph.py

+        if any(pattern.errors):
+            raise ValueError(pattern.serialized_errors)
+
+        current_root = Path.cwd()


This variable can be dropped; it is already defined on line 194.

tedivm · 2026-01-18T14:37:46Z

Before this can be merged, can you please do some small cleanup:

Resolve the conflicts with the main branch so that this PR can be "squash merged".
Fix the mypy issues.

Thanks!

Houston56 · 2026-01-18T16:47:07Z

Done!

Updated branch with upstream/main (no conflicts)
Fixed mypy errors by adding explicit type hints

TheLazzziest · 2026-01-18T17:45:19Z

Hi @tedivm! A quick question about the docs: should we update them in this PR, or would a separate PR work as well?

tedivm · 2026-01-19T15:43:03Z

paracelsus/graph.py

+    import_queue_sentinel = object()
+    import_queue: Queue[Union[Dict[str, str], object]] = Queue()
+    import_worker = Thread(target=consume_import_tasks, args=(import_queue, import_queue_sentinel), daemon=True)
+    import_worker.start()
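The diff above starts a daemon thread that drains a queue until it sees a sentinel object. A minimal sketch of that pattern next to a single-threaded equivalent is shown below. The names `drain_imports` and `import_all` and the `{"module": ...}` task shape are assumptions for illustration; the PR's `consume_import_tasks` body is not shown in this conversation.

```python
import importlib
from queue import Queue
from threading import Thread
from typing import Dict, Iterable, List, Union

Task = Union[Dict[str, str], object]
SENTINEL = object()


def drain_imports(tasks: "Queue[Task]", sentinel: object, done: List[str]) -> None:
    """Worker loop: import each queued module until the sentinel arrives."""
    while True:
        task = tasks.get()
        if task is sentinel:
            break
        assert isinstance(task, dict)
        importlib.import_module(task["module"])  # hypothetical task key
        done.append(task["module"])


def import_all(module_names: Iterable[str], threaded: bool = False) -> List[str]:
    """Import modules either through a background worker or in a plain loop."""
    done: List[str] = []
    if threaded:
        tasks: "Queue[Task]" = Queue()
        worker = Thread(target=drain_imports, args=(tasks, SENTINEL, done), daemon=True)
        worker.start()
        for name in module_names:
            tasks.put({"module": name})
        tasks.put(SENTINEL)  # tell the worker to stop
        worker.join()
    else:
        for name in module_names:
            importlib.import_module(name)
            done.append(name)
    return done
```

Both branches import the same modules in the same order; the threaded one only changes when the work happens relative to the producer, which is why a benchmark is needed to judge whether it pays for the extra moving parts.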


I have concerns about making this application threaded. There are some platforms which do not support threading, and it adds additional complexity to the application. I'm also not sure there will be much performance increase. Can you make this single-threaded for now, and then we can talk about introducing threading as a separate PR? Or can you add some benchmarks showing that the complexity is worth it from a performance standpoint?

TheLazzziest · 2026-01-23

Hey @tedivm. Thanks for the question. Unfortunately, I don't have any benchmarks to provide, but I can share the reasoning behind this decision.

Since import is thread-safe, a background worker can "warm up" sys.modules by handling the I/O-heavy operations (especially for deep dependency trees, as in Django applications) in parallel. This hides the disk latency from the main execution thread and lets module discovery run concurrently: while the finder keeps walking the project, the import cache warms up progressively, which should reduce the initialization time of the context.

However, simplicity and portability are the better bet in this case, so I'd rather not over-engineer this if you prefer a leaner codebase.

We can revisit the threading/concurrency logic later as a separate, data-driven improvement.

TheLazzziest · 2026-01-30

@tedivm, can we close this thread and move on, or are there still some questions left?

tedivm · 2026-01-30

Sorry, busy week at work, but I'm planning on reviewing this over the weekend.

Houston56 · 2026-02-21T03:00:07Z

Hi @tedivm ! Just a quick bump on this PR. Thanks!

Apti and others added 20 commits December 11, 2025 18:19

WIP: add glob pattern support for --import-module

952199d

Implemented tests

b8d6f62

Refactor fixtures to use asset templates

e1d0d17

refactor: use singledispatch for test path utilities

ba08efe

refactor and extend glob pattern matching with advanced features

fbf6cb0

Feature, Add Pattern model

559b393

Feature, Add ModuleFinder

7f35f75

Feature, Fix validation issues

e6afae3

Feature, Reimplment module lookup block. Update tests

8c3a769

Feature, Add pre-commit to optional dependencies

2a0fc66

Feature, Improve the method description for state processing

eabad47

Feature, Replace pool executor with a separate thread

408019b

Feature, Add validation rules for pattern masks

de0a598

Feature, Add base.py module to the namespace case

8eb7a61

Feature, Fix validation tests for patterns. Add tests for get_graph_string

2156ce1

Feature, Remove do_import

c5d99f2

Merge branch 'main' into feature/glob-pattern-support

7d0716e

Separate graph building from serialization and add dynamic comparison

ca98f29

fix format

26a903e

Merge remote-tracking branch 'origin/feature/glob-pattern-support' into feature/glob-pattern-support

04c83e8

TheLazzziest reviewed Jan 18, 2026

Houston56 added 2 commits January 18, 2026 19:33

Merge remote-tracking branch 'upstream/main' into feature/glob-pattern-support

ce11e2d

Fix mypy type errors

c26352d

tedivm requested changes Jan 19, 2026

Refactor: make module imports single-threaded

85da90a

Houston56 requested a review from tedivm January 24, 2026 10:05

Houston56 wants to merge 23 commits into tedivm:main from Houston56:feature/glob-pattern-support
