hbllmutils.meta.code.unittest_generation

Unit test generation utilities for Python code using LLMs.

This module provides tools for automatically generating unit tests from Python source code with Large Language Models (LLMs). The model analyzes the source code and produces test cases for a configurable test framework and marking strategy.

The module contains the following main components:

UnittestCodeGenerationLLMTask - Main task class for generating unit tests
create_unittest_generation_task() - Factory function for creating configured test generation tasks

Note

This module requires a configured LLM model and supports multiple test frameworks including pytest, unittest, and nose2.

Warning

Generated tests should be reviewed and validated before use in production. The LLM may miss edge cases or produce tests that are semantically incorrect.

Example:

>>> from hbllmutils.meta.code.unittest_generation import create_unittest_generation_task
>>> 
>>> # Create a task with pytest framework
>>> task = create_unittest_generation_task(
...     model='gpt-4',
...     test_framework_name='pytest',
...     mark_name='unittest'
... )
>>> 
>>> # Generate tests for a source file
>>> test_code = task.generate(
...     source_file='mypackage/calculator.py',
...     max_retries=3
... )
>>> print(test_code)

>>> # Generate tests with existing test file as reference
>>> test_code = task.generate(
...     source_file='mypackage/calculator.py',
...     test_file='tests/test_calculator_old.py'
... )

UnittestCodeGenerationLLMTask

class hbllmutils.meta.code.unittest_generation.UnittestCodeGenerationLLMTask(model: str | LLMModel, history: LLMHistory | None = None, default_max_retries: int = 5, show_module_directory_tree: bool = False, skip_when_error: bool = True, force_ast_check: bool = True, ignore_modules: Iterable[str] | None = None, no_ignore_modules: Iterable[str] | None = None)[source]

LLM task for generating unit test code from Python source files.

This class extends PythonCodeGenerationLLMTask to provide specialized functionality for generating unit tests. It analyzes source code and optionally existing test files to generate comprehensive test cases using an LLM model.

The task supports:

Generating tests from source code with full dependency analysis
Using existing test files as reference for test style and patterns
Optional module directory tree visualization for context
Configurable error handling during import analysis
Automatic AST validation of generated test code
Module filtering with ignore and no-ignore lists

Parameters:

model (LLMModelTyping) – The LLM model to use for test generation.
history (Optional[LLMHistory]) – Optional conversation history with system prompt. If None, creates new history.
default_max_retries (int) – Maximum number of retry attempts for generation and parsing. Defaults to 5.
show_module_directory_tree (bool) – If True, include module directory tree in the prompt to provide structural context. Defaults to False.
skip_when_error (bool) – If True, skip imports that fail to load during analysis instead of raising exceptions. Defaults to True.
force_ast_check (bool) – If True, validate generated code with AST parsing. Defaults to True.
ignore_modules (Optional[Iterable[str]]) – Optional iterable of module names to explicitly ignore during analysis.
no_ignore_modules (Optional[Iterable[str]]) – Optional iterable of module names that should never be ignored.

Variables:

show_module_directory_tree (bool) – Whether to include directory tree in prompts.
skip_when_error (bool) – Whether to skip failed imports during analysis.
ignore_modules (set) – Set of module names to explicitly ignore.
no_ignore_modules (set) – Set of module names that should never be ignored.
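
How the two lists interact is not spelled out here. The sketch below shows one plausible reading, in which the never-ignore list takes precedence over the ignore list and a name also covers its submodules. Both rules are assumptions, and is_ignored is a hypothetical helper, not a function of this module.

```python
from typing import Iterable, Optional


def is_ignored(
    module_name: str,
    ignore_modules: Optional[Iterable[str]] = None,
    no_ignore_modules: Optional[Iterable[str]] = None,
    default: bool = False,
) -> bool:
    """Hypothetical filter: decide whether a module is left out of the analysis."""
    ignore = set(ignore_modules or [])
    keep = set(no_ignore_modules or [])

    def matches(names: set) -> bool:
        # Assumption: a name matches the module itself and any of its submodules.
        return any(module_name == n or module_name.startswith(n + ".") for n in names)

    if matches(keep):  # assumption: the never-ignore list wins
        return False
    if matches(ignore):
        return True
    return default  # whatever the analysis would decide without either list


print(is_ignored("deprecated_module", ["deprecated_module"], ["mypackage.core"]))
print(is_ignored("mypackage.core.io", ["mypackage"], ["mypackage.core"]))
```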

Note

The generated tests should be reviewed for correctness and completeness. The LLM may not generate tests for all edge cases or complex scenarios.

Warning

Large source files with many dependencies may generate very large prompts, potentially exceeding model context limits.

Example:

>>> from hbllmutils.model import LLMModel
>>> from hbllmutils.history import LLMHistory
>>> 
>>> # Create task with custom configuration
>>> model = LLMModel(...)
>>> history = LLMHistory().with_system_prompt("Generate comprehensive pytest tests")
>>> task = UnittestCodeGenerationLLMTask(
...     model=model,
...     history=history,
...     show_module_directory_tree=True,
...     skip_when_error=True,
...     ignore_modules=['deprecated_module'],
...     no_ignore_modules=['mypackage.core']
... )
>>> 
>>> # Generate tests for a module
>>> test_code = task.generate('mypackage/calculator.py')
>>> print(test_code)

>>> # Generate with existing tests as reference
>>> test_code = task.generate(
...     source_file='mypackage/calculator.py',
...     test_file='tests/test_calculator_old.py',
...     max_retries=3
... )

__init__(model: str | LLMModel, history: LLMHistory | None = None, default_max_retries: int = 5, show_module_directory_tree: bool = False, skip_when_error: bool = True, force_ast_check: bool = True, ignore_modules: Iterable[str] | None = None, no_ignore_modules: Iterable[str] | None = None)[source]

Initialize the UnittestCodeGenerationLLMTask.

Parameters:

model (LLMModelTyping) – The LLM model to use for test generation.
history (Optional[LLMHistory]) – Optional conversation history. If None, creates new history.
default_max_retries (int) – Maximum number of retry attempts for generation and parsing. Defaults to 5.
show_module_directory_tree (bool) – Whether to include directory tree. Defaults to False.
skip_when_error (bool) – Whether to skip failed imports. Defaults to True.
force_ast_check (bool) – Whether to enforce AST validation. Defaults to True.
ignore_modules (Optional[Iterable[str]]) – Optional iterable of module names to explicitly ignore.
no_ignore_modules (Optional[Iterable[str]]) – Optional iterable of module names that should never be ignored.

generate(source_file: str, test_file: str | None = None, max_retries: int | None = None, **params)[source]

Generate unit test code for the specified source file.

This method analyzes the source file and optionally an existing test file to generate comprehensive unit tests. It creates a detailed prompt containing:

Complete source code analysis with dependencies
Optional module directory tree for structural context
Optional existing test file for reference patterns
All imported dependencies and their implementations

The generated prompt is then sent to the LLM model, which produces test code that is validated and returned.

Parameters:

source_file (str) – Path to the Python source file to generate tests for.
test_file (Optional[str]) – Optional path to existing test file to use as reference for test style and patterns. If provided, the existing tests will be included in the prompt to guide generation.
max_retries (Optional[int]) – Maximum number of retry attempts if generation fails. If None, uses the default_max_retries value.
params (dict) – Additional parameters to pass to the LLM model during generation. These may include temperature, max_tokens, etc.

Returns:

The generated unit test code as a string, validated with AST parsing when force_ast_check is enabled.

Return type:

str

Raises:

OutputParseFailed – If test generation fails after all retry attempts.
FileNotFoundError – If source_file or test_file does not exist.
SyntaxError – If the generated code has syntax errors (after retries exhausted).

Note

The method uses get_prompt_for_source_file() to analyze both the source and test files. Import failures during this analysis are skipped rather than raised when the task was constructed with skip_when_error=True.

Warning

Very large source files or complex dependency trees may generate prompts that exceed the model’s context window, potentially causing failures.
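
Before calling generate() on a large module, a rough size check can hint at whether the prompt is likely to be too big. The sketch below uses a crude characters-per-token ratio, which is an assumption: real tokenizers differ by model and content, and the actual prompt also contains dependency code that this estimate only covers if those files are passed in. Both helper names are hypothetical.

```python
from pathlib import Path
from typing import Iterable


def rough_token_count(text: str, chars_per_token: float = 4.0) -> int:
    """Very rough estimate; real tokenizers vary by model and content."""
    return int(len(text) / chars_per_token) + 1


def estimate_prompt_tokens(paths: Iterable[str]) -> int:
    """Sum rough token counts over the files expected to end up in the prompt."""
    return sum(rough_token_count(Path(p).read_text(encoding="utf-8")) for p in paths)


print(rough_token_count("def add(a, b):\n    return a + b\n"))
```

If the estimate is close to the model's context limit, leaving show_module_directory_tree disabled or adding entries to ignore_modules are the options this API offers for reducing prompt size.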

Example:

>>> task = UnittestCodeGenerationLLMTask(model, history)
>>> 
>>> # Generate tests for a simple module
>>> test_code = task.generate('mypackage/calculator.py')
>>> print(test_code)
import pytest
from mypackage.calculator import Calculator

@pytest.mark.unittest
def test_calculator_add():
    calc = Calculator()
    assert calc.add(2, 3) == 5

>>> # Generate with existing tests as reference
>>> test_code = task.generate(
...     source_file='mypackage/calculator.py',
...     test_file='tests/test_calculator_old.py'
... )

>>> # Generate with custom retry limit
>>> test_code = task.generate(
...     source_file='mypackage/complex_module.py',
...     max_retries=10
... )

>>> # Generate with model parameters
>>> test_code = task.generate(
...     source_file='mypackage/calculator.py',
...     temperature=0.7,
...     max_tokens=2000
... )

create_unittest_generation_task

hbllmutils.meta.code.unittest_generation.create_unittest_generation_task(model: str | LLMModel, show_module_directory_tree: bool = False, skip_when_error: bool = True, force_ast_check: bool = True, test_framework_name: Literal['pytest', 'unittest', 'nose2'] = 'pytest', mark_name: str | None = 'unittest', ignore_modules: Iterable[str] | None = None, no_ignore_modules: Iterable[str] | None = None) → UnittestCodeGenerationLLMTask[source]

Create a configured unit test generation task with appropriate system prompt.

This factory function creates a UnittestCodeGenerationLLMTask instance with a system prompt tailored for the specified test framework. The system prompt is loaded from a Jinja2 template and rendered with the provided configuration.

The function handles:

Loading and initializing the specified LLM model
Creating a system prompt from template with framework-specific instructions
Configuring test marking strategies (e.g., @pytest.mark.unittest)
Setting up error handling and validation options
Configuring module filtering with ignore and no-ignore lists

Parameters:

model (LLMModelTyping) – The LLM model to use. Can be a model name string, an LLMModel instance, or None to use the default model from configuration.
show_module_directory_tree (bool) – If True, include module directory tree in prompts to provide structural context. Defaults to False.
skip_when_error (bool) – If True, skip imports that fail to load during analysis instead of raising exceptions. Defaults to True.
force_ast_check (bool) – If True, validate generated code with AST parsing. Defaults to True.
test_framework_name (Literal['pytest', 'unittest', 'nose2']) – The test framework to generate tests for. Must be one of 'pytest', 'unittest', or 'nose2'. Defaults to 'pytest'.
mark_name (Optional[str]) – The pytest mark name to use for generated tests (e.g., 'unittest' will generate @pytest.mark.unittest decorators). If None or empty, no mark decorators will be added. Only applies to the pytest framework. Defaults to 'unittest'.
ignore_modules (Optional[Iterable[str]]) – Optional iterable of module names that should be explicitly ignored during dependency analysis regardless of download count or other criteria.
no_ignore_modules (Optional[Iterable[str]]) – Optional iterable of module names that should never be ignored during dependency analysis regardless of download count or other filtering criteria.
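
When mark_name is set, the generated tests carry a custom mark such as @pytest.mark.unittest. pytest warns about marks it does not know, so the mark usually needs to be registered in the project that runs the tests. One way to do that is pytest's pytest_configure hook in a conftest.py, as in this minimal sketch; the description text after the colon is only an illustration.

```python
# conftest.py
def pytest_configure(config):
    # Register the custom mark so pytest does not warn about an unknown marker.
    config.addinivalue_line(
        "markers", "unittest: unit tests generated with mark_name='unittest'"
    )
```

With the mark registered, running pytest -m unittest selects only the marked tests.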

Returns:

A configured UnittestCodeGenerationLLMTask instance ready for test generation.

Return type:

UnittestCodeGenerationLLMTask

Raises:

ValueError – If test_framework_name is not one of the supported frameworks.
FileNotFoundError – If the system prompt template file cannot be found.
TypeError – If model parameter is of invalid type.

Note

The system prompt template is loaded from 'unittest_generation.j2' in the same directory as this module. The template is rendered with the specified test framework and mark name.
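
The contents of the template are not shown in this documentation. To illustrate how the framework and mark name could shape the system prompt, the sketch below uses plain Python string building as a stand-in for Jinja2 rendering. The wording of the prompt, the function name render_system_prompt, and the SUPPORTED_FRAMEWORKS constant are all hypothetical.

```python
from typing import Optional

SUPPORTED_FRAMEWORKS = ("pytest", "unittest", "nose2")


def render_system_prompt(
    test_framework_name: str = "pytest", mark_name: Optional[str] = "unittest"
) -> str:
    """Build an illustrative system prompt; not the library's real template."""
    if test_framework_name not in SUPPORTED_FRAMEWORKS:
        raise ValueError(f"unsupported test framework: {test_framework_name!r}")
    lines = [f"Write unit tests for the given Python code using {test_framework_name}."]
    # The mark is only added for pytest, and only when a non-empty name is given.
    if test_framework_name == "pytest" and mark_name:
        lines.append(f"Decorate every test with @pytest.mark.{mark_name}.")
    return "\n".join(lines)


print(render_system_prompt("pytest", "unittest"))
print(render_system_prompt("unittest", None))
```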

Warning

Different test frameworks have different capabilities and syntax. Ensure the LLM model is capable of generating tests for the specified framework.

Example:

>>> # Create task with pytest framework
>>> task = create_unittest_generation_task(
...     model='gpt-4',
...     test_framework_name='pytest',
...     mark_name='unittest'
... )
>>> test_code = task.generate('mypackage/calculator.py')

>>> # Create task with unittest framework
>>> task = create_unittest_generation_task(
...     model='gpt-4',
...     test_framework_name='unittest',
...     mark_name=None
... )
>>> test_code = task.generate('mypackage/calculator.py')

>>> # Create task without pytest marks
>>> task = create_unittest_generation_task(
...     model='gpt-4',
...     test_framework_name='pytest',
...     mark_name=None
... )

>>> # Create task with directory tree visualization
>>> task = create_unittest_generation_task(
...     model='gpt-4',
...     show_module_directory_tree=True,
...     test_framework_name='pytest'
... )

>>> # Create task with custom error handling
>>> task = create_unittest_generation_task(
...     model='gpt-4',
...     skip_when_error=False,
...     force_ast_check=True
... )

>>> # Create task with module filtering
>>> task = create_unittest_generation_task(
...     model='gpt-4',
...     ignore_modules=['deprecated_module', 'legacy_code'],
...     no_ignore_modules=['mypackage.core', 'mypackage.utils']
... )

>>> # Use existing model instance
>>> from hbllmutils.model import RemoteLLMModel
>>> my_model = RemoteLLMModel(base_url='...', api_token='...', model_name='gpt-4')
>>> task = create_unittest_generation_task(
...     model=my_model,
...     test_framework_name='pytest'
... )

>>> # Use default model from configuration
>>> task = create_unittest_generation_task(
...     model=None,
...     test_framework_name='pytest'
... )