mirror of https://github.com/LearningCircuit/local-deep-research.git synced 2026-06-15 19:46:56 +03:00

Files

LearningCircuit b1cbc6fbe0 fix(llm): normalize str-returns to a message in the central LLM wrapper (+ ainvoke) (#4342 )

* fix(llm): normalize str-returns to a message in ProcessingLLMWrapper + add ainvoke

The central wrapper (returned by all get_llm paths) stripped <think> tags but
returned a bare str when the base LLM returned a str — that inconsistent shape is
the root of the recurring "'str' object has no attribute 'content'" crashes we've
been fixing site-by-site (#3884 -> #4339).

Generic fix at the choke point:
- invoke(): when the base returns a bare str, wrap it into AIMessage(content=stripped)
  instead of returning a str. Message returns are unchanged (mutate .content in place,
  preserving additional_kwargs/reasoning_content/tool_calls). Other types pass through.
- add ainvoke(): mirrors invoke(); without it, the 7 direct .ainvoke() sites
  (browsecomp_entity/modular strategies) bypassed think-stripping via __getattr__.

Now every get_llm LLM yields a think-free str .content on both sync and async direct
calls, so the raw .invoke().content sites are safe automatically (deferred per-site
migration cancelled). Reasoning-safe: only .content is rewritten, so DeepSeek
thinking-mode reasoning_content round-tripping (#4194) is not worsened.

Limitation: the LangGraph create_agent path binds tools on the base model
(model.bind_tools via __getattr__), so it bypasses this wrapper — unchanged by this PR.

Tests: updated the 2 tests asserting a str return; added shape, reasoning_content/
tool_calls-preservation (#4194 guard), and ainvoke regression tests.
mypy 552 clean; ruff clean; 2171 passed across 78 LLM-layer test files + citation_handlers.

* refactor(llm): extract _log_llm_error helper + add type hints (review polish)

Addresses the #4342 review recommendations:
- DRY: invoke() and ainvoke() shared the same try/except error-logging verbatim;
  extracted a _log_llm_error(error) static helper so they can't diverge.
- Type hints: added annotations to _normalize_response/_log_llm_error/invoke/ainvoke.
No behavior change. ruff + mypy clean; 112 config tests pass.

2026-05-25 16:47:11 +02:00

accessibility_tests

fix(security): bump transitive qs to >= 6.15.2 (GHSA-q8mj-m7cp-5q26) (#4211 )

2026-05-22 21:24:21 +02:00

advanced_search_system

fix(synthesis): strip <think> in synthesize_findings + knowledge gen; guard empty extractor answers (#4336 )

2026-05-25 16:37:18 +02:00

api

…

api_tests

test: gate test_research_creation.py with @pytest.mark.requires_llm (#4288 )

2026-05-25 00:29:55 +02:00

api_tests_with_login

test: delete tier-3 dead-code + stdlib shadow + redundant API tests (#4269 )

2026-05-24 22:50:27 +02:00

auth_tests

test: tighten 1 status-or-tautology + delete 3 hasattr-only tests (#4271 )

2026-05-25 00:26:41 +02:00

benchmarks

fix(tests): narrow broad status-code tautologies in test_benchmark_routes.py (#4137 )

2026-05-24 09:11:46 +02:00

…

citation_bugs

…

citation_handlers

fix(synthesis): strip <think> in synthesize_findings + knowledge gen; guard empty extractor answers (#4336 )

2026-05-25 16:37:18 +02:00

config

fix(llm): normalize str-returns to a message in the central LLM wrapper (+ ainvoke) (#4342 )

2026-05-25 16:47:11 +02:00

content_fetcher

test: replace 4 more tautological asserts with real contracts (#4249 )

2026-05-24 22:37:17 +02:00

core

test: remove 2 shadow files in tests/core/ (#4122 )

2026-05-18 23:48:36 +02:00

database

test(database): migrate 2 credential_store TTL tests to freezegun (#4235 )

2026-05-24 09:33:03 +02:00

defaults

…

deletion

…

document_loaders

test: replace 4 more tautological asserts with real contracts (#4249 )

2026-05-24 22:37:17 +02:00

domain_classifier

test: replace 3 tautology asserts in test_domain_classifier with real checks (#4248 )

2026-05-24 22:36:43 +02:00

embeddings

fix(embeddings): normalize OpenAI base_url before hostname check (#4310 )

2026-05-25 00:09:40 +02:00

error_handling

fix(errors): dispatch RateLimitError to RATE_LIMIT_ERROR, not MODEL_ERROR (#4086 )

2026-05-20 23:13:19 +02:00

exporters

…

feature_tests

test: delete script-style placeholder test_custom_context.py (#4244 )

2026-05-24 22:21:28 +02:00

fix_tests

…

fixtures

…

followup_research

chore: delete unused FollowUpResponse dataclass (#4167 )

2026-05-23 11:48:28 +02:00

fuzz

…

health_check

…

hooks

test(hooks): cover compound-selector regression in CSS class prefix hook (#3702 )

2026-04-27 23:17:27 +02:00

infrastructure_tests

chore(deps-dev): bump jest in /tests/infrastructure_tests (#3922 )

2026-05-09 13:14:26 +02:00

integration

fix(tests): add engine.dispose() / db.reset() cleanup to journal quality tests (#3760 )

2026-05-01 19:27:53 +02:00

journal_quality

fix(tests): add engine.dispose() / db.reset() cleanup to journal quality tests (#3760 )

2026-05-01 19:27:53 +02:00

test(js): add window exports + vitest coverage for 3 Tier-2 pure helpers (#4312 )

2026-05-25 13:35:20 +02:00

langchain_integration

…

ldr-news-dev-files/redundant-tests

…

library

chore: delete unused get_retry_statistics monitoring method (#4172 )

2026-05-23 11:43:12 +02:00

llm

fix(llm): remove silent gemma3:12b fallback for Ollama model (#3670 )

2026-05-01 15:28:16 +02:00

llm_providers

feat(lmstudio): add optional API key support for authenticated instances (#3573 ) (#3740 )

2026-05-10 00:02:52 +02:00

mcp

test: remove 2 more redundant tests (MCP client + evidence analyzer) (#4261 )

2026-05-24 22:49:36 +02:00

metrics

test: delete 10 hasattr-only tests in test_research_metrics_extended.py (#4277 )

2026-05-25 00:27:13 +02:00

news

test: strengthen overmocked rating_storage.create test to verify field mapping (#4246 )

2026-05-25 09:08:05 +02:00

notifications

feat(notifications): default-off + env-only master switch for SSRF rebinding risk (#3675 )

2026-04-27 23:18:00 +00:00

pdf_tests

fix(tests): narrow broad status-code tautologies in 3 misc test files (#4165 )

2026-05-23 11:40:09 +02:00

performance

fix(content-fetcher): disable JS rendering by default (#3826 ) (#3971 )

2026-05-16 14:20:14 +02:00

programmatic_access

…

puppeteer

chore(deps): bump mocha from 11.7.5 to 11.7.6 in /tests/puppeteer (#4205 )

2026-05-23 10:35:48 +02:00

rate_limiting

…

report

refactor(citation): route LLM responses through get_llm_response_text (#4334 )

2026-05-25 11:16:45 +02:00

research_library

ci+test: retry transient network installs, fix patch.object race (#4302 )

2026-05-24 18:16:20 +02:00

research_scheduler

refactor(scheduler): inline DocumentSchedulerUtil into routes (#3750 )

2026-05-01 13:14:49 +02:00

retriever_integration

…

routes

fix(tests): narrow broad status-code tautologies in 3 misc test files (#4165 )

2026-05-23 11:40:09 +02:00

search_engines

fix(tests): two release-pipeline failures — UA assertions + WebKit budget (#4232 )

2026-05-24 08:58:09 +02:00

searxng

…

security

test(security): mark 10 placeholder tests as skipped instead of silently passing (#4233 )

2026-05-24 09:32:01 +02:00

settings

chore(settings): cleanup after #4222 — drop stale defaults exemption, align index_type casing (#4227 )

2026-05-23 13:13:47 +02:00

storage

…

strategies

refactor: delete 3 dead strategy files and 4 orphaned test files (~3,200 lines) (#3147 )

2026-05-19 21:54:10 +02:00

test_llm

fix(llm): remove silent gemma3:12b fallback for Ollama model (#3670 )

2026-05-01 15:28:16 +02:00

test_utilities

…

text_optimization

refactor(citation): pass collections explicitly to source-tagged formatter (#4096 )

2026-05-17 20:31:40 +02:00

text_processing

…

theme_tests

test: delete 17 NO_FAILURE_PATH theme tests that always passed silently (#4252 )

2026-05-25 11:27:31 +02:00

ui_tests

test: delete 5 ui_tests/test_uuid* / test_trace_error / test_mixed_id script tests (#4274 )

2026-05-24 18:17:34 +02:00

unit

…

utilities

test: migrate 3 search_cache TTL tests from time.sleep to freezegun (#4289 )

2026-05-25 11:29:47 +02:00

web

test: remove 7 no-assertion / placeholder tests across 7 files (#4247 )

2026-05-24 22:22:05 +02:00

web_search_engines

test: remove 6 duplicate tests from search_engine_factory coverage files (#4254 )

2026-05-24 22:39:00 +02:00

web_services

fix(pdf): render CJK characters in exported PDFs (#4055 ) (#4058 )

2026-05-16 13:12:28 +02:00

__init__.py

…

CI_INTEGRATION.md

…

conftest.py

refactor(scheduler): inline DocumentSchedulerUtil into routes (#3750 )

2026-05-01 13:14:49 +02:00

download_stuff_for_local.py

…

mock_fixtures.py

…

mock_llm_config.py

…

mock_modules.py

…

package-lock.json

chore(deps): bump mocha from 11.7.5 to 11.7.6 in /tests (#4206 )

2026-05-22 20:24:04 +02:00

package.json

chore(deps): bump mocha from 11.7.5 to 11.7.6 in /tests (#4206 )

2026-05-22 20:24:04 +02:00

README.md

…

run_all_tests.py

fix(tests): update remaining stale paths from PR #3538 rename (#3642 )

2026-04-25 23:38:38 +02:00

run_followup_tests.sh

…

test_api_key_configuration.py

…

test_api_key_frontend_settings.py

…

test_api_key_settings.js

chore(lint): enable no-unused-vars + mechanical cleanup (#3536 )

2026-04-19 15:44:35 +02:00

test_api_settings_advanced.py

test: remove 7 no-assertion / placeholder tests across 7 files (#4247 )

2026-05-24 22:22:05 +02:00

test_api_settings_e2e.py

test: remove 7 no-assertion / placeholder tests across 7 files (#4247 )

2026-05-24 22:22:05 +02:00

test_api_settings_validation.py

…

test_api_settings.py

…

test_ci_config.py

…

test_citation_handler.py

…

test_context_overflow_detection.py

test: remove 7 no-assertion / placeholder tests across 7 files (#4247 )

2026-05-24 22:22:05 +02:00

test_database_initialization.py

…

test_followup_api.py

fix(tests): patch all middleware db_manager bindings in followup API tests (#3602 )

2026-04-24 20:17:09 +02:00

test_google_pse.py

…

test_link_analytics.py

test: remove 7 no-assertion / placeholder tests across 7 files (#4247 )

2026-05-24 22:22:05 +02:00

test_llm_provider_integration.py

test: delete mock-roundtrip test in test_llm_provider_integration.py (#4276 )

2026-05-24 15:07:09 +02:00

test_llm_rate_limiting.py

…

test_openai_api_key_e2e.py

…

test_openai_api_key_usage.py

…

test_openai_endpoint_api_key.py

fix(ui): clarify openai_endpoint API key is optional for local servers (#3908 )

2026-05-09 10:41:14 +02:00

TEST_OPTIMIZATION.md

…

TEST_ORGANIZATION.md

…

test_programmatic_custom_llm_retriever.py

…

test_reexport_modules.py

…

test_report_generator_coverage.py

test(citation): fix broken think-tag test + harden coverage after #4334 (#4335 )

2026-05-25 12:01:47 +02:00

test_report_generator_edge_cases.py

…

test_report_generator.py

fix(test): update report generator tests to expect numbered headings (#3604 )

2026-04-23 21:11:45 +02:00

test_search_cache_stampede.py

…

test_search_engines_enhanced.py

…

test_search_system_factory_high_value.py

…

test_search_system_factory.py

…

test_search_system_gaps.py

test: quality cleanup — stop tests passing when the SUT misbehaves (#3970 )

2026-05-10 15:10:06 +02:00

test_search_system.py

…

test_settings_manager.py

fix(settings): persist Embeddings page changes and unblock OpenAI test (#4208 ) (#4212 )

2026-05-22 21:23:40 +02:00

test_token_counter_coverage.py

fix(tests): align missed truncation_ratio assertion in tests/test_token_counter_coverage.py (#3845 )

2026-05-07 20:03:32 +02:00

test_url_utils_simple.py

…

test_url_utils.py

…

test_utils.py

…

test_wikipedia_url_security.py

…

TESTING_COMPARISON.md

…

TESTING_PROPOSAL.md

…

README.md

Testing Guide for Local Deep Research

This document provides a comprehensive guide to running tests in the Local Deep Research project.

Quick Start

# Fast feedback loop (< 30 seconds)
python tests/run_all_tests.py fast

# Standard development testing (< 5 minutes)
python tests/run_all_tests.py standard

# Full comprehensive testing (< 15 minutes)
python tests/run_all_tests.py full

# Run with external server (skip automatic startup)
python tests/run_all_tests.py standard --no-server-start

# Unit tests only (no server needed)
python tests/run_all_tests.py unit-only

Test Structure

The project uses a multi-layered testing approach with different types of tests organized by purpose and execution speed:

Test Categories

Category	Location	Purpose	Duration	Dependencies
Health Checks	`tests/health_check/`	Fast endpoint validation	5-30s	Server running
Unit Tests	`tests/test_*.py`	Component isolation testing	30-60s	None
Feature Tests	`tests/feature_tests/`	Feature-specific validation	60-120s	Test DB
Integration Tests	`tests/searxng/`, `tests/fix_tests/`	External service testing	60-180s	External APIs
UI Tests	`tests/ui_tests/`	Browser automation	120-300s	Server + Node.js

Test Technologies

Python: pytest with coverage, requests for HTTP testing
JavaScript: Puppeteer for browser automation
Shell: curl-based health checks for minimal dependencies

Test Execution Profiles

1. Fast Profile (`fast`)

Purpose: Rapid feedback during development Duration: < 30 seconds Includes: Health checks + Unit tests

python tests/run_all_tests.py fast

2. Standard Profile (`standard`)

Purpose: Regular development workflow Duration: < 5 minutes Includes: Fast + UI tests (core workflows)

python tests/run_all_tests.py standard

3. Full Profile (`full`)

Purpose: Comprehensive validation before releases Duration: < 15 minutes Includes: All tests including external integrations

python tests/run_all_tests.py full

4. CI Profile (`ci`)

Purpose: Continuous integration optimized Duration: < 2 minutes Includes: Fast + selected stable tests

python tests/run_all_tests.py ci

5. Unit-Only Profile (`unit-only`)

Purpose: Pure unit testing without server dependencies Duration: < 10 seconds Includes: Unit and feature tests only

python tests/run_all_tests.py unit-only

Individual Test Runners

Health Checks

Fast endpoint validation to ensure the server is responding correctly:

# Python version (auto-detects running server)
python tests/health_check/run_quick_health_check.py

# Shell version (minimal dependencies)
bash tests/health_check/test_endpoints_health.sh

Python Tests

Unit and integration tests using pytest:

# Run all Python tests with coverage
python run_tests.py

# Run specific test categories
pytest tests/test_*.py -v                    # Unit tests only
pytest tests/feature_tests/ -v               # Feature tests only
pytest tests/searxng/ -v                     # Integration tests only

UI Tests

Browser automation tests using Puppeteer:

# Run all UI tests
node tests/ui_tests/run_all_tests.js

# Run individual UI tests
node tests/ui_tests/test_cost_analytics.js   # Cost analytics page
node tests/ui_tests/test_settings_page.js    # Settings functionality
node tests/ui_tests/test_metrics_charts.js   # Chart visualizations

Prerequisites

Required for All Tests

Python 3.8+ with project dependencies installed
Local Deep Research server running on http://127.0.0.1:5000

Additional Requirements by Test Type

UI Tests:

Node.js (for Puppeteer)
Chrome/Chromium browser
Server must be running and accessible

Integration Tests:

Network access for external APIs
Valid API keys (if testing external search engines)
SearXNG instance (for SearXNG integration tests)

Health Checks:

curl (for shell version)
requests library (for Python version)

Running Tests in Development

Before Committing Code

# Quick validation
python tests/run_all_tests.py fast

# If fast tests pass, run standard
python tests/run_all_tests.py standard

Before Creating a Pull Request

# Run comprehensive tests
python tests/run_all_tests.py full

Debugging Failed Tests

# Run with verbose output
pytest tests/ -v -s

# Run specific failing test
pytest tests/test_specific_test.py::test_function -v -s

# UI test debugging (saves screenshots)
node tests/ui_tests/test_specific_ui.js
# Check tests/ui_tests/screenshots/ for visual debugging

Test Configuration

pytest Configuration

Configuration is handled in:

pyproject.toml - pytest settings and coverage configuration
tests/conftest.py - test fixtures and database mocking
.coveragerc - coverage reporting settings

UI Test Configuration

Puppeteer tests are configured with:

3-second navigation timeout for faster execution
Screenshot capture for debugging
Automatic retry for flaky network operations

Environment Variables

# Set Python path for proper imports
export PYTHONPATH=/path/to/local-deep-research

# Optional: Configure test database
export TEST_DATABASE_URL=sqlite:///test.db

# Optional: Skip slow tests
export SKIP_SLOW_TESTS=1

Continuous Integration

GitHub Actions / CI Pipeline

Recommended CI test strategy:

# Fast checks on every PR
- name: Fast Tests
  run: python tests/run_all_tests.py ci

# Full validation before merge
- name: Full Tests
  run: python tests/run_all_tests.py full
  if: github.event_name == 'push' && github.ref == 'refs/heads/main'

Local Pre-commit Hooks

Add to .pre-commit-config.yaml:

- repo: local
  hooks:
    - id: fast-tests
      name: Fast Tests
      entry: python tests/run_all_tests.py fast
      language: system
      pass_filenames: false

Test Data and Fixtures

Database Testing

Tests use isolated SQLite databases with fixtures defined in tests/conftest.py:

Automatic rollback after each test
Mock data for consistent testing
No impact on production data

UI Test Screenshots

UI tests automatically capture screenshots:

Saved to tests/ui_tests/screenshots/
Useful for debugging visual issues
Automatically cleaned up after successful runs

External API Mocking

Integration tests can use mocked responses:

Real API calls in integration environment
Mocked responses for unit tests
Configurable via environment variables

Troubleshooting

Common Issues

"Server not running" error:

# Option 1: Start server manually, then run tests
pdm run ldr-web

# In another terminal:
python tests/run_all_tests.py standard --no-server-start

Server startup hangs during tests:

# Skip automatic server startup and start manually
pdm run ldr-web &  # Start in background

# Run tests without automatic server startup
python tests/run_all_tests.py standard --no-server-start

"Node.js not found" error:

# Install Node.js (Ubuntu/Debian)
sudo apt install nodejs npm

# Install Node.js (macOS)
brew install node

# Verify installation
node --version

Import errors in tests:

# Ensure PYTHONPATH is set
export PYTHONPATH=$(pwd)
python tests/run_all_tests.py fast

Puppeteer browser launch failures:

# Install missing dependencies (Ubuntu/Debian)
sudo apt install chromium-browser

# Or use bundled Chromium
npm install puppeteer

Performance Issues

Tests running slowly:

Use fast profile for development
Check network connectivity for integration tests
Verify server performance with health checks

UI tests timing out:

Increase timeout in individual test files
Check browser developer tools for JavaScript errors
Verify server is responding quickly

Test Coverage

Generate detailed coverage reports:

# HTML coverage report
python run_tests.py
open coverage_html/index.html

# Terminal coverage report
pytest tests/ --cov=src --cov-report=term-missing

Adding New Tests

Unit Tests

Add to tests/test_new_feature.py:

import pytest
from src.local_deep_research.module import function

def test_new_function():
    assert function("input") == "expected_output"

UI Tests

Add to tests/ui_tests/test_new_ui_feature.js:

const puppeteer = require('puppeteer');

(async () => {
    const browser = await puppeteer.launch();
    const page = await browser.newPage();

    await page.goto('http://127.0.0.1:5000/new-page');
    await page.waitForSelector('.new-feature');

    console.log('✅ New UI feature test passed');
    await browser.close();
})();

Integration Tests

Add to tests/test_new_integration.py:

import pytest
import requests

def test_external_api_integration():
    # Test real API integration
    response = requests.get("https://api.example.com/data")
    assert response.status_code == 200

Summary

The Local Deep Research testing framework provides multiple execution profiles to balance thoroughness with speed. Use the run_all_tests.py script for orchestrated testing, or run individual test suites for targeted debugging. The modular approach ensures you can quickly validate changes during development while maintaining comprehensive coverage for releases.

README.md

Testing Guide for Local Deep Research

Quick Start

Test Structure

Test Categories

Test Technologies

Test Execution Profiles

1. Fast Profile (fast)

2. Standard Profile (standard)

3. Full Profile (full)

4. CI Profile (ci)

5. Unit-Only Profile (unit-only)

Individual Test Runners

Health Checks

Python Tests

UI Tests

Prerequisites

Required for All Tests

Additional Requirements by Test Type

Running Tests in Development

Before Committing Code

Before Creating a Pull Request

Debugging Failed Tests

Test Configuration

pytest Configuration

UI Test Configuration

Environment Variables

Continuous Integration

GitHub Actions / CI Pipeline

Local Pre-commit Hooks

Test Data and Fixtures

Database Testing

UI Test Screenshots

External API Mocking

Troubleshooting

Common Issues

Performance Issues

Test Coverage

Adding New Tests

Unit Tests

UI Tests

Integration Tests

Summary

1. Fast Profile (`fast`)

2. Standard Profile (`standard`)

3. Full Profile (`full`)

4. CI Profile (`ci`)

5. Unit-Only Profile (`unit-only`)