WIP: Giskard v3 by kevinmessiaen · Pull Request #2229 · Giskard-AI/giskard-oss

kevinmessiaen · 2026-01-19T09:11:37Z

No description provided.

- Add Sphinx and related dependencies to pyproject.toml - Add serve-docs Makefile target for local documentation serving - Remove .python-version files from libs directories - Add docs/ directory with Sphinx configuration - Update pre-commit config to exclude example API keys from secret detection

Refactor documentation to prioritize the fluent API pattern scenario().interact().check() over InteractionSpec and TestCase APIs. Changes: - Update all tutorial files (rag-evaluation, testing-agents, chatbot-testing) to use fluent API instead of InteractionSpec/TestCase - Update getting started guides (quickstart, single-turn, multi-turn) to prioritize fluent API - De-emphasize InteractionSpec/TestCase in core-concepts, move to Advanced Usage - Update giskard-checks README to use fluent API in Quickstart section - Move InteractionSpec/TestCase examples to Advanced Usage section in README - Update Python docstrings (TestCase, Scenario, InteractionSpec) to recommend fluent API for most use cases - Wrap all examples in async functions with asyncio.run() for proper execution - Rename test_* functions to run_*_example() to avoid pytest confusion This reduces the learning curve for beginners while keeping advanced APIs available in reference documentation for power users. Refs: ENG-1294

…fy-api-surface-in-tutorials-fluent-api-focus docs(checks): simplify API surface in tutorials with fluent API focus

Remove 47 unused image files from docs/source/_static/images/: - 2 images from sdk/ directory - 3 images from oss/ directory - 42 images from hub/ directory (including scan/ subdirectory) None of these images were referenced in any documentation source files.

Remove all async wrapper functions (run_*_example, main) and asyncio.run() calls from documentation examples across all guides and tutorials. Add a disclaimer in quickstart.rst explaining how to run async code examples using asyncio.run() or within an async context (e.g., Jupyter notebooks or pytest-asyncio). This simplifies the documentation examples by showing the async code directly without unnecessary wrapper functions, while still providing guidance on how to execute the code.

…mplementation - Replace StringMatchingCheck with StringMatching class name - Update API parameters: content -> keyword, key -> text_key/keyword_key - Update JSONPath syntax: interactions[-1] -> trace.last - Add examples for normalization_form and case_sensitive parameters - Update all documentation files including tutorials and API references

* feat: rich display for test case results * chore: refactoring * feat: improved formatting * chore: cleanup * Update src/giskard/checks/core/result.py Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> * Update src/giskard/checks/core/result.py Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> * Update src/giskard/checks/core/result.py Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> * fix: code formatting * Add rich representation for trace and interaction --------- Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by: Pierre Le Jeune <pierre@giskard.ai>

…st-stringmatching-behavior-to-match-hub feature(checks): Improve string matching checks

- Run all tests on main branch - On PRs, run tests for changed projects and their dependents - giskard-core changes trigger tests for all projects - giskard-agents changes trigger tests for giskard-agents and giskard-checks - giskard-checks changes only trigger its own tests

Bumps [filelock](https://github.com/tox-dev/py-filelock) from 3.20.0 to 3.20.3. - [Release notes](https://github.com/tox-dev/py-filelock/releases) - [Changelog](https://github.com/tox-dev/filelock/blob/main/docs/changelog.rst) - [Commits](tox-dev/filelock@3.20.0...3.20.3) --- updated-dependencies: - dependency-name: filelock dependency-version: 3.20.3 dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

--- updated-dependencies: - dependency-name: aiohttp dependency-version: 3.13.3 dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

…d-v3

- Remove docs directory and all documentation source files - Remove documentation dependencies from pyproject.toml - Remove serve-docs target from Makefile - Update uv sync command to remove --all-groups flag

…-update-tests-run-condition ci: update test run conditions based on project dependencies

* Add similarity check * Clean up * Use gemini embedding as default * build: add numpy as explicit dependency to giskard-agents and giskard-checks numpy is used directly in embeddings and semantic similarity checks but was only available as a transitive dependency. Adding it as an explicit dependency ensures it's always available. * refactor(agents): make embedding batch parameters configurable per-call - Remove max_batch_size and max_total_chars from EmbeddingParams - Add support for per-call batch size and total chars parameters - Add environment variable support for default batch limits (GISKARD_AGENTS_DEFAULT_MAX_BATCH_SIZE, GISKARD_AGENTS_DEFAULT_MAX_TOTAL_CHARS) - Update embed() and _embed() methods to accept optional params and batch parameters - Update all tests to use new API * docs(checks): add documentation and docstrings for SemanticSimilarity - Add SemanticSimilarity section to builtin-checks.rst following the same format as other checks (EqualityCheck, StringMatchingCheck, etc.) - Add SemanticSimilarity to quick reference imports in index.rst - Add comprehensive class-level docstring to SemanticSimilarity with Attributes and Examples sections - Add docstrings to get_embeddings method and enhance cosine_similarity function docstring --------- Co-authored-by: Kevin Messiaen <kevin.messiaen@icloud.com> Co-authored-by: Kevin Messiaen <114553769+kevinmessiaen@users.noreply.github.com>

* feature(check): improved equality test * test(equality): added test for equality check * refactor(checks): simplify extraction API and add JSONPath utilities - Replace Extractor ABC pattern with simple resolve() and provided_or_resolve() functions - Remove extraction_check.py in favor of direct JSONPath resolution - Add NotProvided sentinel and utility functions to giskard.core.utils - Update builtin checks (equality, groundedness, string_matching) to use new extraction API - Update tests to reflect simplified extraction interface This change makes JSONPath extraction more accessible as a builtin feature for metadata checks while reducing code complexity. * refactor(checks): extract normalization utilities into shared module - Move normalization logic from string_matching to utils.normalization - Add normalize_string and normalize_data functions for reuse - Update string_matching to use new normalization utilities - Add comprehensive tests for normalization utilities - Add Unicode normalization tests for equality checks * refactor(checks): replace Equality with ComparisonCheck system Replace the Equality check with a more general ComparisonCheck base class that supports multiple comparison operators (equals, greater than, less than, greater than or equal, less than or equal). BREAKING CHANGE: The Equality check has been replaced with Equals and other comparison checks. Import paths have changed from Equality to Equals, and new comparison operators are now available. * feat(checks): add NotEquals comparison check Add NotEquals check class that validates if extracted values do not equal an expected value. The check uses Python's __ne__ method for comparison and follows the same pattern as other comparison checks (Equals, LesserThan, GreaterThan, etc.). - Implement NotEquals class in comparison.py - Register check as 'not_equals' - Add comprehensive test coverage for various types and edge cases - Export NotEquals in builtin and main __init__.py modules * chore: improve check kind naming Co-authored-by: Pierre Le Jeune <pierre@giskard.ai> * feat(checks): support extracting expected values from traces in comparison checks - Add expected_value_key parameter to ComparisonCheck for dynamic extraction - Make expected_value optional when expected_value_key is provided - Add provided_or_resolve utility function for flexible value resolution - Update NotProvided to be a Pydantic BaseModel for better validation - Update tests to reflect new behavior when expected value extraction fails * chore: improve variable clarity --------- Co-authored-by: Pierre Le Jeune <pierre@giskard.ai>

Refresh fluent scenario examples, JSONPath usage, and docstrings to match current API behavior and trace access patterns.

chore(ci): update CI workflow - separate environment for functional tests requiring external API secrets - updated Makefile and workflow - updated test markers in packages to enable functional testing

mattbit and others added 30 commits June 9, 2025 18:36

sketch

37d4e52

basic draft

cd0a449

Add readme

b080af3

Working on tools

d27ada1

Tools

52559c0

working on pipeline with tools

2060b34

Add cursor rule

f8abeab

add pylint-pydantic

2e91e73

use default params in generator

8e57abe

Simple implementation of multi message templates

4765753

Better implementation of the prompt manager

b03a15d

Render pydantic models as json

3419212

feat: added rate limiter (#1)

0bb3aca

feat: output parsing and instructions

aef48b9

chore: add tests for chat output

17a3c6c

Add tests

9e1ec60

update ci

fdf4731

formatting and cleanup

8c9209d

Adding ruff

ff7083a

fix: only include output instructions if model is available

91bcd2f

chore: use gemini 2.0 flash for tests

9b9bcd5

chore: update deps

fe2e24f

chore: update package description

be9273e

fix: adapt tool decorator to work with class methods

7fc2d7f

feat: add run context for tools (#2)

27b505c

chore: adding docs

f94eb45

fix: add template method in generator (#3)

5cd4058

fix: mistake in readme

09b4505

feat: enforce structured output (#4)

d9a696e

feat: pass inputs to run context (#5)

a329dce

kevinmessiaen and others added 24 commits January 20, 2026 11:48

Merge pull request #2230 from Giskard-AI/feature/eng-1294-docs-simpli…

ca1a741

…fy-api-surface-in-tutorials-fluent-api-focus docs(checks): simplify API surface in tutorials with fluent API focus

fix: add --all-groups flag to uv sync

25c3627

feature(checks): Improve string matching checks

cabe76c

Merge pull request #2233 from Giskard-AI/feature/eng-1309-confirmadju…

d45c131

…st-stringmatching-behavior-to-match-hub feature(checks): Improve string matching checks

Merge remote-tracking branch 'checks_origin/main' into feature/giskar…

f46e97c

…d-v3

feat(dep): add rich to giskard checks

f0c3b02

chore: add VSCode directory to gitignore

0aa83e3

chore(docs): remove documentation infrastructure

e44e3fc

- Remove docs directory and all documentation source files - Remove documentation dependencies from pyproject.toml - Remove serve-docs target from Makefile - Update uv sync command to remove --all-groups flag

Merge pull request #2235 from Giskard-AI/feature/eng-1304-giskard-oss…

cd75b0a

…-update-tests-run-condition ci: update test run conditions based on project dependencies

chore(docs): align checks README and docstrings

0230f78

Refresh fluent scenario examples, JSONPath usage, and docstrings to match current API behavior and trace access patterns.

chore(deps): bump workspace packages to 1.0.0a1 and use >=1.0.0a1

8ef0437

docs(readme): rewrite for v3 alpha scope

6ada999

chore: ci workflows v3 (#2238)

81e424b

chore(ci): update CI workflow - separate environment for functional tests requiring external API secrets - updated Makefile and workflow - updated test markers in packages to enable functional testing

kevinmessiaen requested a deployment to CI February 5, 2026 08:03 — with GitHub Actions Waiting

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

WIP: Giskard v3#2229

WIP: Giskard v3#2229
kevinmessiaen wants to merge 227 commits intomainfrom
feature/giskard-v3

kevinmessiaen commented Jan 19, 2026

Uh oh!

Reviewers

Assignees

Labels

Milestone

Development

Uh oh!

5 participants

Uh oh!

Conversation

kevinmessiaen commented Jan 19, 2026

Uh oh!

Reviewers

Assignees

Labels

Milestone

Development

Uh oh!

5 participants