Skip to content
View IbnuEyni's full-sized avatar

Block or report IbnuEyni

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
IbnuEyni/README.md

Hey, I'm Amir Ahmedin 👋

I build AI agent systems — from evaluation benchmarks to production pipelines that enrich, qualify, and convert leads autonomously.

🔭 Currently: Building domain-specific benchmarks and multi-tool agent orchestration systems
🧠 Focus: Agent evaluation, data pipelines, constrained tool-use, and context engineering
💬 Ask me about: AI agent architectures, sales automation, document intelligence, data lineage


🏗️ Featured Projects

Natural language data analytics agent evaluated on DataAgentBench (54 queries, 12 datasets, 9 domains). Uses a 3-layer knowledge base injection architecture to achieve 35.2% pass@1 against a 54.3% SOTA ceiling.

Domain-specific benchmark for B2B sales agents — 250 tasks across signal grounding, tone consistency, resource honesty, and workflow correctness. Published on HuggingFace with a SimPO-trained judge model.

Automated lead generation system with 5-signal enrichment pipeline (Crunchbase, job posts, layoffs, leadership changes, AI maturity), ICP classification, multi-channel outreach, and CRM sync.

Codebase intelligence system that transforms undocumented repos into queryable knowledge graphs — module dependency analysis, data lineage tracking, blast radius calculation, and LLM-powered semantic analysis.

Enterprise-grade agentic pipeline for unstructured document extraction. Multi-strategy routing (fast text → layout-aware → vision-augmented) with confidence-gated escalation and spatial provenance.

Schema integrity and lineage attribution system — auto-generates Bitol-compatible contracts, validates data snapshots, traces violations to upstream git commits, and detects schema drift.


⚡ Tech Stack

Python FastAPI PostgreSQL Docker LangChain HuggingFace NetworkX Pydantic


📊 What I'm Working On

  • Agent evaluation methodology — building benchmarks that catch real production failures
  • Centralized orchestration patterns for multi-tool agent systems
  • Context engineering for data agents (knowledge base injection, schema hints, corrections memory)

📫 Connect

LinkedIn HuggingFace Email

Pinned Loading

  1. tenacious-sales-bench tenacious-sales-bench Public

    Domain-specific benchmark for B2B sales agents — 250 tasks, SimPO judge model, published on HuggingFace.

    Python

  2. oracle-forge oracle-forge Public

    Natural language data analytics agent — 3-layer KB injection on DataAgentBench (54 queries, 12 datasets). 35.2% pass@1.

    Python 4

  3. Conversion-Engine Conversion-Engine Public

    Automated lead generation & conversion system — 5-signal enrichment, ICP classification, multi-channel outreach, CRM sync.

    Python

  4. amharic_braille_backend amharic_braille_backend Public

    Python

  5. Amharic-Hate-Speech-Detector Amharic-Hate-Speech-Detector Public

    Jupyter Notebook

  6. A2SV A2SV Public

    Python