Hello, I'm
AI Researcher — NLP • LLM Interpretability • RAG • Multi-Agent Systems
MS & B.Tech graduate from IISER Bhopal (April 2026), currently at Lexsi Labs as a Research Intern in Mechanistic Interpretability. I build systems that understand why language models believe what they believe.
Recent
I am an MS & B.Tech graduate from IISER Bhopal (April 2026), specializing in Data Science and Engineering. My research sits at the intersection of natural language processing, large language model interpretability, and multi-agent reasoning systems.
At Lexsi Labs, I investigate whether machine unlearning produces genuine circuit disruption or merely suppresses behavioral expression — using mechanistic attribution tools on Gemma Scope SAE features. This work connects deeply to my thesis on how RAG systems handle epistemic conflict between parametric and retrieved knowledge.
I am drawn to research that asks mechanistic questions: not just what models do, but why they do it, and how we can design architectures and training procedures that produce more reliable, trustworthy reasoning.
Ongoing and completed research projects.
Bio Medical Data Science Lab, IISER Bhopal • PI: Dr. Tanmay Basu
Investigating mechanisms to overcome imperfect context retrieval and resolve knowledge conflicts, both parametric and external, in Retrieval-Augmented Generation frameworks. The core contribution is DARE, a dialectical engine that operationalizes formal cross-examination to dynamically assess source credibility based on logical resilience rather than static weighting. The engine subjects candidate sources to structured adversarial challenges and infers reliability from how well each source withstands scrutiny.
Lexsi Labs • Mechanistic Interpretability
Investigating whether standard machine unlearning methods produce genuine circuit disruption or merely suppress behavioral expression of internal knowledge states. Using EAP-IG attribution shifts over Gemma Scope SAE features pre- and post-unlearning on the TOFU forget10 benchmark, this work operationalizes the hypothesis that unlearning — like instruction-tuning — acts as a suppression mechanism rather than true knowledge erasure.
Bio Medical Data Science Lab • ICMR Bhopal Collaboration
Developing an end-to-end deep learning framework for automated PICO (Population, Intervention, Comparison, Outcome) extraction from full-text clinical papers. In collaboration with ICMR Bhopal, curating annotated datasets of full research articles to accelerate evidence synthesis for systematic reviews.
Bio Medical Data Science Lab, IISER Bhopal
Engineered a two-stage summarization pipeline combining a ModernBERT-based Siamese extractive stage (using a scaled adaptive margin triplet loss for candidate ranking) with an abstractive generation stage. Evaluated on CNN/DailyMail.
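The adaptive-margin idea can be illustrated with a minimal, dependency-free sketch. Everything below is an illustrative assumption rather than the pipeline's actual implementation: it assumes cosine similarity over sentence embeddings and a margin scaled by the gap between the candidates' gold relevance scores (the names `pos_score` and `neg_score` are hypothetical).

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def adaptive_margin_triplet_loss(anchor, pos, neg, pos_score, neg_score, scale=1.0):
    """Hinge triplet loss whose margin is scaled by the gap between the
    candidates' gold relevance scores: pairs that should be far apart in
    the reference ranking must also be far apart in embedding space."""
    margin = scale * (pos_score - neg_score)          # adaptive, data-dependent margin
    gap = cosine(anchor, pos) - cosine(anchor, neg)   # current embedding-space gap
    return max(0.0, margin - gap)                     # zero loss once gap >= margin

# Toy usage: the positive sentence embedding already sits much closer
# to the document (anchor) than the negative, so the loss is zero.
doc = [1.0, 0.0, 0.0]
good = [0.9, 0.1, 0.0]
bad = [0.0, 1.0, 0.0]
print(adaptive_margin_triplet_loss(doc, good, bad, pos_score=0.8, neg_score=0.2))
```

In the actual pipeline this loss would be computed over ModernBERT sentence embeddings inside a training loop; the sketch only shows the loss geometry.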
School of Public Policy, IIT Delhi • Guide: Dr. Nandana Sengupta
Analyzed 2000+ faculty profiles from IRINS to identify a 12% gender differential in negative marking impact under JEE. Evaluated the socio-economic viability and impact of the 20% supernumerary quota for women at IITs using statistical modeling and causal inference methods.
Student Innovation Grant • IICE / DST, Government of India
Developed an AI-driven fintech platform that won the Student Innovation Grant (Rs. 2 Lakhs) from IICE, funded by DST, Government of India. The platform demonstrated a 68% profit increase in backtesting using ML-driven signal generation and portfolio optimization.
Peer-reviewed publications and workshop papers. Click any paper to expand its abstract.
49th International ACM SIGIR Conference on Research and Development in Information Retrieval
Large language models (LLMs) exhibit a curious failure mode in retrieval-augmented generation: they may internally "disagree" with retrieved context even while outwardly appearing to follow it. We identify a mechanistic law (p < 10⁻⁴²) predicting internal epistemic conflict in 70B-scale models by examining logit-level interactions between parametric knowledge and retrieved context. Our analysis uncovers the Alignment Paradox: instruction-tuning decouples a model's internal epistemic tension from its surface-level textual behavior, causing standard behavioral metrics to systematically miss genuine conflict. We further develop a 25ms "Mechanistic Auditor" that identifies latent sycophancy with 76% F1, achieving a 600× speedup over state-of-the-art probing methods and significantly outperforming model self-reporting.
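The paper's mechanistic law is not reproduced here, but the underlying intuition of comparing parametric and context-conditioned predictions at the logit level can be sketched with a toy example. The two-token vocabulary and the use of KL divergence as the conflict score are assumed stand-ins, not the paper's actual detector.

```python
import math

def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    z = sum(exps)
    return [e / z for e in exps]

def epistemic_conflict(parametric_logits, contextual_logits):
    """Toy conflict score: KL divergence between the model's closed-book
    (parametric) and context-conditioned next-token distributions.
    High divergence signals internal disagreement even when the argmax
    (surface behavior) follows the retrieved context."""
    p = softmax(parametric_logits)
    q = softmax(contextual_logits)
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q))

# Vocabulary of two candidate answers; the retrieved context either
# confirms the model's prior or contradicts it.
agree = epistemic_conflict([4.0, 1.0], [4.1, 0.9])     # context confirms prior
conflict = epistemic_conflict([4.0, 1.0], [0.5, 4.5])  # context contradicts prior
print(agree < conflict)  # prints True
```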
@inproceedings{sadhu2026ragdisagrees,
title = {When {RAG} Disagrees: Detecting Latent Epistemic Conflict
via Logit Interactions},
author = {Sadhu, Saisab and others},
booktitle = {Proceedings of the 49th International {ACM} {SIGIR} Conference
on Research and Development in Information Retrieval},
year = {2026},
publisher = {ACM}
}
48th European Conference on Information Retrieval
We present DARE (Dialectical Adversarial and Evidence-Aware RAG), a framework that resolves factual conflicts in retrieval-augmented generation through a structured cross-examination process. Rather than weighting sources by static relevance scores, DARE subjects candidate passages to adversarial challenges and infers source reliability from logical resilience — how well each source withstands targeted counterfactual pressure. This dynamic credibility assessment mechanism achieves state-of-the-art gains of 77% on FaithEval and 28% on RAMDocs. The framework is model-agnostic and operates entirely at inference time, requiring no additional fine-tuning.
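As a deliberately simplified sketch of the resilience idea, assume each cross-examination challenge is reduced to a boolean predicate over a source; in DARE proper, challenges are structured adversarial questions generated and judged by a model, and the field names below are hypothetical.

```python
def resilience_score(source, challenges):
    """Fraction of adversarial challenges a source withstands. Here a
    'challenge' is just a predicate over the source; in a real system it
    would be a model-generated cross-examination question."""
    survived = sum(1 for challenge in challenges if challenge(source))
    return survived / len(challenges)

def rank_by_resilience(sources, challenges):
    """Order candidate sources by dynamic credibility rather than a
    static relevance weight."""
    return sorted(sources, key=lambda s: resilience_score(s, challenges), reverse=True)

# Toy sources and purely illustrative consistency checks.
sources = [
    {"claim": "X", "dates_consistent": True,  "self_consistent": True},
    {"claim": "Y", "dates_consistent": False, "self_consistent": True},
]
challenges = [
    lambda s: s["dates_consistent"],
    lambda s: s["self_consistent"],
]
best = rank_by_resilience(sources, challenges)[0]
print(best["claim"])  # prints X
```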
@inproceedings{sadhu2026dare,
title = {{DARE}: A Dialectical Framework for Adversarial
and Evidence-Aware {RAG}},
author = {Sadhu, Saisab and others},
booktitle = {Proceedings of the 48th European Conference on
Information Retrieval ({ECIR})},
year = {2026},
publisher = {Springer}
}
Proceedings of the 10th Workshop on FinNLP, EMNLP 2025
Generating high-quality financial analysis from earnings call transcripts requires synthesizing heterogeneous signals (management tone, analyst pushback, forward guidance, and market context) into coherent, persuasive reports. We design a hierarchical multi-agent framework modeling investment committee debates: specialist agents independently analyze distinct aspects of the call, a moderator orchestrates structured argumentation, and a synthesis agent produces the final report. Our system achieved a 68.75% win rate over cooperative baselines, ranking first globally on the official "Win Rate vs. Analyst Report" metric, with generated reports preferred over those of professional human analysts.
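The committee structure can be sketched as a toy control flow, with stub lambdas standing in for the specialist, moderator, and synthesis LLM agents. All names and behaviors here are illustrative assumptions, not the system's actual agents or prompts.

```python
def committee_report(specialists, moderator, synthesizer, transcript):
    """Toy pipeline: each specialist analyzes one aspect of the call,
    the moderator organizes the views into a structured debate, and the
    synthesizer folds the debate into a single report."""
    views = {name: agent(transcript) for name, agent in specialists.items()}
    debate = moderator(views)
    return synthesizer(debate)

# Stub agents; in the real framework each would be an LLM call.
specialists = {
    "tone": lambda t: "management tone: upbeat",
    "guidance": lambda t: "guidance: raised",
}
moderator = lambda views: sorted(views.values())   # deterministic ordering of views
synthesizer = lambda debate: "; ".join(debate)     # fold debate into one report

print(committee_report(specialists, moderator, synthesizer, "transcript text"))
```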
@inproceedings{sadhu2025adversarial,
title = {Structured Adversarial Synthesis: A Multi-Agent Framework
for Generating Persuasive Financial Analysis from
Earnings Call Transcripts},
author = {Sadhu, Saisab and others},
booktitle = {Proceedings of the 10th Workshop on Financial Technology
and Natural Language Processing ({FinNLP}), {EMNLP} 2025},
year = {2025},
publisher = {Association for Computational Linguistics}
}
JustNLP Workshop at IJCAI-AACL 2025
Abstractive summarization of ultra-long legal documents presents unique challenges: legal text is structured by rhetorical roles (facts, arguments, holdings, orders), and naive chunking destroys cross-section coherence. We propose a rhetorically informed chunking pipeline that segments documents along argumentative boundaries before abstractive generation. Through systematic analysis we identify and characterize the "Coherence Gap," a fundamental trade-off between local phrase-level accuracy and global narrative coherence in legal summarization, and propose mitigation strategies through structure-aware segmentation.
@inproceedings{sadhu2025legal,
title = {Structure-Aware Chunking for Abstractive Summarization
of Long Legal Documents},
author = {Sadhu, Saisab and others},
booktitle = {Proceedings of the {JustNLP} Workshop at {IJCAI-AACL} 2025},
year = {2025}
}
AAAI 2026 EGSAI Community Activity
AI tutoring systems are prone to confident errors, sycophantic validation, and pedagogically unsound explanations. We introduce Hierarchical Pedagogical Oversight (HPO), a multi-agent adversarial framework for reliable AI tutoring. HPO employs a hierarchy of specialized agents (a tutor, a challenger, and an overseer) in which the challenger actively probes for errors and the overseer arbitrates. An 8B-parameter model structured via HPO outperforms GPT-4o by 3.3% Macro F1 on MRBench, demonstrating that adversarial architectural design can overcome raw model scale for pedagogical reliability.
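The tutor/challenger/overseer control flow can be sketched in a few lines. The stub agents below are illustrative stand-ins, not the HPO agents themselves; in the framework each role is an LLM with its own prompt.

```python
def hierarchical_oversight(tutor, challenger, overseer, question):
    """Toy control flow for a tutor/challenger/overseer hierarchy: the
    challenger probes the tutor's draft answer for errors, and the
    overseer arbitrates between the draft and the objection."""
    draft = tutor(question)
    objection = challenger(question, draft)
    return overseer(question, draft, objection)

# Stub agents: the tutor only knows one fact, the challenger objects to
# anything that is not the known-correct answer, the overseer defers to
# the draft unless an objection was raised.
tutor = lambda q: "7" if q == "3 + 4?" else "unsure"
challenger = lambda q, a: None if a == "7" else "check arithmetic"
overseer = lambda q, draft, obj: draft if obj is None else "revise: " + obj

print(hierarchical_oversight(tutor, challenger, overseer, "3 + 4?"))  # prints 7
```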
@inproceedings{sadhu2026hpo,
title = {Hierarchical Pedagogical Oversight: A Multi-Agent Adversarial
Framework for Reliable {AI} Tutoring},
author = {Sadhu, Saisab and others},
booktitle = {{AAAI} 2026 {EGSAI} Community Activity},
year = {2026}
}
Lexsi Labs • Mumbai, India
Bio Medical Data Science Lab, IISER Bhopal • PI: Dr. Tanmay Basu
MIQ Digital • Bengaluru, India
School of Public Policy, IIT Delhi • Guide: Dr. Nandana Sengupta
Indian Institute of Science Education and Research Bhopal
Integrated five-year program combining undergraduate engineering and master's-level research in data science.
Ranked #1 globally on the official "Win Rate vs. Analyst Report" metric. System-generated financial reports were preferred over those of professional human analysts.
Selected to present at AAAI 2026 EGSAI Community Activity — one of 51 works chosen from global submissions.
Awarded by IICE (funded by DST, Government of India) to develop an AI fintech platform; demonstrated a 68% profit increase in backtesting.
Awarded full registration waiver and travel support for poster presentation at the Collaborative for Academic Research Excellence Conference, IIT Guwahati.
I'm always happy to discuss research ideas, collaborations, or ongoing work. Feel free to reach out.