Revitalizing Black-Box Interpretability: Actionable Interpretability for LLMs via Proxy Models
Published in ACL, 2026
View compact publications list
Published in ACL, 2026
Published in TOPLAS, will be presented on OOPSLA, 2026
LLMs can produce loop invariants with correct high-level insights but invalid details due to small logical reasoning errors. LORIS builds a pipeline that first asks the LLM to generate reasoning steps and then formalizes and checks those steps with a verifier to identify the failing reasoning. By providing detailed feedback on the failure, LORIS guides the LLM toward valid loop invariants more effectively.
On a benchmark suite of 460 C programs, LORIS with GPT-4.1 solves 445 programs with a 93.1% success rate, outperforming the SOTA approaches LaM4Inv and Clause2Inv, both of which remain below 85% under the same model. Across five tested models, LORIS solves the most unique programs on all models and achieves higher success rates on four models. On an additional non-linear benchmark suite, LORIS solves 47 of 50 programs.
Published in OOPSLA, 2026
Bayesian program analysis is a novel paradigm that systematically introduces probabilities into program analysis. It ranks alarms by probabilistic confidence and updates those confidences using user inspection feedback. However, existing interactive approaches greedily pick the highest-confidence alarm for inspection, which can get stuck in local optima, resulting in sub-optimal performance.
Our approach, Beer, frames interactive alarm resolution as an exploration-exploitation problem: when the analysis appears stuck, it explores by selecting alarms that approximately maximize information gain to break out of local optima more effectively. Experiments on datarace, thread-escape, and taint analyses show that Beer outperforms the greedy baseline, achieving up to 32% fewer false-alarm inspections, and also outperforms SOTA Bayesian analysis systems. This demonstrates that its exploration-exploitation strategy generalizes beyond a single analysis.
Published in OOPSLA, 2024
CEGAR-based abstraction refinement often relies on constraint solving, which can struggle to scale to large Datalog analyses. This work uses graph neural networks to prune unhelpful abstraction parameters from Datalog derivation graphs before MaxSAT-based refinement, yielding smaller constraint problems that speed up constraint solving and more effective refinements that reduce the number of the refinement iterations.
Our approach is general and does not require heavy domain knowledge for different analyses. Experiments on pointer and typestate analyses show that this approach answers 2.83x and 1.5x as many queries, respectively, as the baseline on large programs. It also runs faster on programs where both approaches terminate, and its timeout frequency is about 30% of the baseline’s.