Nils Feldhus

nfel

https://nfelnlp.github.io

AI & ML interests

Interpretability, Explainability, Natural Language Generation

Recent Activity

upvoted a collection 24 days ago

👤 Implicit Personalization in Language Models

liked a Space 26 days ago

aaron0eidt/ELIA

authored a paper 3 months ago

Interpreting Language Models Through Concept Descriptions: A Survey

View all activity

Organizations

authored a paper 3 months ago

Interpreting Language Models Through Concept Descriptions: A Survey

Paper • 2510.01048 • Published Oct 1 • 2

authored a paper 5 months ago

Infherno: End-to-end Agent-based FHIR Resource Synthesis from Free-form Clinical Notes

Paper • 2507.12261 • Published Jul 16 • 1

authored 2 papers 6 months ago

Table Understanding and (Multimodal) LLMs: A Cross-Domain Case Study on Scientific vs. Non-Scientific Data

Paper • 2507.00152 • Published Jun 30 • 1

Capturing Polysemanticity with PRISM: A Multi-Concept Feature Description Framework

Paper • 2506.15538 • Published Jun 18 • 1

authored 2 papers 7 months ago

Truth or Twist? Optimal Model Selection for Reliable Label Flipping Evaluation in LLM-based Counterfactuals

Paper • 2505.13972 • Published May 20 • 1

Through a Compressed Lens: Investigating the Impact of Quantization on LLM Explainability and Interpretability

Paper • 2505.13963 • Published May 20 • 1

authored 5 papers 8 months ago

Gender Bias in Explainability: Investigating Performance Disparity in Post-hoc Methods

Paper • 2505.01198 • Published May 2 • 2

Inseq: An Interpretability Toolkit for Sequence Generation Models

Paper • 2302.13942 • Published Feb 27, 2023 • 1

Efficient Explanations from Empirical Explainers

Paper • 2103.15429 • Published Mar 29, 2021

LLMCheckup: Conversational Examination of Large Language Models via Interpretability Tools

Paper • 2401.12576 • Published Jan 23, 2024 • 2

Free-text Rationale Generation under Readability Level Control

Paper • 2407.01384 • Published Jul 1, 2024

authored 3 papers over 1 year ago

Thermostat: A Large Collection of NLP Model Explanations and Analysis Tools

Paper • 2108.13961 • Published Aug 31, 2021

Saliency Map Verbalization: Comparing Feature Importance Representations from Model-free and Instruction-based Methods

Paper • 2210.07222 • Published Oct 13, 2022

InterroLang: Exploring NLP Models and Datasets through Dialogue-based Explanations

Paper • 2310.05592 • Published Oct 9, 2023

Nils Feldhus

AI & ML interests

Recent Activity

Organizations

nfel's activity