Publications

Characterizing Context Influence and Hallucination in Summarization

Author(s): J. Flemings, W. Zhang, B. Jiang, Z. Takhirov, M. Annavaram

NeurIPS Workshop on Safe & Trustworthy Agents, Dec 2024.

Paper Link

Enabling Resource-Efficient On-Device Fine-Tuning of LLMs Using Only Inference Engines

Author(s): L. Gao, A. Ziashahabi, Y. Niu, S. Avestimehr, M. Annavaram

NeurIPS Workshop on Efficient Natural Language and Speech Processing, Dec 2024.

Paper Link

Biased User History Synthesis for Personalized Long-Tail Item Recommendation

Author(s): K. Balasubramanian, A. Alshabanah, E. Markowitz, G. Ver Steeg, M. Annavaram

Proceedings of the 18th ACM Conference on Recommender Systems, Oct 2024. (Acceptance rate 58/266, 21.8%)

Paper Link

CADC: Encoding User-Item Interactions for Compressing Recommendation Model Training Data

Author(s): H. Entezari Zarch, A. Alshabanah, C. Jiang, M. Annavaram

Workshop on Risks, Opportunities, and Evaluation of Generative Models in Recommender Systems, Oct 2024.

Paper Link

Differentially Private Knowledge Distillation via Synthetic Text Generation

Author(s): J. Flemings, M. Annavaram

Findings of the Association for Computational Linguistics, August 2024.

Paper Link

Differentially Private Next-Token Prediction of Large Language Models

Author(s): J. Flemings, M. Razaviyayn, M. Annavaram

North American Chapter of the Association for Computational Linguistics, June 2024. (Acceptance rate 565/2434, 23.2%)

Paper Link

Ethos: Rectifying Language Models in Orthogonal Parameter Space

Author(s): L. Gao, Y. Niu, T. Tang, S. Avestimehr, M. Annavaram

Findings of the North American Chapter of the Association for Computational Linguistics, June 2024. (Acceptance rate 304/2434, 12.5%)

Paper Link

Edge Private Graph Neural Networks with Singular Value Perturbation

Author(s): T. Tang, Y. Niu, S. Avestimehr, M. Annavaram

Proceedings of the 24th annual Privacy Enhancing Technologies Symposium (PETS), July 2024. (Acceptance rate 99/456, 22%)

Paper Link

LAORAM: Look Ahead ORAM for Recommendation Model Privacy

Author(s): Y. Wang, R. Rajat, M. Annavaram

Proceedings of the International Symposium on Computer Architecture (ISCA), June 2023. (Acceptance rate 79/373, 21%)

Paper Link

PageORAM: An Efficient DRAM Page Aware ORAM

Author(s): R. Rajat, Y. Wang, M. Annavaram

Proceedings of the 55th IEEE/ACM International Symposium on Microarchitecture, October 2022. (Acceptance rate 83/369, 22%)

Paper Link

StATIK: Structure and Text for Inductive Knowledge Graph Completion

Author(s): E. S. Markowitz, K. Balasubramanian, M. Mirtaheri, M. Annavaram, A. Galstyan, G. Ver Steeg

2022 Annual Conference of the North American Chapter of the Association for Computational Linguistics, July 2022.

Paper Link

Characterization of MPC-based Private Inferences for Transformer-based Models

Author(s): Y. Wang, E. Suh, W. Xiong, M. Annavaram, H. Lee

Proceedings of the International Symposium on Performance Analysis of Systems and Software, May 2022.

Paper Link

Enhancing Privacy Through Domain Adaptive Noise Injection for Speech Emotion Recognition

Author(s): T. Feng, H. Hashemi, M. Annavaram, S. Narayanan

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, May 2022.

Paper Link

AVCC: Adaptive Verifiable Coded Computing

Author(s): T. Tang, H. Hashemi, R. Ali, S. Avestimehr, M. Annavaram

Proceedings of the International Parallel and Distributed Processing Symposium, May 2022. (Round 1 acceptance rate 46/474, 10%)

Paper Link

SpreadGNN: Serverless Multi-task Federated Learning for Molecular Graphs

Author(s): E. Ceyani, C. He, K. Balasubramanian, M. Annavaram, S. Avestimehr

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI-22), Feb 2022. (Acceptance rate 1349/9251, 15%)

Paper Link

Check-N-Run: A Checkpointing System for Training Deep Learning Recommendation Models

Author(s): A. Eisenman, K. Matam, S. Ingram, D. Mudigere, R. Krishnamoorthi, K. Nair, M. Smelyanskiy, M. Annavaram

Proceedings of the 19th USENIX Symposium on Networked Systems Design and Implementation (NSDI), April 2022.

Paper Link

DarKnight: An Accelerated Framework for Privacy and Integrity Preserving Deep Learning Using Trusted Hardware

Author(s): H. Hashemi, Y. Wang, M. Annavaram

Proceedings of the 54th IEEE/ACM International Symposium on Microarchitecture, October 2021. (Acceptance rate 94/423, 22%)

Paper Link

cDLRM: Look Ahead Caching for Scalable Training of Recommendation Models

Author(s): K. Balasubramanian, A. Alshabanah, J. Choe, M. Annavaram

Proceedings of the 15th ACM Conference on Recommender Systems, Oct 2021. (Acceptance rate 49/267, 18%)

Paper Link

Origami Inference: Private Inference Using Hardware Enclaves

Author(s): K. Narra, Z. Lin, Y. Wang, K. Balasubramanian, M. Annavaram

IEEE International Conference on Cloud Computing (Short paper), Sept 2021. (Acceptance rate 23.8%)

Paper Link

MultiLogVC: Efficient Out-of-Core Graph Processing Framework on Flash Storage

Author(s): K. Matam, H. Hashemi, M. Annavaram

Proceedings of the International Parallel and Distributed Processing Symposium, May 2021.

Paper Link

Group Knowledge Transfer: Federated Learning of Large CNNs at the Edge

Author(s): C. He, M. Annavaram, S. Avestimehr

Advances in Neural Information Processing Systems, Dec 2020. (Acceptance rate 1900/9454, 20%)

Paper Link

Collage Inference: Using Coded Redundancy for Lowering Latency Variation in Distributed Image Classification Systems

Author(s): K. Narra, Z. Lin, S. Avestimehr, G. Ananthanarayanan, M. Annavaram

Proceedings of the International Conference on Distributed Computing Systems, July 2020. (Acceptance rate 105/584, 18%)

Paper Link