We introduce WINA (Weight-Informed Neuron Activation), a novel framework designed to enhance the efficiency of Large Language Models (LLMs) without requiring retraining. Traditional methods often struggle with the signif…
Home
Feed
Search
Library
Download