A Survey on Routing Strategies for Resource Optimisation in Large Language Model-Based Systems
View the PDF file from the paper entitled “Doing More: A Survey on Guidance Strategies to improve resources in the language model systems, by Clovis Varangut Reel and 5 other authors
PDF HTML (experimental) view
a summary:Llm language model systems (LLM), i.e. interconnected elements that include LLM as a central component, such as conversation factors, with fixed homogeneous structures that depend on one LLM, are a contractor for general purposes to deal with all user quarrels. However, these systems may be ineffective because different information may require different levels of thinking, domain or prior treatment. While Llms Generalist (for example, GPT-4O, Claude-Sonnet) works well through a wide range of tasks, they may bear great financial, energy and calculation costs. These costs may be inappropriate with the simplest inquiries, which leads to the use of unnecessary resources. Consequently, the verse to guide the information can be used into more convenient components, such as smaller or specialized models, thus improving efficiency and improving resource consumption. This poll aims to provide a comprehensive overview of the steering strategies in LLM systems. Specifically, it reviews Matthew, why and how to integrate guidance into LLM pipelines to improve efficiency, expansion and performance. We define the goals to improve, such as reducing cost and maximizing performance, and discussing the timing of guidance within the LLM workflow, whether this happens before or after the generation. We also detail the various implementation strategies, including similarity -based methods, supervision news, learning based, and obstetric methods. Practical considerations such as industrial applications and current restrictions are also examined, such as unifying guidance experiences, non -financial cost accounting, and adaptive strategies design. By formalizing the guidance as a problem of improving the cost of performance, this survey provides tools and trends to guide future research and develop low -cost LLM systems.
The application date
From: Clovis Varangut Reel [view email]
[v1]
Saturday, Feb 1 2025 12:08:38 UTC (590 KB)
[v2]
Tuesday, 4 February 2025 09:12:03 UTC (590 KB)
[v3]
Monday, 21 July 2025 12:20:06 UTC (273 KB)
Don’t miss more hot News like this! Click here to discover the latest in AI news!
2025-07-22 04:00:00



