Measuring LLM ROI from a Single Transaction Using Real-World Costs

cover
14 Jul 2025

Abstract and 1. Introduction

  1. Basic premises for our models

    2.1. How LLMs are charged

    2.2 An anedotal model from industry

    2.3 Choice of Costs in the Model

    2.4 Why multiple scenarios

  2. A Decision Theoretic Model

    3.1 How to model success

  3. A Model for Commercial Operations Based on a Single Transaction

    4.1 Example of Using the Model

    4.2 Model Analysis

    4.3 Discussion of the RoI vs. Earning dillema

  4. Modelling of a Binary Classification Problem

    5.1 Local Sensitivity Analysis

    5.2 Global Sensitivity Analysis by Sobol Method

  5. Related Work

  6. Future work

  7. Conclusion, References, and Acknowledgements

4 A Model for Commercial Operations Based on a Single Transaction

This scenario supposes that a company evaluates different LLMs for the same task, based on the probability of success of each LLM in the business. The analytical steps include building a decision-theoretic RoI model, performing a qualitative sensitivity analysis based on first-degree partial derivatives, and performing a Sokol evaluation of the first and second degree. In this scenario, we also show a numerical example based on real prices from the market in May 2024, and an arbitrarily supposed performance that is compatible with LLMs with the specified cost, to show how RoI formulae can be used in a real case.

In this first analysis, we consider only the (final) probability of success, modeled as P.

To quantify the economic impact of choosing an LLM, it is assumed that success in the task generates a gain G and an error implies a cost L. Gains and losses are inherent to the task and not to the LLM.

Thus, the expected value of earnings, E[E], and RoI, E[R], per transaction using an LLM, can be calculated with a decision-theoretical model where the earnings are calculated from the gain per transaction that generates a positive result multiplied by the probability of the result being positive, minus the loss per transaction that generates a negative impact, multiplied by the probability of a negative impact occurring and minus the total cost of the transaction, as in Equation 18. The expected RoI is the expected earnings divided by the total cost of the transaction, as in Equation 19.

These expressions facilitate a comparative analysis of earnings and RoI, taking into account not only the direct costs of using LLMs but also the economic consequences of the results produced by these technologies.

4.1 Example of Using the Model

For classification tasks, the cost can be completely dominated by the input, as the output can only be 1 token. Under the assumption of those specific conditions in a classification task, the earnings (E) and the RoI (R) for each LLM would be:

This example demonstrates that, despite llm-1 costing 20 times more and presenting a higher income, it also presents a significantly lower return on investment. This is a typical result of comparing projects with a large difference in investment [Project Management Institute, 2021].

However, if we consider a very large transaction, we can change the value of T to 128,000 and maintain the other values. With these data, it is possible to find an approximate value of 84% as the minimum performance required for llm-2 to become more economical than llm-1.

The graph in Figure 1 shows the evolution of the earnings for two competitive models according to the values used in the previous example as a function of the average transaction size.

Figure 1: Transaction gain variation as a function of size for two models with different performance (95% and 80%) and cost (US$10 and US$1).

Authors:

(1) Geraldo Xexéo, Programa de Engenharia de Sistemas e Computação – COPPE, Universidade Federal do Rio de Janeiro, Brasil;

(2) Filipe Braida, Departamento de Ciência da Computação, Universidade Federal Rural do Rio de Janeiro;

(3) Marcus Parreiras, Programa de Engenharia de Sistemas e Computação – COPPE, Universidade Federal do Rio de Janeiro, Brasil and Coordenadoria de Engenharia de Produção - COENP, CEFET/RJ, Unidade Nova Iguaçu;

(4) Paulo Xavier, Programa de Engenharia de Sistemas e Computação – COPPE, Universidade Federal do Rio de Janeiro, Brasil.


This paper is available on arxiv under CC BY-NC-ND 4.0 DEED license.