YoctoEconomics

martes, 6 de marzo de 2018

Does rationality imply selfishness?

The main assumption of standard economics is rationality. This means that economic agents maximize their utility (the word economists use for happiness). Therefore, standard economics assumes that people have a limitless memory and reasoning power and can make optimal decisions always. For instance, rationality implies that people can go to a supermarket with a predetermined budget and use it optimally to buy exactly the goods that will maximize their utility.

Although this assumption is extremely unreal, it has been maintained for decades, until behavioral economics started to study rationality boundaries and behavioral anomalies. But this is not the topic of this post.

Suppose for the rest of the post that we still believe that people are rational. Actually, all undergraduate economics students around the world never discuss the limits of rationality during their bachelor degrees. In addition, we are also taught that rationality implies selfishness, because if we are rational, then it is optimal to do what is best for ourselves and do not worry for others´ happiness.

But should this be a condition for rationality?

In my Master, I share classes with students from more than 30 nationalities and all of us were taught that rationality implies selfishness. Although I never thought about this before, it seems obvious now that there is nothing in rationality that necessarily implies selfishness. So, let´s start from the beginning, what are the conditions for rationality?

1- Preferences are complete: this condition requires that either A is preferred to B, B to A or they are indifferent. It cannot happen that a person does not know what he prefers between two options.

2- Preferences are transitive: this condition requires that if A is preferred to B and B to C, then A must be preferred to C.

3- Preferences are continuous: this condition simply requires that there are not jumps in people´s preferences. In other words, if A is preferred to B, options very close to A must also be preferred to B.

4- Preferences are independent of irrelevant alternatives: this condition requires that if A is preferred to B, a mixture of A and C must be preferred to a mixture of B and C (both with the same shares of C).

If these four axioms of rationality hold, then it is possible to confirm that a person has rational preferences, which means that they are consistent and that they will not change due to marketing tricks, for example.
But these axioms do not tell us anything about selfishness or altruism, right? consistency means that people are efficient in using their available resources to meet their goals, whatever those goals are. If a particular person preferences just considers his own happiness, then it is obvious that any spending in others will be suboptimal, but what happens if not?

Dear economists, stop saying that rationality implies selfishness because it is not true. A pure altruist individual, for example, will maximize his utility giving everything to others. If his choices are consistent, why should not been them considered rational?

Dear economists, altruism is rational.

sábado, 3 de febrero de 2018

Decoding the Prisoner´s Dilemma

Game Theory can be defined as a mathematical representation of social life and it is usually used to understand and predict strategic behavior when actions of all players affect everybody else.
The standard and best known example of game theory in action is the Prisoner´s Dilemma, which gained its popularity thanks to a "real-life" story that represents the individual decision-making problem. The story can be framed like this:

Two members of a criminal gang are arrested and imprisoned. Each prisoner is in solitary confinement with no means of communicating with the other. The prosecutors lack sufficient evidence to convict the pair on the principal charge. They hope to get both sentenced to a year in prison on a lesser charge. Simultaneously, the prosecutors offer each prisoner a bargain. Each prisoner is given the opportunity either to: betray the other by testifying that the other committed the crime, or to cooperate with the other by remaining silent. The offer is:

If A and B each betray the other (defect), each of them serves 2 years in prison
If A defects B but B remains silent, A will be set free and B will serve 3 years in prison (and vice versa)
If A and B both remain silent (cooperate), both of them will only serve 1 year in prison.

Game theory problems usually start from stories like this, and what game theorists do is to automatically represent the outcomes in matrixes. As you may see below, the matrix represents the payoffs of both players, with the left number as the payoff for B and the right number for A.

In the Prisoner´s Dilemma, from an individual point of view, it is always optimal to defect. No matter what the other player does, if I defect I get a better outcome than if I cooperate. Suppose that A cooperates, then if B cooperates he gets -1 while if B defects he gets 0 (then defecting is better). Suppose now that A defects, then if B cooperates he gets -3 while if he defects he gets -2 (defecting is better again). Game theorists call defecting (in this case) a strictly dominant strategy. It is individually optimal for both players to defect.

But what makes beautiful the Prisoner´s Dilemma is that although it is optimal for both players to defect, if both do so, they get an outcome of (-2,-2), while if they manage to cooperate, they would get (-1,-1). I find this situation simply fantastic because it easily illustrates a lot about human behavior. If both players do what is best for themselves, they both arrive to a situation that is worse than if they decide to cooperate, with the risk of being betrayed by the other one.

In game theoretic terms, (Defect, Defect) is the Nash Equilibrium, i.e. the stable situation that no player wants to abandon. As can be seen, nothing guarantees that a Nash Equilibrium is a good outcome for the players.

So, if (Cooperate, Cooperate) generates a better outcome for both players, what can we do as choice architects in our real-life problems to boost cooperation?

Behavioral game theory and experimental economics have demonstrated that people usually do not choose (Defect, Defect) and that there are techniques to generate more cooperation. Actually, we are irrational in a predictable way, so what if we try some of these ideas?

Repetition

If you make people play several times with the same partner and they do not know when the game is going to finish, it is possible to build conditional cooperation between them. For example, they can have trigger strategies (start cooperating and cooperate as long as the other one does the same). This type of strategy creates a different situation, in which players evaluate the whole set of payments and will choose the actions that maximize their lifetime outcome.

Framing

It may seem obvious but results in this game have been shown to be completely different if it is called Wall Street Game or Cooperation Game. If you have a real-life situation that resembles the Prisoner´s Dilemma, try to generate a context that promotes cooperation, rather than competitive behavior.

Social norms

It has been shown that people respond to what is usually done by most of the people. If you can find data like in previous cases, 70% of the people decided to cooperate, do not hesitate to let new players know it! You will be surprised about how decision-makers behavior will change towards a cooperation rate higher than 70%.

Private norms

People may also feel guilt if they are betraying their own values. Generate a context that make them realize about the selfishness of their actions and, probably, many of them will change towards a more sociably desirable behavior.

lunes, 1 de enero de 2018

A Beautiful Mind needs revision

When someone asks me what is Game Theory about, I usually answer with a question: have you seen A Beautiful Mind?

The movie is a representation of the life of John Nash, a Nobel Laureate in Economics, for his developments in Game Theory. Most of you have probably heard about the notion of a Nash Equilibrium which is, in summary, a situation in which no player wants to change his strategy, based on what the other players are doing. A Nash Equilibrium does not need to be a good nor a desirable outcome, just a best response of all players. In other words, if all players can stop the time and think: am I doing the best I can considering what everyone else did? and they all find that choosing another strategy will yield them worse results, then that sum of strategies of all players is a Nash Equilibrium.

In the movie, there is a famous scene that tries to illustrate this concept:

The idea is simple: if all players do what is best for themselves ignoring that other players will also do what is best for themselves, then they will get a negative outcome. If, in contrast, they consider what other players may be thinking before making a decision, they may prefer to choose another action.

The example is easy to understand and well explained but it is full of mistakes! It is strange that a biographical movie about one of the best mathematicians of the history makes so many mistakes in a crucial part of the movie.

Nash did not study Sequential Games

The illustration of the movie can be represented as a Sequential Game. The men move first and receive a first response from the approached women. Then, they move again and receive a new response from the women. The idea of Sequential Games is that if the men can anticipate by backward induction that they will be rejected by the brunettes in the second subgame, they would prefer to approach them directly in the first subgame.

Although the concept is part of Game Theory (and one of the most interesting and useful topics), it is not what John Nash studied! In fact, he focused on Strategic Games, in which all players have to move simultaneously. In other words, the situation represented in the movie implies a game with two rounds. Therefore, it should not be part of John Nash biographical movie.

The equilibrium is definitely wrong!

This is what actually drives me crazy. In a Nash Equilibrium, all the players must be playing their best response, taking into account what the rest of the players are doing. Therefore, if there are 4 men and 5 women and we assume that the blond is the prettiest and the 4 brunettes are less desirable, then an outcome in which the 4 men approach the 4 brunettes is not a Nash Equilibrium! in fact, all the men would individually be willing to change their action and approach the blond woman (considering that nobody else has approached her). Actually, this game would have four Nash Equilibria. In each equilibrium, one different man finishes with the blond and the three others finish with the three most beautiful brunettes. It is straightforward to verify that this is actually a Nash Equilibrium, because in this case, nobody would be willing to change his action. If one of the men that approached a brunette decides to approach the blond, he will have a worse outcome because he will be rejected by the blond (who was with other guy) and will finish without any woman. In addition, nobody wants to change their woman for the remaining brunette, because she is not as pretty as the ones that they have already approached.

There is also a Mixed Strategy Nash Equilibrium

This may be a bit too technical, and it seems reasonable that the director did not include it in the movie. There are some games in which there is not a Nash Equilibrium in pure strategies. For example, in Rock Paper Scissors, if A did Rock and B Paper, it is not a Nash Equilibrium because A would prefer to change to Scissors. But this is not a Nash Equilibrium because B would prefer to change to Rock, and it continues ad infinitum. However, the situation in which both players randomize and play each action with probability 1/3 is a Mixed Strategy Nash Equilibrium, because if the other one is playing that strategy, my best response is to do it too. The four Nash Equilibria described in the previous section assume that players do not play symmetric actions (one goes with the blond woman and the other ones with the brunettes). However, we can think about a Mixed Strategy Nash Equilibrium in which the 4 men randomize their actions, and approach the blond woman with probability P and a brunette with probability 1-P. The computation of P is not complicated, but we would need to assume some Bernoulli Values in order to maximize their expected utility.

In conclusion, it is a bit frustrating that the best representation of a fascinating topic like Game Theory has so many mistakes in one of the most important scenes. In the movie, Nash says:

Adam Smith said the best result comes from everyone in the group doing what's best for himself. Right? That's what he said, right? Incomplete. Incomplete, okay? Because the best result will come from everyone in the group doing what's best for himself … and the group.

Real life John Nash would agree that Adam Smith theory was incomplete, but he would argue that the mistake was different. The real John Nash would never claim that the best result come from everyone in the group doing what is best for himself and the group, but from everyone doing what is best for himself taking into consideration what the rest of the group has done.

Paraphrasing Russell Crowe´s script, A Beautiful Mind needs revision...definitely.

sábado, 23 de diciembre de 2017

A visit to the Casino, with a human

Casinos are real laboratories for researchers in statistics, economics and psychology. The fact that people actually go to Casinos and bet with the probabilities against them is a puzzle by itself. Why are Casinos so popular? How do we decide how much to spend and when to stop? How do we control our impulsive desire to bet one more time? How do we interpret randomness and our otucome when the night is over?

I believe that I could be able explain the whole literature about Behavioral Economics just with Casinos examples. Self-Control problems, Self-Serving interpretations, Reference Points, Risk and Loss Aversion, Bounded Rationality, Heuristics and Biases. The fact that Humans are not Econs (we do not behave like Standard Economics predict, maximizing expected value of lifetime utility subject to budget constraints, without making mistakes and with limitless reasoning power and memory) is crucial to understand some persistent and robust errors that we make while allocating subjective probabilities under uncertainty. I will just enumerate and briefly explain some of these interesting phenomena.

Reference points and value function
Go anywhere in the world to an economics school and ask students how is it possible that people go to Casinos and then buy insurances. All the students will answer the same: they must be different individuals, since every decision-maker has a unique utility function consistent either with risk-averse, risk-neutral or risk-loving preferences. Therefore, people that go to Casinos or entrepeneurs are risk-lovers and do not demand insurance. We all learn this at university, but it is obvious that this is not true. Daniel Kahneman and Amos Tversky explain in their famous article Prospect Theory, that people have value functions that are usually concave for gains and convex for losses, which means that, when probabilities are not too small, people are risk-averse for gains and risk-lovers for losses. When probabilities are small the risk attitude is reversed, since people tend to forget about probabilities when they are too small and focus just on the outcomes (most people prefer a lottery with 0.01% probability of winning 6000 than other with 0.02% probability of winning 3000). Considering that small probabilities are overweighted, people are risk-lovers for gains and risk-averse for losses, which is clearly in line with the fact that people gamble (risk-loving when they can win) and demand insurance (risk-averse when they can lose).

Money Fungibility
Standard economics assumes that money is fungible, which means that people value all money the same and is indifferent between spending $100 that were acquired as part of their wage or found in the street. We all can think examples to demonstrate that money is not completely fungible in our minds. When people go to a Casino with $200 and started the night winning, then they will probably put the initial $200 in their pocket and say that they will continue playing with the gained money, "playing for free". But this will definetely not happen to an Econ! if an Econ wins $300, now he has $500 and there will not be any difference between the initial $200 and the new $300. He will have $500 and will use them in the way that maximizes his expected utility.

Planner and Doer - Self-Control
Behavioral Economics also studies self-control problems, because it is clear that on some occasions people behave in a way that is not good for themselves and, what is even more interesting, people know in advance that this will happen to them. The Planner and Doer framework implies that inside us there is a Planner that is patient, calculative and willing to maximize lifetime utility. Besides, there is a Doer who is impulsive and just focus on the pleasure of the immediate experience, without considering any consequence. Therefore, if a person is going to the Casino and the Planner anticipates that the Doer will make him spend more money than expected, the Planner can control the Doer, making him go with a particular maximum amount of money and without credit cards. Doing this, the Planner can prevent the impatient and impulsive Doer from harming the lifetime utility of the individual.

Self-Serving Bias
Another important fact that leads to more spending than expected in Casinos is the Self-Serving Bias, that makes reference to the fact that people tend to believe that good outcomes were caused by their ability and effort, and the bad outcomes are other´s responsibility or even bad luck. Again, it is easy to think about several examples to illustrate this phenomenon. If a person that is playing in a Casino believes that every time he wins it was due to his ability or perception and every time he losses he believes it was just bad luck, he will probably have overconfidence about his probabilities of winning the next time.

Gambler´s and Hot-Hand Fallacy
When people try to interpret and rationalize randomness, they will always make mistakes. The Gambler´s Fallacy is a well-known effect related with the belief that if something happens more frequently than expected in the short-run, then it will happen less frequently in the near future. Probabilities are the same each repetition, so the belief that the previous outcome cannot be repeated is a fallacy that can make people lose a lot of money.
In addition, the Hot-Hand Fallacy also comes from the need to interpret randomness. If a person has won the first 3 rounds of a random game, he does not have more chances of winning the next one. The fact that someone did well in the past is just luck, and nothing states that the result will be repeated in future repetitions.

In conclusion, this post was just a compilation of mistakes people make when they go to Casinos. Casinos are particularly interesting because people must subjectively assign probabilities to uncertain events and constantly decide between prospects. The above list of examples can be definitely longer, but let´s leave information for future posts...

martes, 28 de noviembre de 2017

Arrogant and bounded rationality

Imagine that you are part of a game with other 50 people. You have to choose a real number between 0 and 100 and a prize of $2,000 will be given to the person whose number was closest to the average divided by two. What number would you choose?

Think a bit more...

If you tried to find a solution, you may have realized that the highest reasonable number is 50, because if everyone else chooses 100, then the average will be close to 100 and you will win the prize. However, if you made an extra effort, you have probably realized that you are not the only one with the same reasoning, so if everyone bids 50, the maximum reasonable bid is now 25. But the game continues, because if you think that the other ones are as intelligent as you are, they will also choose 25, so now it is optimal to bid 12.5. If we are perfectly rational and recognize others as perfectly rational too, then the only reasonable solution for this game - its Nash Equilibrium - is that all players choose 0.

However, several experiments have been done around this game and almost nobody behaves like this. Usually bids are lower than 50 but higher than 0. But, if people can realize that choosing a number higher than 50 is not a best response, why cannot they follow the mental process to its end and arrive to the conclusion that 0 is their best option? The two possible explanations to this phenomenon are that either people are not as rational as economic theory predicts they are, or everyone believes he is the only rational person in a world of irrationals that will not realize that bidding something higher than 0 is suboptimal. In a nutshell, both situations are combined: we are not completely rational and we know that others are not as well.

This last conclusion is crucial and revolutionary, because almost all the economic theory of the last hundred years has been developed from the assumption of perfectly rational individuals that can instantaneously incorporate and process all the available information and take optimal decisions that maximize their utility. Central bankers and policy makers have been using models that assume that we all have limitless reasoning power and memory. Recognizing the real nature of people behavior is necessary to design efficient and effective public policies that may increase people wealth and happiness.

Therefore, a more accurate way of analyzing human behavior is considering that we have a bounded rationality, which means that we try to be rational, but sometimes we cannot, because problems are to complex to be solved in the available time. We simplify the problems with heuristics (rules of thumb) that are described as "judgmental shortcuts that generally get us where we need to go – and quickly". However, the psychologists Daniel Kahneman and Amos Tversky have found that although heuristics are necessary and useful, they sometimes generate systematical and generalized biases (errors). For instance, imagine this situation:

Linda is 31 years old, single, outspoken, and very bright. She majored in philosophy. As a student, she was deeply concerned with issues of discrimination and social justice, and also participated in anti-nuclear demonstrations.

Which is more probable?

Linda is a bank teller.
Linda is a bank teller and is active in the feminist movement.

Several experiments have demonstrated that under situations like this, most of the people choose the option 2, based on their heuristics and prejudices. However, it is easy to see that this is an error that we make while trying to assign subjective probabilities to unknown events, because option 2 is part of option 1 and, therefore, option 1 must be more probable than option 2. This bias is called the Representativeness Heuristic and the concept that it tries to illustrate is that people fail to distinguish when something is more probable or more representative.

But this is not the only error that people make while assigning probabilities. Imagine the following question:

Are there more words that start with N or words that have N as a third letter (in written english)?

Although there are much more words that have N as a third letter, people usually think the opposite, because it is easier to imagine words that start with N that words that have N as their third letter. This is called the Availability Heuristic and the logic is that if it is easier to imagine a situation, it must be more likely. This bias generates, for example, more demand of "flight insurance" than "any reason insurance" at the same cost.

In conclusion, during the last decades there have been important developments in Behavioral Economics, the new field that studies how people actually behave (with their heuristics and biases) and tries to design efficient public policies based on this behavior. In my opinion the conjunction between Game Theory and Behavioral Economics is a powerful platform to analyze strategic behavior of people and groups under particular circumstances. You may probably not believe me, but you must trust Vilfredo Pareto:

“The foundation of political economy and in general of every social science is evidently psychology.”

Why do people attend University?

All students have asked themselves - at least once - if it is actually efficient to spend time studying irrelevant concepts at university, while they could be already working for employers that would never require most of those concepts. History dates and facts, integrals, subjects and predicates are just some examples of topics that students need to incorporate during their learning process and demonstrate it in specific examinations.

In my particular experience, I have worked during my bachelor degree and after finishing it. Now, that I am pursuing a MSc in Behavioral Economics and Game Theory, I find myself again studying theories and topics that would never be required in a professional environment and I cannot avoid feeling a bit frustrated by this situation. What is more, although companies do not need their workers to know much of the concepts they learnt at university, they still require that applicants have finished particular programs and with minimum GPA. The apparently irrational situation could be illustrated like the following: Jake goes to undergraduate and graduate school and spends 6 years of his life studying and learning things that he knows will be completely useless in his future professional career. In the meantime, Companies filter applicants that do not have Jake background, although they know that Jake´s knowledge is completely irrelevant and unnecessary.

So, why does Jake still attend University? and why do companies still require unnecessary and irrelevant knowledge?

Fortunately, Game Theory has a brilliant answer to this question. We can think about the job search process as a game with incomplete and asymmetric information. The Applicant goes to a job interview knowing his characteristics, strong and weak points and the Employer has to evaluate and compare this Applicant with other ones, based on the information that the Applicant provides and other references that he may get. Being in a job interview is not an easy task, but all applicants know that they must sellthemselves. Thus, they will try to seem intelligent and the ideal candidate to fulfill the vacancy. If an Employer needs a worker to do an analytical job, then it will be extremely difficult to choose among several candidates that are making their best to seem analytical profiles. Therefore, the Employer will try to find a signal that help him determine the type of the Applicant (for simplicity, assume Good or Bad). All Applicants will try to seem Good candidates, so they will be willing to provide a good signal to the Employer. This means that in order to have a useful signal, it must be costly, because otherwise all the Applicants would be able to deliver the good signal and it will become completely uninformative.

University programs can be interpreted as costly signals that job applicants provide to their potential employers. A Bad Applicant for an analytical position, would need to put much more effort to finish an analytical university program and, if he achieves it, probably will not even try to attend graduate school. On the other hand, a Good Applicant for an analytical position, needs less effort to understand analytical tasks. Considering that effort is a source of disutility (all of us want to maximize our utility, and putting effort in a task lowers our utility), probably just Good Applicants will finish analytical PhD programs and the Employer knows that if he sets a PhD degree as a minimum requirement, he will probably find just Good Applicants. Of course, this does not mean that there are not Good Applicants without graduate studies or that there are not Bad Applicants with PhD degrees, but the signal of studying particular programs is so costly, that the probability of someone not ideal for it actually doing it, is definitely low.

So, why do people attend university? the first answer would be that people attend university to learn but this vision is completely biased. Students decide to go to university in order to get the appropriate signal that will give them the opportunity to be considered for particular jobs. Therefore, the difference between a PhD in Economics from Harvard and an applicant without a university degree is not how much they actually know, but a signal about what type of challenges each applicant can overcome. In other words, if a person was accepted in Harvard and finished his PhD with a Cum Laude diploma, it means that he was able to learn extremely difficult - and probably useless - concepts and that he was able to outperform in complicated and stressful situations. So, if he could learn how to calculate complicated integrals and optimizations, he will probably be ready to learn the particular tasks related with his job position.

Going back to the Game, the Applicant will go to the job interview trying to seem a Good candidate. However, the Employer will use the university degree as a signal about the candidate suitability for the job position. Knowing this in advance, young and ambitious students will decide to go to university and pursue complicated programs, no matter if the content is useful or not. Bad applicants will have to put so much effort that the disutility will be high enough to discourage them to continue and will never get the appropriate signal, while Good applicants will finish the program because the necessary effort will not be so high for them.

In conclusion, this analysis states that students do not go to university to learn, but for signaling. This does not mean that they cannot learn anything in the process; in fact they will probably learn several useful and irrelevant concepts. In addition, there still are job positions for which this theory does not apply, such as medicine, architecture or academia. For all these examples, university is also the first quasi-internship, so the conclusions may be completely different.

References:

Job Market Signaling (Michael Spence). The Quarterly Journal of Economics, Vol. 87, No. 3. (Aug., 1973), pp. 355-374

La dispersión del Real-Time Bidding

Imagine el lector la siguiente situación y piense rápidamente que haría si se lo pusiera en ese escenario:

Se le ofrece elegir entre obtener 5.000 dólares con seguridad o tirar una moneda. Si sale cara, recibirá 10.000 dólares pero si sale seca no obtendrá nada.

Muy probablemente el lector habrá elegido por la opción de recibir 5.000 USD con seguridad a pesar de que el valor esperado de ambas loterías era exactamente el mismo. Esto ocurre porque los humanos somos, la gran mayoría, aversos al riesgo. Es decir, que una situación de riesgo nos quita utilidad y preferimos loterías con menor valor esperado, siempre y cuando disminuyan también el riesgo. Si el lector aún duda acerca de esta afirmación, ¿no aceptaría acaso también recibir 4.500 USD antes de tirar al aire la moneda con la esperanza de ganar 10.000 USD con un 50% de probabilidad?

Para los que aún se resisten a considerarse agentes aversos al riesgo, los invito a pensar si hoy en día son propietarios de algún seguro, ya que estas herramientas son optimamente demandadas por agentes aversos al riesgo, dispuestos a pagar un monto mayor a la probabilidad del suceso con tal de cubrirse del riesgo de su ocurrencia.

La teoría de selección óptima de portafolios de inversión ha incorporado la aversión al riesgo como un aspecto central e incuestionable. Teoría y práctica coinciden en que un inversor únicamente aceptará un mayor riesgo si viene acompañado por un mayor retorno esperado. Quienes adquieren bonos están dispuestos a obtener menores retornos esperados que quienes invierten en acciones, cuyo retorno esperado es superior, pero viene acompañado de un riesgo comparativamente elevado.

Es curioso entonces que los mecanismos de decisión de pujas óptimas en el Real-Time Bidding (RTB) consideren únicamente el valor esperado de la impresión. Actualmente, el RTB evalúa el Response Rate (Click-Through Rate o Conversion Rate según el caso) de impresiones similares en el pasado y puja un valor proporcional al resultado muestral. Es decir, que el RTB en ningún momento incorpora el impacto del riesgo en su análisis y supone, por lo tanto, que los compradores de medios resultamos indiferentes al comprar una impresión que con seguridad nos dará un CTR del 0.5% o una que nos generará un CTR de 1% con un 50% de probabilidad y de 0% con el otro 50% de probabilidad.

La medida ampliamente utlizada para incorporar al riesgo a cualquier proceso de decisión es la varianza. La varianza es la medida estadística que nos permite medir la dispersión y, por lo tanto, ante el mismo valor esperado (Response-Rate), los compradores estarán dispuestos a pagar más por aquella impresión con una menor varianza.

Ahora, para quienes consideran que el problema finalizaría con la inclusión de la varianza al mecanismo de decisión de pujas óptimas aún están equivocados. Si bien implicaría un paso fundamental para volver al RTB aún más eficiente que lo actual, el enfoque de media-varianza solo sería suficiente para distribuciones de probabilidades normales. Si ese no fuera el caso, sería necesario también incorporar el tercer y cuarto momento de la función de distribución de probabilidades. Una distribución con sesgo positivo será más demandada que una con sesgo negativo por un agente averso al riesgo, más allá de que tengan la misma media y varianza. Además, una distribución leptocúrtica (valores concentrados cerca de la media) será preferible, frente a otra platicúrtica (con colas pesadas).

En conclusión, no quedan hoy en día dudas de que el RTB es el mecanismo de compra-venta de medios más eficiente del mercado ya que es el único que arriba a un Equilibrio de Nash, superando económica y estadísticamente a Google Adwords y Facebook Ads. No obstante, en el futuro deberá continuar el mismo camino que han tomado los enfoques de selección óptima de portafolios de inversión, dejando en el pasado el supuesto de neutralidad al riesgo e incorporando la varianza, sesgo y curtósis de la función de distribución de probabilidades logrando, de esa manera, pujar exactamente el valor que cada anunciante desee por cada impresión, sea este último averso, neutral o amante al riesgo.

Header Bidding y el atrapante Teorema de Equivalencia de Ingresos

El Teorema de Equivalencia de Ingresos pertenece a la Teoría de Juegos y plantea que, bajo ciertos supuestos, el precio promedio al que se termina vendiendo cualquier bien es, en el largo plazo, el mismo en Subastas de Primer o Segundo Precio.

Las derivaciones de este -poco- conocido Teorema son monumentales, ya que es la justificación matemática que se utilizó para desestimar la demanda de los Publishers de pasar a esquemas de Subastas de Primer Precio, en lugar de las Subastas de Segundo Precio que hoy en día dominan los mecanismos de compra-venta en el mercado de medios digitales (Real-Time Bidding). El argumento del Teorema, intuitivamente, es que en Subastas de Primer Precio los jugadores ganadores (de mayor valuación) están impulsados a bajar su puja progresivamente en cada repetición, hasta llegar a conocer la valuación del segundo jugador en la lista y ofrecer un centavo más. En cambio, en las Subastas de Segundo Precio, el incentivo de todos los jugadores es ofrecer su propia valuación ya que de esa manera la oferta mayor gana, paga un centavo más que la segunda y obtiene el bien, alcanzando un Equilibrio Bayesiano de Nash.

Sin embargo, la aparición de los Private Marketplaces (PMP) y del Header Bidding son claras demostraciones de que el mecanismo económicamente eficiente para los jugadores (Advertisers) no lo era para los rematadores (Publishers), puesto que estos últimos han encontrado que existe la posibilidad de obtener ingresos mayores a partir de innovaciones tecnológicas y nuevos modelos de negocio.

Ahora, ¿qué es lo que fallaba en el mecanismo anterior que llevó a los Publishers a encontrar mejores resultados con la incorporación del Header Bidding? ¿por qué motivo si los Publishers deberían estar obteniendo con las Subastas de Segundo Precio el mayor ingreso posible, solicitaron durante estos años migraciones a mecanismos con Subastas de Primer Precio y/o la incorporación del Header Bidding?

La respuesta a ambas preguntas parte de un análisis específico del Teorema de Equivalencia de Ingresos. En primer lugar, como dijo Lord Keynes, en el largo plazo estamos todos muertos y esta es una premisa que en la práctica no podemos omitir. En segundo lugar, es fundamental considerar que para poder confiar en las conclusiones de un Teorema primero hay que garantizar el cumplimiento de sus supuestos. Para que el Teorema de Equivalencia de Ingresos se cumpla son necesarias las siguientes condiciones:

1) Cada jugador conoce su propia valuación, que es privada e independiente

2) El pago depende únicamente de las pujas realizadas

3) Los jugadores son neutrales al riesgo

4) Los jugadores son simétricos, es decir, que poseen la misma información

Los primeros dos supuestos no generan grandes debates en este contexto. El de neutralidad al riesgo es interesante por si mismo, ya que podría pensarse en que es una simplificación suponer que los Advertisers evalúan únicamente el valor esperado de la impresión (Response Rate) para determinar la puja y sería más prudente pensar en esquemas que contemplen también la varianza, el sesgo y la curtosis de la función de distribución de probabilidades. No obstante, este supuesto no pone en jaque el mecanismo cuestionado en este artículo ya que todo el sistema está armado sobre el supuesto de neutralidad al riesgo, más allá de que sea adecuado o no.

En contraste, el supuesto de jugadores simétricos es completamente relevante en esta discusión: para que el Publisher pueda recibir en promedio el mismo ingreso por Subastas de Primer o de Segundo Precio, es necesario que los jugadores (Advertisers) tengan todos acceso a la misma información en el mismo momento y que, por lo tanto, puedan realizar las pujas con exactamente la misma prioridad.

El Header Bidding, que consiste en un sistema mediante el cual los Publisherspueden recibir ofertas de todos los Advertisers simultáneamente, eliminando cualquier esquema de prioridades, es la respuesta sistémica a un mecanismo previo matemáticamente ineficiente. En resumen, primero los Publishers propusieron migrar de Subastas de Segundo Precio a Subastas de Primer Precio. Cuando la respuesta fue negativa y fundamentada en el Teorema de Equivalencia de Ingresos, entonces los Publishers optaron por exigir que se cumplan los supuestos necesarios para que el Teorema entre en vigencia y sus conclusiones se observen empíricamente.

En conclusión, la incorporación del Header Bidding al esquema de compra-venta de espacios publicitarios digitales, no hace más que acercar cada vez más a esta industria caracterizada por el uso de Big Data y de las decisiones automatizadas en tiempo real a la eficiencia microeconómica necesaria para que todos los agentes participantes obtengan el máximo beneficio posible por su participación. Abrazar la incorporación del Header Bidding implica entender que es clave en esta industria eliminar cualquier fricción que genere ineficiencias que provoquen luego la necesidad de replantear los mecanismos de compra-venta de medios digitales.