22 Apr 2013 ... Marius Zoican additionally thanks the participants from the Tinbergen Institute
and DSF Brown Bag Seminars for insightful comments. 1 ...
Need For Speed? Low Latency Trading and Adverse Selection Albert J. Menkveld and Marius A. Zoican ∗ April 22, 2013
- very preliminary and incomplete. comments and suggestions are welcome -
First version: 1 February 2013
∗
Both authors are affiliated with VU University Amsterdam, the Tinbergen Institute and Duisenberg School of Finance. Address: FEWEB, De Boelelaan 1105, 1081 HV Amsterdam, Netherlands. Albert Menkveld can be contacted at
[email protected]. Marius Zoican can be contacted at
[email protected]. The authors would like to extend their gratitude to EMCF for data sponsoring this project. We have greatly benefited from discussionson this research with Istvan Barra, Alejandro Bernales, Peter Hoffmann, Olga Lebedeva, Emiliano Pagnotta and Bart Yueshen Zhou. Albert Menkveld gratefully acknowledges NWO for a VIDI grant. Marius Zoican additionally thanks the participants from the Tinbergen Institute and DSF Brown Bag Seminars for insightful comments.
1
Need For Speed? Low Latency Trading and Adverse Selection
Abstract This paper investigates the impact of market-wide low latency trading technologies on informational asymmetries between traders. It develops a model with two types of high-frequency traders: market makers (HFT-M) and ”bandits” (HFT-B), who profit by trading on stale quotes. The HFTs endogenously decide on information acquisition. Competitive HFT market-makers face higher adverse selection risks when latency drops, as the conditional probability of a liquidity motivated trade decreases. In equilibrium, they will charge higher spreads to compensate for the additional risk. Lower market latencies also provide incentives for market makers to gather information. Informed market makers reduce expected losses on stale quotes while still attracting liquidity demand from uninformed traders. We find empirical support for the model implications: the adverse selection component of the bid-ask spread increases by 15% (0.35 bps) on NASDAQ OMX after introducing low-latency technology. The effect is stronger in more volatile securities. Keywords: market microstructure, trading speed, information asymmetry, high frequency trading JEL Codes: G11, G12, G14
2
1
Introduction
Market venues have invested considerable effort and financial resources in recent years to improve the speed with which traders can submit limit or market orders. Pagnotta and Philippon (2011) list the most important low-latency investments around the globe from 2008 to 2012: for example, in 2009, NYSE reduced its latency to 5 down from 150 milliseconds. Similar investments were undertaken by stock exchanges in Tokyo, Singapore, London or Johannesburg. As of October 2012, the fastest trading system in the world belongs to ALGO Technologies, with a round trip latency of 16 microseconds. The trading world already faces the lower bound of the speed of light as potentially binding1 . Hasbrouck and Saar (2010) define trading latency as ”the time it takes to observe a market event (e.g., a new bid price in the limit order book), through the time it takes to analyze this event and send an order to the exchange that responds to the event”. This paper is primarily focused on changes in the latter component of market latency, which depends on the trading platform’s technology rather than the individual traders’ algorithms. Hence, in this paper, we define by ”market latency” the time it takes an order to reach the market and confirmation is received back by the trader. For the high frequency traders, who usually co-locate their computers with the venue’s servers, the latency is already very low (a few milliseconds), and any improvement in market latency is likely to have a large impact on trading times. On the other hand, for lower frequency traders, such a drop would likely have little effect on their market access times. The main research focus of this paper is analysing the effects of low latency-trading on the information asymmetries across traders, adverse selection risks, spreads and ultimately social welfare. Foucault (2012) argues that ”the jury is still out” on this issue, as the ultimate effect of low-latency trading on adverse selection depends on HFT specialisation. In low-latency environments, highfrequency market-makers are able to quickly update the quotes before better informed speculators can trade against them. On the other hand, high-frequency speculators symmetrically improve their ability to reach the market faster, thus being equally able to act on information as the market maker. The net effect on information asymmetries is non-trivial as the HFT sector does not specialise only in passive or speculative strategies. The Securities and Exchange Commission identify various strategies of high frequency traders, including passive market-making , directional trading, arbitraging, and structural trading - taking advantage of market frictions (”vulnerabilities”) such as stale quotes2 . Hagstromer and Norden (2013) find supporting evidence of order type specialisation among 1 2
See How Low Can You Go?, HFT Review, April 2010 See SEC Concept Release on Equity Market Structure
3
high frequency traders on the Swedish markets; Baron, Brogaard, and Kirilenko (2012) additionally show that these strategies are persistent through time. In our model, we model the complexity of the environment by assuming high frequency traders engage both in market-making activities as well as speculating short-term trends. We allow for the fast market maker to endogenously decide to become informed or not. In this respect, the paper extends the implications of Foucault, Hombert, and Rosu (2012) who investigate latency effects when market-makers are both slow and uninformed. We ask what would happen to spreads, information acquisition and welfare if the market suddenly offers a new technology that improve the HFTs’ reaction time even further - regardless of their strategy. We build a limit order market model with costly monitoring, endogenous information acquisition and deterministic market latency. We extend Foucault, Roell, and Sandas (2003) by allowing for trader heterogeneity (between HFT and non-HFT) in the time necessary to reach the market and then studying the equilibrium as we widen the gap between the two categories’ market response times. In the competitive equilibrium, HFT market makers earn zero rents and face a larger conditional probability of meeting an informed trader - hence a higher adverse selection risk. The additional risk is partly compensated by larger spreads, and partly by market markets withdrawing quotes. The unconditional probability of a liquidity trader (assumed to be a market taker) realising a trade is falling for larger market speeds, as the market is increasingly dominated by high-frequency traders. This result is robust to changing the competition on the market between the dealers; we consider the polar cases of Bertrand competition and a monopolist dealer. To empirically identify the effect of lower market latency on trading outcomes and adverse selection, we use as an instrument the introduction of the INET Core Technology on NASDAQ OMX (the incumbent market in the Nordics) on February 8, 2010. Prior to this event, NASDAQ OMX had a round-trip latency of 2.5 ms, lagging behind its competitors: Chi-X Europe with 0.400 ms or BATS with 0.270 ms. The new technology allowed NASDAQ OMX to reduce its round-trip latency 10 times, to 0.250 ms, effectively making the incumbent market the fastest venue for Nordic securities. This paper focuses on the precise channel between market latency and adverse selection given trading environment with exogenous order types. A potential extension in future versions of the paper is to endogenize the make/take decision of high frequency traders and study the interaction between such choices and market latency, especially through competition between limit order submitters. The structure of trading fees with limit order rebates is also an important factor influencing the endogenous order type decision. In our current model, spreads fall when limit orders are allowed to execute at lower latencies than market orders - which generates a positive welfare effect in periods 4
of little volatility. The rest of the paper is structured as follows: Section 2 briefly reviews the literature on the advantages and disadvantage of low-latency trading. Section 3 develops a theoretical model of the market using a competitive dealer assumption (Bertrand-like competition). We find that the optimal spread increases with market speed, whereas welfare is reduced. The perfect competition assumption is relaxed in Section 4, where we consider a monopolist dealer who can freely maximise his profits, and we obtain the same qualitative results. In Section 5 we present the dataset, formulate the econometric specification testable hypotheses of the model, and an identification strategy based on a natural experiment. Section 6 discusses the empirical results. Section 7 concludes.
2
Related Literature
The academic literature has brought forward a number of arguments in favour of low latency trading. Faster access to the market is argued to lead to competitive liquidity supply - as in Hendershott, Jones, and Menkveld (2011). Limit order submitters are able to react quicker on each other’s quotes, and the risk of being undercut by a different passive trader before a market order arrives is larger. This reduces the market power of a limit order submitter and provides the incentives to set tighter spreads. However, as Biais, Foucault, and Moinas (2013) also argue, the algorithmic trading proxy in Hendershott, Jones, and Menkveld (2011) could capture changes that go beyond fast trading - for instance, algorithmic trading might have lowered search costs across markets.
3
A second advantage of trading speed is that low latency trading can result in better price discovery - as argued by Hendershott and Riordan (2011). Improving the reaction times to new information implies that quotes and trading prices are also incorporating innovations faster. Pagnotta and Philippon (2011) claim that speed can be used as an instrument by markets to vertically differentiate in trading speed and attract different clienteles. Hence, fast markets would charge a premium to the traders with volatile private values, who value speed most. The other market participants, with a low preference for speed, would be able to use the slower market’s services for a lower fee. On the other hand, low latency trading may also have negative effects for the markets. High frequency traders with better information can have an unfair advantage in adversely selecting other market participants, especially since they can process news faster - see Foucault, Hombert, and Rosu (2012). Several papers point out to the positive relationship between adverse selection and 3 A Swedish government report - see Finansinspektionen Report 2012, states than 24 companies use algorithms to trade, while only 3 use high frequency trading in the operations. Thus, there is a clear distinction in practice between algorithmic and HF trading, with the latter being a strictly smaller subset of the first.
5
high frequency trading. Hendershott and Riordan (2011) find a larger permanent impact of higher frequency market orders compared with slower ones. The price impact is actually found large enough to overcome the bid-ask spread. In the same line, Baron, Brogaard, and Kirilenko (2012) show that high frequency traders earn short-time profits through aggressive orders, which is consistent with adversely selecting other market participants. Brogaard, Hendershott, and Riordan (2012) also find results consistent with low-latency traders imposing adverse selection costs on the other market participants. Biais, Foucault, and Moinas (2013) develop a theoretical model of low-latency trading showing that when some agents become fast, all traders incur higher adverse selection costs. Hoffmann (2010) focuses on adverse selection differentials across market venues. He finds that the adverse selection component is larger on the entrant venues, and positively related to market volatility. Furthermore, there is an ongoing debate about the possibility of low-latency trading inducing unnecessary market volatility, leading to events such as the Flash Crash. However, evidence so far leans against the hypothesis that high frequency traders were responsible for the Flash Crash of May 2010 -Kirilenko, Kyle, Samadi, and Tuzun (2011). Compared to Jovanovic and Menkveld (2011), this paper focuses on the informational aspects of faster markets. We endogenize the arrival times to capture the latency changes but we fix the order type by assuming liquidity traders to be takers. We also explicitly introduce high frequency traders who trade only on information (as in Foucault, Roell, and Sandas (2003)), rather than for a market-making reason. In low-latency environments, both liquidity suppliers and demanders (potentially with better information) take less time to access the market: limit orders submitters can withdraw/update quotes faster before being adversely selected, whereas speculators can act quicker on private signals and earn rents by adversely selecting the market makers. Our contribution is to analyse the implications of this symmetry in latency innovations between different types of high-frequency traders engaged in passive and short-term speculative strategies.
3
Market Latency with a Competitive Market Maker
In this section, we develop a model of costly monitoring in a limit order market with stochastic times to market, based on Foucault, Roell, and Sandas (2003), to generate hypotheses than can be tested empirically. We are interested how a drop in market latency, affecting mostly both high-frequency market-makers and speculators influences information acquisition by dealers, adverse selection probabilities, bid-ask spreads and welfare.
6
3.1
Primitives
This section presents the model’s primitives. Extensive motivation for these primitives is left to subsection 3.2. 3.1.1
Asset and market
There is a single risky asset in the economy, with a stochastic value v˜. Define τ as an exponential random variable with mean
1 α
and N (t) = It>τ . The dynamics of the asset value are given by the
following process: v (t) = v (0) + Y × N (t)
(1)
In equation 1, Y is a random variable capturing the size and sign of news. When a jump in the asset value occurs, it can either be interpreted as good or bad news of magnitude σ, where σ > 1. Hence, the distribution of Σ is given by: P (Y = σ) = P (Y = −σ) = 12 . After a jump at time t, the asset value is: vt+ = vt− ± σ. One can think about the asset dynamics as a compound Poisson jump process with intensity α, truncated after the first arrival: once the first jump occurs, there are no further changes in the asset value. The asset is traded on a limit order market with price priority - the largest bid and smallest ask execute first. 3.1.2
Agents
In the market for the risky asset, there are 2 types of high-frequency risk-neutral agents: a representative competitive market maker: HFT-M, who posts limit orders, and a representative speculator or ”bandit”: HFT-B, who can post market orders - in the terminology of Foucault, Roell, and Sandas (2003)). There is also a representative low frequency trader (LFT) who experiences liquidity shocks at random times. The HFTs arrive to market with a deterministic delay ∆H , whereas the LFT has a deterministic delay of ∆L . The high frequency traders have no private valuation for the asset and are risk neutral. HFT-B has no monitoring costs and perfectly observes the asset value at any time t, whereas HFT-M can invest in a monitoring technology allowing perfect tracking of the asset value by paying a positive cost c (the difference between the monitoring costs is motivated in subsection 3.2). The LFT has additional private values for the asset uniformly distributed between [−θ, θ], such that θ < σ, but no technology to track jumps in the asset before the HFTs do4 . The LFT receives a 4
The liquidity trader observes the information after the HFTs (they are an order of magnitude slower)
7
liquidity shock according to a Poisson process with intensity µ. Both HFT have a reservation utility of 0 (can decide to not participate on the market). The competitive market-maker HFT-M earns his reservation utility - his participation constraint holds strictly, as in Bertrand competition. 3.1.3
Timeline
Monitoring Stage (T=0) The market-maker chooses whether to acquire information (and pay the fixed cost c) - strategy I, or stay uninformed (thus saving the fixed investment) - strategy U . The bandit makes a similar decision, but has zero costs of monitoring. It follows immediately from the assumptions that it is optimal for him to always monitor the news process. Quoting Stage (T=1)
The value of the asset is publicly observable (the initial condition for the
Poisson process): v0 . The market-maker posts bid and ask quotes for the asset. As in Foucault, Roell, and Sandas (2003), the bid quote is set to v0 − s and the ask quote is v0 + s, where s is the half spread. Since HFT-M is risk neutral (has no inventory concerns) and the liquidity traders are uniformly distributed and they are informed only at T = 3, s = sa = sb Trading Stage (T=2)
5
The trading game starts once quotes are posted in the market and runs
in continuous time until a quote is either consumed or withdrawn. Thus, three events can happen: 1. The LFT receives a liquidity shock arrives to the market before any news and executes a market order f his private value θi is larger than the half-spread |θi | ≥ s. 2. There is news before the LFT arrives to the market. Then, either: (a) The LFT executes a market order before any HFT arrives to the market (during the ∆H interval - HFT latency) (b) If the LFT does not trade in the ∆H interval following news, an HFT would arrive to the market first: i. With probability γ, HFT-M arrives before HFT-B and he will withdraw the stale quotes to avoid a loss. ii. With probability 1 − γ, HFT-B is first and he will execute a market order to make a profit on the stale quote. 5
As Foucault, Roell, and Sandas (2003) argues, since the problem is symmetric, sa 6= sb would make no difference.
8
For the remainder of the paper, we will take γ = 12 : conditional on an HFT arriving first after news, the HFT-M and HFT-B arrivals are equiprobable. The game tree and utility functions associated with this model is presented in Figure 1: [Figure 1 here]
3.1.4
Strategy Spaces
The market maker and the bandit have 3 possible actions, presented in the following table. In addition, the dealer chooses the equilibrium spread corresponding to each of these pure strategies: there is a one-to-one correspondence between the set of strategies below (excluding s) and the set of dealer strategies including the spread. This stems from the fact that, as we prove in the next subsection, the optimal monitoring strategy for the bandit is always to purchase information. Hence, between T0 and T1 , the information set of the dealer is unchanged, which is equivalent to the monitoring and spread decisions being taken at the same time.
3.2
Strategy
Monitoring
Decision after asset value jump
IR
become Informed
Rush to market
IN
become Informed
Do Not Rush to market
UN
become Uninformed
Discussion of the assumptions
The trading environment, asset value dynamics and trader types are largely based on Foucault, Roell, and Sandas (2003). We change the focus from externalities to the market speed game, and thus we do not model the interactions between dealers. Hence, we build our model around a ”representative” HFT from each type: one bandit and one market maker. We can think of the representative high frequency traders as the aggregate of a continuum of HFT-M and HFT-B. The perfect competition assumption, which assumes that market makers constantly undercut each other until there are no profit opportunities in expectations is relaxed in the next section, where we consider the opposite polar case: a monopolist HFT-M who can extract maximum rents from liquidity traders. Our main results are qualitatively robust with respect to this change of the competitive environment. Since we abstract from externalities among dealers, which in Foucault, Roell, and Sandas (2003) were necessary to keep the spreads from exploding, we introduce a private value distribution for 9
the liquidity traders, leading to a downward sloping demand function, as in Ho and Stoll (1983) or Hendershott and Menkveld (2011). This ensures that the dealer cannot completely eliminate the adverse selection risk through spreads, as he will then lose all profitable trading opportunities with the liquidity traders. Monitoring strategies are decided upon before the trading starts, as in Foucault, Roell, and Sandas (2003), but discrete in the effort/signal probability set {cost, P (Inf ormation)} ∈ {{0, 0} , {c, 1}}. We argue that investments in monitoring algorithms are done in larger update batches (a fixed cost c) than being fine-tuned for each trade. We model our game with the HFT-B having zero monitoring cost. This assumption is meant to capture the idea that while any bandit can act on a signal and adversely select the dealer, the market maker has to monitor the news and defend against all bandits. Our model can be thought of a reduced version for the following environment: a ”representative” bandit stands in front of many speculators who might observe a signal with low frequency. If there are enough of these ”bandits”, then a representative HFT-B will obtain such a signal at high frequency. When deciding upon monitoring, the dealer has to be able to outsmart any speculator if he was to avoid adverse selection. Our results are robust to relaxing the assumption of γ =
1 2
to any γ ∈ [0, 1). The comparative
statics with respect to the HFT latency (∆H ) are qualitatively the same for any interior probability γ. The results are strongest for the case γ = 0. This corresponds to the situation when the HFT-M cannot use any information to update his quotes, so he is practically an uninformed market-maker (as in Foucault, Hombert, and Rosu (2012) for example). The other extreme, when γ = 1 corresponds to the case where the HFT-B can never speculate on information; this is a trivial case where there is no adverse selection and therefore the latency is irrelevant to the model.
3.3 3.3.1
Expected Payoffs of HFTs Irrelevance of the LFT market delay ∆L
We define the private value process of the LFTs by N 0 (t) - a Poisson process with intensity µ. Then the LFT market arrival process can be defined as M (t) where it holds that: M (t + ∆L ) = N 0 (t)
(2)
Thus, if a liquidity trader receives his private value at time t (jump in the N 0 (t) process), it will arrive on the market at time t + ∆L (a jump in the M (t) process). It is trivial that if the inter-arrival jump time in the N 0 process is exponentially distributed, then the same property holds for the M (t) process, as all arrival times are shifted by the same deterministic quantity.
10
Lemma 1. The market arrival process of the liquidity traders is equivalent in distribution to a Poisson process of intensity µ if the private value shock process starts at least ∆L before the quotes are posted. Hence, the deterministic delay ∆L is irrelevant for the distribution of LFT market arrival times. Proof. See appendix A. 3.3.2
Outcomes of the game
We consider now all the potential outcomes of the game and their respective probabilities, as well as payoffs for the HF traders. LFT arrival before news If a LFT arrives at τLF T before any jump in asset value, he will trade if his private value exceeds the half-spread in absolute value. Since both news and market arrival of liquidity traders can be modelled as Poisson processes with intensities α and µ, the probability of this outcome is given by: P {τLF T < τN ews } =
µ µ+α
(3)
Given the game ends with a LFT arrival, the market marker earns in expectation s (1 − s) - the half spread times the probability of trade and the HFT-B earns 0 (no trade). LFT arrival after news The probability of the jump in asset value arriving first is the complement of the previous outcome: P {τLF T > τN ews } =
α µ+α
(4)
Given this event, there are 3 potential outcomes: No LFT arrives in ∆H , HFT-M withdraws quote
This outcome is only possible if HFT-
M monitors the asset value. The probability of no jump in M (t) during an interval of ∆H is given by exp (−µ∆H ), from the proprieties of Poisson processes. Conditional on it, HFT-M has a probability of
1 2
of arriving before the bandit and withdrawing his quote - we prove in the following section that
HFT-B has IR as a dominant strategy. The full probability of the outcome is thus given by: P (HF T − M f irst) =
α µ+α | {z }
N ews bef ore LF T
11
× exp (−µ∆H ) × | {z } HF T f irst
1 2 |{z}
HF T −M f irst
(5)
In this case, both the market marker and the bandit earn zero profits. No LFT arrives in ∆H , HFT-B executes market order In the case HFT-M monitors the news, the outcome probability is symmetrical to HFT-M arriving first to the market: P (HF T − M f irst) =
α µ+α | {z }
× exp (−µ∆H ) × | {z }
N ews bef ore LF T
HF T f irst
1 2 |{z}
(6)
HF T −B f irst
In the case HFT-M is uninformed, the outcome probability is double, since there is no more competition between the bandit and the market-maker to arrive on the market. Given the game ends with a HFT-B arrival, the bandit gains σ − s (the extent to which the quote is stale) whereas the market-maker loses the same amount. One LFT arrives in ∆H , LFT executes market order The probability of a jump in M (t) during an interval of ∆H is given by 1 − exp (−µ∆H ). The outcome probability is thus: P (HF T − M f irst) =
α µ+α | {z }
N ews bef ore LF T
× (1 − exp (−µ∆H )) | {z }
(7)
LF T f irst
The payoffs in this case are identical with the first outcome described: the market marker earns in expectation s (1 − s) - the half spread times the probability of trade and the HFT-B earns 0 (no trade).
3.4
Dominated/Dominating Strategies
In this subsection, we seek to restrict the strategy spaces of the market maker and bandit, by eliminating the strategies which are not optimal and cannot be part of an equilibrium under any parameter values. 3.4.1
The ”Bandit” : HFT-B
We begin by analysing the HFT-B’s dominated strategies. We show that the bandit’s optimal strategy is to always monitor and always submit a market order after news. With costless information, this is intuitive: the bandit does not risk any losses and faces a positive probability of earning an adverse selection profit - hence he has the incentive to post a market order if the dealer’s quotes no longer reflect the asset’s true value.
12
Lemma 2. (1) The bandit will always choose to monitor and submit a market order after news arrive to market - strategy IR is a strictly dominant strategy, for any s < σ. (2) The bandit will never rush to the market if s ≥ σ. Proof. See appendix A. The first result is natural since the HFT-B can obtain information costlessly. Remaining uninformed and taking an action is equivalent then to monitoring and taking the same action (practically not acting on information). Thus, HFT-B cannot be worse off if he monitors and potentially he can become better off by switching his rush/do not rush decision. The second result implies that the bandit has a single unique strategy: always post a market order when the quote available allows him to make a profit - when the value of the asset changes before the market maker gets to update his quotes. The worst case scenario is that he will not arrive first at the market and thus miss the opportunity. When there are no news, HFT-B has no incentive to post market orders, as he has no private value for the asset and thus no incentive to pay the spread. The intuition for the final part of the lemma is that the potential adverse selection profit for HFT-B is the difference between the size of the news and the stale half-spread, thus σ − s. If this quantity is negative, the bandit ends up paying more in trading costs than he will earn by the change in asset value. 3.4.2
The Market-Maker: HFT-M
Lemma 3. (1) For the HFT-M, the strategy IN is strictly dominated in the monitoring-trading subgame. That is, the market-maker will never choose to pay for information and then never rush to withdraw his quotes. (2) It is never optimal for the dealer to set a half-spread s ≥ 1 or s = 0. Hence, the half-spread strategy space is reduced from R+ to (0, 1). The market-maker’s positive cost deters him from monitoring if his strategy is to not to act on the information bought. Monitoring is only useful in a separating equilibrium: HFT-M takes a different action if there is news than if there is no new information. In the third part, we claim that the market-maker will never set a spread so high that it will drive out all the demand from liquidity traders - his only rationale for being in the market. Since private values never exceed 1 in absolute value, any higher spread will deter the liquidity traders from posting market orders. Conversely, a spread of 0 will give HFT-M no profits, while still exposing him to adverse selection risk from the bandit.
13
3.5
Equilibrium results under Bertrand competition
Using Lemmas 2-3, we know that the equilibrium of the game is of the general form: • at T = 0: the bandit always monitors the news. The market maker can either monitor or not. • at T = 1: HFT-M sets a half-spread s ∈ (0, 1) • at T = 2: HFT-B always rushes to the market after news, never if there are no news. If the market-maker monitored at T = 0, he will behave in the same manner as the bandit. The market maker’s utility functions for the 2 remaining potentially optimal HFT-M strategies are as follows:
IR : EUHF T −M
1 µ α = s (1 − s)+ exp (−µ∆H ) (s − σ) + (1 − exp (−µ∆H )) s (1 − s) −c µ+α µ+α 2 (8)
U N : EUHF T −M =
µ α s (1 − s) + [exp (−µ∆H ) (s − σ) + (1 − exp (−µ∆H )) s (1 − s)] µ+α µ+α (9)
Both these functions are strictly concave in s (negative second derivatives): ∂ 2 EU (IR) ∂ 2 EU (U N ) = =2 ∂s2 ∂s2
α exp (−µ∆H ) − 1 α+µ
0), the equilibrium spread will be set at the lowest value which makes the U N strategy break off - the smallest solution on (0, 1) of the equation EUD (s|U N, α, σ, c, ∆H ) = 0: s∗U N (C) = min {s|EUHF T −M (s|U N, α, σ, c, ∆H ) = 0} 0 ∆C H 2 ∗ (1+s ) IR(C) α ∗ − c, ∆H ≤ ∆C H µ+α exp (−µ∆H ) × (1 − sIR(C) ) 2 α µ+α
(19)
Generally, as both the spreads s∗U N and s∗IR are decreasing in ∆H and the probability of a liquidity trader arriving first also has a similar behaviour, this would lead to lower welfare as market speed rises. [Figure 3 here] At the threshold point ∆C H , when HFT-M starts monitoring, there are 2 effects on welfare, summarised in the table below:
18
Effects of Trading Speed on Welfare (Competitive Case) at ∆H = ∆C H No.
Agent
Effect on Welfare
Influence (+/-)
Total Effect
1
LFT
lower trade probability
W elf are &
Negative
2
HFT-M
monitoring costs
W elf are &
Negative
According to the results in Figure 3 robust to various other parameter specifications, welfare drops as we decrease latency. At the threshold ∆C H where the competitive dealer switches the strategy, there is a downward jump in welfare due to monitoring costs.
4
Market Latency with a Monopolist Market Maker
The analysis in sections 3 focused on the case of a competitive market maker. In this section, we show that the same qualitative results hold if we allow for HFT-M to have market power and earn positive expected profits. We are considering the opposite polar case, that of a monopolist HFT-M, than can set s in the quoting stage of the original game (T = 1) to maximise its profits.
4.1
Optimal Spreads with a Monopolist HFT-M
The market-maker will select the monitoring strategy as well as the spread that maximises his profit. Conditional of choosing any of the IR or U N strategies (note that the others are still dominated under the results in lemmas 2 to 3), he will choose the corresponding spread that maximises the expected utility. Note that the expected utility functions are strictly concave and, for all un-dominated strategies, the expected utility for s = 1 is larger than the expected utility for s = 0 (zero profits from liquidity traders, but a smaller loss if adverse selection occurs). Hence, searching for the maximum of the utility functions on s ∈ (0, 1) it will occur either at an interior point, when the first order condition is zero, or at s = 1. If the maximum is at s = 1, then the maximum utility is negative (only adverse selection losses). Solving the first order conditions of the utility functions, we find that: s∗IR(M ) =
2−A 1 ; s∗U N (M ) = 4 (1 − A) 2 (1 − A)
(20)
Lemma 7. All interior optimal half-spreads s∗(M ) are decreasing in the HFT latency, ∆H and in the probability of news, α. Proof. First derivatives of s∗(M ) with respect to ∆H and α are strictly negative, respectively positive.
19
Lemma 8. The level of the half-spread, for a given ∆H and α, is always larger for the UN strategy than for the IR strategy. That is, s∗U N (M ) > s∗IR(M ) , for any given ∆H and α. Proof. Immediate mathematic calculation.
4.2
Equilibrium
In equilibrium, HFT-M compares the maximum payoff he can get from each strategy (U N , IR or stopping the game) and sets the optimal quote for that particular strategy, as a function of the primitive parameters α, σ, ∆H , µ and c. For ∆H → ∞ (intuitively, the probability of a liquidity trader arriving first is equal to 1), the U N strategy yields the maximum payoff (basically there is no adverse selection risk, nor any successful opportunity for the dealer to withdraw the quote, were he to wish so). As speed increases, the gains from monitoring also rise to offset the adverse selection risks. For very low latencies, the risk of being picked off is so large that the HFT-M cannot set any feasible spread (s < 1) and the market breaks down. The result is stated in the following lemma: 3α , the market breaks down as both the optimal spreads for the Lemma 9. For ∆H < µ1 ln 2(α+µ) UN and IN strategy are larger than 1 - and attract no LFT demand. Proof. Immediate calculation shows that for A ≥ that
s∗U N (M )
> 1. Since
∂A ∂∆HF T
2 3
we have that s∗IR(M ) > 1 and for A ≥
< 0, any latency lower than the threshold ∆HF T =
1 2 A−1
we have 2 3 will
render both optimal quotes Computing the inverse function of A yields the mentioned unfeasible. 1 3α latency threshold of µ ln 2(α+µ) . 4.2.1
Equilibrium Dealer Discrete Strategy Choice
Proposition 3. In the case of a monopolist market maker, the IR strategy is optimal if and only if the following condition holds: 2A2 − 16c (1 − A) + 8σA (1 − A) ≥ 0
(21)
For ∆H → ∞, the UN strategy is always optimal for HFT-M. The condition (21) is monotonically relaxed as ∆H decreases. Hence, there can be at most one latency switching point ∆M H below which monitoring becomes optimal. The existence condition of such a threshold requires that α is large enough relative to µ: 4
α µ−α + 16c + 8σ >0 α+µ α+µ 20
If the condition holds with equality, then HFT-M is indifferent between the UN and IR strategies and any mixed strategy between the two. Proof. See appendix A We find in equilibrium is that for lower market speeds, the market-maker always starts by not monitoring and never trying to withdraw his quotes. The adverse selection risk increases with lower latencies, which HFT-M accommodates through setting a higher spread. At some speed threshold ∆M H , the adverse selection risk becomes so large that the market-maker becomes willing to monitor the news. At this speed, the probability he will trade with a LFT conditional on news becomes so low that the monitoring costs are lower than the expected losses he will make from being adversely selected. Switching from UN to IR - Market Maker’s Tradeoff Gains for Dealer
Losses for Dealer
LFT arrives first: (1 − s) s %
monitoring costs c
News
arrives
first: News
P (trade − HF T − B) &
arrives
first:
P (trade − F ) &
As the equilibrium spreads are lower for IR, the market maker will earn more in expectation conditional he meets a LFT. (s(1 − s) is decreasing in s for s ≥ 12 , the frictionless monopoly quote). The equilibrium results for the HFT-M’s choice are presented in Figures 4. [Figure 4 here] In the terminology of our game, the optimal strategy of HFT-M has the following form: (U N, s∗ ) UN HF T − M → (IR, s∗ ) IR
4.2.2
∆H > ∆M H (α, c, σ, µ) ∆H ≤ ∆M H (α, c, σ, µ)
(22)
Comparative Statics for ∆M H
The speed threshold ∆M H is decreasing in the market volatility parameters α and σ and increasing in the information costs c. This is intuitive: in less volatile markets, information asymmetries manifest with a lower frequency which makes paying the information costs suboptimal unless the loss conditional on being adversely selected increases (with market speed). In a similar fashion, lower costs to obtain information make the dealer monitor even when the adverse selection risk is not as high (for a higher ∆M H ). 21
4.2.3
Welfare
The welfare benchmark we are considering is the situation where all private values are realised, as in the competitive model. However though, we need to change the benchmark from the first best to a frictionless monopoly, as we are increasing the market power of the dealer, which is bound to reduce welfare. Without the adverse selection problem, the dealer will simply maximise his monopoly profit and set a spread of
1 2
(solve max s(1 − s) =⇒ s∗M on = 12 ). In this case, a liquidity trader will accept
the terms with probability 1+s∗M on 2
=
3 4.
1 2
and the expected value of his private value, in absolute terms will be
Hence, the monopoly welfare is given by: W elf areM onopoly =
1 1 + s∗M on × = 0.375 2 2
(23)
This value is the benchmark we should use to analyse the effect market speed is having on welfare through adverse selection rationales. In our model, welfare will be given by:
W elf areM M odel
µ + µ+α = µ + µ+α
(1+s∗U N (M ) ) exp (−µ∆H ) × (1 − s∗U N (M ) ) , ∆H > ∆M H 2 ∗ (1+s ) IR(M ) α ∗ − c, ∆H ≤ ∆M H µ+α exp (−µ∆H ) × (1 − sIR(M ) ) 2 α µ+α
(24)
Again, both the spreads s∗U N (M ) and s∗IRN (M ) are decreasing in ∆H and the probability of a liquidity trader arriving first increases in ∆H : which lead to lower welfare as market speed increases, at least as long as the dealer does not switch strategies (see figure 5). At the threshold point ∆M H , when HFT-M starts monitoring, there are 3 effects on welfare, summarised in the table below: Effects of Trading Speed on Welfare (Monopoly) at ∆H = ∆M H No.
Agent
Effect on Welfare
Influence (+/-)
Total Effect
1
Liquidity
lower spreads
W elf are %
Positive (1+2)
2
Liquidity
lower trade probability
W elf are &
Positive (1+2)
3
Dealer
monitoring costs
W elf are &
Negative (1+2+3)
The lower spreads benefit is stronger that the loss from lower trade probability with the liquidity traders, which generates positive welfare for the liquidity traders when the dealer starts monitoring. 22
However, after we include the dealer’s costs, the total effect is negative, which leads to an even lower welfare after monitoring becomes optimal for the dealer.
4.3
Model Predictions
For both the perfect competitive and monopolistic market maker, the model’s equilibrium spread (for both the competitive and monopoly settings) is increasing in the HFT speed parameter ∆H , as well as the intensity of news α. Also, the effect of market latency on the model equilibrium spread is stronger if σ is larger. In the model, there is no other friction determining the spreads apart from the information asymmetry. Hence, our predictions are formulated in terms of the adverse selection component of the spread: 1. A drop in market latency will result in larger adverse selection costs on limit orders submitted by HFT: ∆H ↓−→ s ↑ 2. The intensity of news arrival results in larger adverse selection costs: α ↑−→ s ↑ ∂s 3. The effect of a drop in market latency is larger for more volatile stocks: ∂∆ is increasing in H σ (see proof of Lemma 5)
5
Empirical Strategy
5.1
Dataset
Trade Data The trade data for this project is provided by the European Multilateral Clearing Facility (henceforth EMCF) and consists of detailed individual trade information on equities from Sweden, Denmark and Finland. The period spanned by our dataset is of 1 year, from September 1st , 2009 to September 10th , 2010. Due to the fact that up to October 19th , 2009 only a small part of the trades were cleared through EMCF (who launched mandatory CCP services from that point onwards), we have decided to exclude from analysis the first month covered as unrepresentative, and focus on the remaining 11 months. As a central counterparty institution, EMCF stands ready to become a third party in all the trades - by buying the security from the original seller and then selling it to the original buyer. This is known as novation process, in which 2 contracts (between EMCF and both parties) are created instead of a single one between buyer and seller. Hence, data on all executed trades is available at EMCF, including agency stamps: whether one of the original parties executed the trade for a client
23
or for its own account. More information on the EMCF central counterparty operation is included in the Internet supplements of this paper 6 . The dataset contains information on approximately 70 million trades, including date and time stamps 7 , trader ID (anonymised), transaction price, quantity, sign of the transaction, trading platform and whether the trade was executed on its own account or for a client (principal trade or agent trade). Each of the trades takes place either on NASDAQ OMX (the incumbent market prior to 2007) or on one of the new (entrant) markets, such as Chi-X, BATS Europe, NASDAQ Europe, Burgundy or Quote MTF. For a snapshot of the dataset and the type of variables we use in this paper, see Appendix D. Order Book Data Information on the available quotes for all the stocks in the sample on all markets is collected from the Thomson Reuters Tick History database, through SIRCA. At the millisecond level, there are approximately 2.2 billion data points. This represents information on the top of the order book for all exchanges: best bid and ask prices and the quantities demanded/supplied. Since the trade data is available at the level of seconds, we have selected the last quote in each second to match the trade and order book datasets. Complementary Data
Information from the main EMCF dataset is complemented by metadata
on each of the securities, obtained from Datastream - number of shares outstanding and ISIN codes, as well as daily exchange rates between Euro and Swedish/Danish Krona. All trading prices are converted to Euros at the daily exchange rate to provide comparability across stocks. Data on intraday market volatility (on the OMX Nordic 40 index high and low daily prices) is obtained via the Thomson Reuters Tick History (TRTH) system, for the days in the EMCF sample. The universe of our sample consist of 226 traders being active in 242 stocks. The average trader is present on the market in 157 out of 228 days and trades in about a third of the available stocks. The average trade value over the full dataset was approximately 20.62 thousand Euro. Variables We aggregate the data up to three hierarchical levels: first, we build a stock-day panel with information on average price, daily volume, daily volatility, market capitalisation and stock fragmentation and effective spreads on incumbent and entrant markets. Then, for each stock-day, we look at the trader IDs which were active in that particular security (third aggregation level). We build thus a multi-level panel with 3 dimensions: a stock-day panel extended in the cross-sectional dimensions of traders. In the remainder of the paper, we index days by t, securities by i and traders 6 7
See document http://db.tt/l03D3h2V converted to standard GMT, including hours, minutes and seconds
24
by j. The list of variable definitions and comments on their measurement are provided in Appendix C. There we also present a table with variable short-names, for ease of exposition. The final stock-day-trader-agency multilevel panel has approximately 1.7 million observations.
5.2
Measurement and model specification
Measuring the model spread Absent any inventory or order processing costs, the positive spread in our model stems only from the adverse selection component (and dealer’s market power in the monopoly setting). Hence, the dependent variable for testing is the adverse selection component of the spread, computed as in Hendershott, Jones, and Menkveld (2011) and Hoffmann (2010). For each trade, define: pt is the transaction price and mt is the prevailing midpoint at the transaction time; the sign indicator qt takes the value qt = 1 for buys and qt = −1 for sell transactions (taking the market taker perspective). The effective spread then is defined as the percentage deviation from the midpoint: ES = qt
pt − mt mt
(25)
Assuming the market maker will close his position on average in∆ = 5 min, we decompose ES in an adverse selection and a realised spread component. The adverse selection component then measures whether and by how much the price moved against the quote submitter in the period immediately following the trade: ES = qt |
pt − mt+∆ mt+∆ − mt + qt mt m {z } | {z t } AS
(26)
RS
Volatility Volatility is measured by 2 variables: we capture the dynamics of systemic risk by the daily range based volatility of OMX Nordic 40 Index (σtM kt ) and the cross-section of risk by the idiosyncratic risk for each security (σiID ). The idiosyncratic risk is computed as the estimate of residual variance from regressing daily stock returns on daily index returns in the year following the event (February 2010-February 2011). Identification strategy We use the introduction of the INET technology on NASDAQ OMX on February 8, 2010 as an instrument for an exogenous change in market speed. The latency dropped ten times, from 2.5 ms to 250 µs. The market speed jump is captured by a time dummy DEvent , which takes value 1 after February 8, 2010.
25
To allow for heterogenous effects between high- and low- frequency traders (for identification details, see subsection 6.2), we define a new dummy variable: DHF T , which takes value 1 for HFT accounts and 0 otherwise. Econometric Model The benchmark model is a fixed-effects panel linear regression, estimated by least squares. We regress the adverse selection component of the spread (expressed in bps) aggregated across stocks, traders and days on event dummies, HFT dummies, their interaction, volatility variables and stock-specific fixed effects. We choose the most conservative standard errors, double-clustered at stock and day level, following the methodology in Petersen (2009). The equation is given by:
LF T HF T ASijt = β0 DEvent + β1 DEvent + β2 DHF T + β3 σtM kt + β4 σiID DEvent + β5 log V oltrad + δi + εijt (27)
Hypotheses We test the following hypotheses: 1. H01 : β1 > 0 and H02 : β0 > 0. The adverse selection component of the bid-ask spread increases once the market latency drops, both for high and low frequency traders. This is the main result of Lemmas 5 and 7 (under different competition assumptions, it holds that
∂s∗ ∂k
>0)
2. H03 : β3 > 0. The adverse selection component of the bid-ask spread increases in more volatile periods (in the model,
∂s∗ ∂α
> 0 - Lemma 7 ).
3. H04 : β4 > 0. The effect of market speed on adverse selection is larger for riskier stocks. Under both competition settings, the marginal effect of speed on spreads increasing in α:
6
∂ 2 s∗ ∂k∂α
> 0.
Results
6.1
Summary Statistics
The volume-weighted means of adverse selection spread components for all trades are reported in Table 1, separately for the periods before and after INET was introduced on NASDAQ OMX. [Table 1 here] Plotting the distribution of adverse selection averages for each stock, day and trader in the sample on NASDAQ OMX, before and after the INET was introduced, we observe that the centre of 26
the probability mass shifts to the right in the second part of the sample. This finding is consistent with larger mean adverse selection costs which are not due to an increase in the number of ”tail” adverse selection events (which would be the case for instance if the period after INET would have included days with extraordinary volatility in some stocks at least). [Figure 6 here]
6.2
High Frequency Traders Identification
To identify high frequency traders in our dataset, we follow Kirilenko, Kyle, Samadi, and Tuzun (2011) and, for each stock and day in our sample, we highlight the trader accounts who simultaneously fulfil the following 3 conditions across all markets. Then, we compute a ”HFT ratio” by dividing the number of stock × days when a particular trader behaved as a HFT to the total number of appearances in the sample. 1. The account traded more than 10 contracts on a given stock in a given day. 2. The average of the absolute value of the end-of-day net position, expressed as a fraction of the account’s total trading volume for the day is not more than 5% 3. The average of the square root of the sum of squared deviations of the minute-end net contract holdings from the net contract holdings at the end of the day, expressed as a fraction of an account’s total contract trading volume during that day, is not more that 1.5%. ”Contract holdings” are defined as the net number of contracts bought or sold from the beginning of the day until the end of the minute for which the calculation is made. We find in total 7 trader accounts that have a pronounced HFT profile compared to the mass of traders. Among those, only 4 accounts trade on NASDAQ OMX, while 3 use exclusively alternative markets such as Chi-X or BATS Europe. The identified HFTs account for 5.43% of the total NASDAQ OMX volume (denominated in Euro), with one particular account being strongly dominating (4.91% of the total NASDAQ OMX volume and the third trader in the market by volume, as well 90% of the total HFT volume). [Figure 7 here]
27
6.3
Estimation Results
Table 2 shows the results of the empirical analysis, estimating the equation:
LF T HF T ASijt = β0 DEvent + β1 DEvent + β2 DHF T + β3 σtM kt + β4 σiID DEvent + β5 log V oltrad + δi + εijt (28)
We consider different symmetric estimation windows around the INET implementation: 2 months (December 8, 2009 - April 8, 2010), 3 months (November 8, 2009 - May 8, 2010) and 4 months (October 19, 2009 - June 8, 2010). [Table 2 here] The empirical findings are summarised below: 1. We find that the drop in market latency has a positive significant effect on adverse selection of approximately 0.4 bps (approximately 7%). Without distinguishing between high- and lowfrequency traders, this effect is strongly significant for all windows considered, in specification with or without other control variables. 2. When allowing for separate effects between high frequency and low frequency traders, we find an increase in the adverse selection spread component for LFT of approximately the same magnitude (0.4 bps), which is significant and persistent across all time windows considered (β1 > 0). 3. For the HFT, the effect is also significant and positive for 2 months after the event date (about 0.55 bps), but it declines as we move further away from the event date - while becoming statistically insignificant 4 months after the implementation date. This could point to monitoring technology adjustments. 4. We find a positive and significant relationship between adverse selection and the volatility of the index. A standard deviation increase in market volatility leads to a 0.27-0.36 bps increase in the adverse selection component of the spread. We find thus empirical support for H03 . 5. The increase in the adverse selection is larger for securities with larger idiosyncratic risk. A standard deviation increase in idiosyncratic risk leads to a 0.44 bps additional market latency effect on adverse selection spread components. We find thus empirical support for H04 . 6. Before the INET implementation, HFTs have much lower adverse selection costs than LFTs: between 3-3.5 bps, very strongly significant and consistent across all model specifications. 28
Placebo Analysis We test the relationship between the adverse selection spread component and the event date for the other markets Nordic securities are traded on (the largest of which are Chi-X Europe and BATS). These constitute a placebo group, as there was no similar speed improvement on any other market during the same period.The results for a symmetric 2-months event window around the event date are presented in Table 3. Note the event coefficients are no longer statistically significant, regardless if we consider HFT and LFT separately or not. [Table 3 here]
7
Concluding Remarks
This paper studies the effects on adverse selection and information asymmetries of the lower latency technologies implemented by trading venues. We focus on symmetric and exogenous technology improvements across HFT traders, regardless whether they act as market-makers trying to capture the spread or as speculators earning profits on short-term price trends. For empirical identification, we use a natural experiment in the Nordic markets. In February 2010, NASDAQ OMX implemented the INET technology, which reduced round-trip latencies tenfold. We find that adverse selection increased by 15% on NASDAQ OMX following the latency drop, after controlling for market volatility, volume and realised spreads. On the other trading venues, adverse selection dropped in the post-event period, which might indicate the migration of informed speculators from the slower to the faster market. To explain our results, we develop a costly-monitoring theoretical model of the limit order market. We find that lower market latencies lead to most market interactions to take place between high frequency traders. This phenomenon, the “crowding-out” of low-frequency (liquidity) traders results in larger adverse selection risks. Consequently, wider spreads are set to compensate the extra risk, but also market-makers will improve their monitoring levels (when they do, spreads actually drop in equilibrium). As the latency is reduced indefinitely, there are no more monitoring investments to be undertaken by market-makers or speculators. Trading becomes increasingly more a zero-sum game between high-frequency traders, leading to a lower trade to quotes ratio (due to quote withdrawals) and lower welfare gains, as liquidity traders get to realise their private values less often. An alternative policy to a symmetric latency drop between limit and market orders, stipulating that only the limit orders should benefit from lower latencies will reduce the adverse selection risks and result in tighter spreads, as the speculators no longer benefit from the faster trading.
29
Table 1: Adverse selection spread components on NASDAQ OMX. We report volume-weighted means of AS components for all trades in Nordic markets. The interquartile range is provided in parantheses. Window
Before INET
After INET
∆(%)
Panel A: Adverse Selection Spread Component (bps) 1 month 2 months 4 months
2.59
(2.32−2.79)
2.42
(2.16−2.69)
2.60
(2.17−2.89)
2.71
4.63%
2.62
8.26%
2.73
5.38%
(2.26−3.01) (2.16−2.95) (2.20−3.19)
Panel B: OMX Nordic 40 Daily Volatility 1 month 2 months 4 months
1.04%
0.94%
(0.72−1.30)
(0.64−1.13)
0.88%
0.88%
(0.63−1.08)
(0.62−1.06)
0.98% (0.66−1.33)
0.98
−9.61% −0.01% −0.05%
(0.63−1.31)
Panel C: Average Daily Volume Traded (EUR million) 1 month 2 months 4 months
2605.63
(2314.35−2672.85)
2120.95
(1632.14−2508.74)
1956.77
(1631.30−2283.53)
2215.54
−14.97%
2226.99
4.99%
2577.56
31.72%
(1973.25−2282.24) (1999.90−2440.97) (2075.13−2899.43)
30
Table 2: Adverse selection costs in basis points is regressed on event dummies/ event dummies interacted with the trader type (HFT / LFT). In several specifications, we also allow for the INET effect to vary in the cross-section of stocks with the idiosyncratic volatility of the security. Multiple event windows are considered: from 2 months to 4 months around the INET implementation. Volatility and volume measures are standardized to have mean zero and variance one. We use double-clustered standard errors (as in Petersen (2009)) and stock-specific FE. Panel A: NASDAQ OMX (2 months around event date) Variable
(1)
(2)
(3)
(4)
(5)
LF T DEvent
−
−
0.372∗∗
0.562∗∗∗
0.551∗∗∗
HF T DEvent
−
−
0.558∗∗
DHF T
−
−
−3.625∗∗∗
DEvent
0.351∗∗
0.525∗∗∗
−
−
−
σtM kt
−
0.257∗∗∗
−
0.274∗∗∗
0.254∗∗
σiID × DEvent
−
−
−
log V oltrad
−
−0.913∗∗∗
No. Obs.
288909
288909
2.22
2.28
2.47
3.43 2.92
−19.14
3.49 0.767∗∗∗ 3.17 −3.241∗∗∗ −16.82
2.92 0.441∗∗∗ 4.61 −0.832∗∗∗ −7.65
−8.27
288909
288909
3.47
0.754∗∗∗ 3.29
−3.145∗∗∗ −16.3
2.89
− −0.862∗∗∗ −7.79
288909
Panel B: Window around event - 3 months Variable
(2)
(3)
(4)
(5)
−
− −
−
−
DEvent
0.419∗∗∗
0.532∗∗∗
0.561∗∗∗ 4.04 0.644∗∗ 2.64 −3.339∗∗∗ −17.51
0.553∗∗∗
−
0.435∗∗∗ 2.99 0.452∗∗ 2.16 −3.718∗∗∗ −19.71 −
−
−
σtM kt
−
0.342∗∗∗
−
0.341∗∗∗
0.338∗∗∗
σiID × DEvent
−
−
−
log V oltrad
−
−0.857∗∗∗
−
No. Obs.
446819
446819
446819
LF T DEvent HF T DEvent DHF T
(1)
2.97
4.05 4.61
−9.06
4.51 0.449∗∗∗ 4.08 −0.791∗∗∗ −8.59
446819
4.05
0.637∗∗∗ 2.91
−3.294∗∗∗ −17.17
4.57
− −0.806∗∗∗ −8.57
446819
Panel C: Window around event - 4 months Variable
(1)
(2)
(3)
(4)
(5)
LF T DEvent
−
−
0.417∗∗∗
0.446∗∗∗
0.443∗∗∗
HF T DEvent
−
−
−0.144
−0.023
−0.039
DHF T
−
−3.442∗∗∗
−3.104∗∗∗
−3.002∗∗∗
DEvent
0.394∗∗∗
0.408∗∗∗
−
−
−
σtM kt
−
0.366∗∗∗
−
0.366
0.362∗∗∗
σiID × DEvent
−
−
− 31
0.434∗∗∗
−
log V oltrad
−
−0.862∗∗∗
−
−0.784∗∗∗
−0.808∗∗∗
No. Obs.
566841
566841
566841
566841
566841
2.97
3.11
−0.59
3.34 5.74
−10.16
−17.25
3.55
−0.09
−15.57
5.71
4.52
−9.53
3.56
−0.16
−15.02
5.72
−9.61
Table 3: Placebo Analysis: Adverse selection (bps) on alternative markets (other than NASDAQ OMX) is regressed on event dummies/ event dummies interacted with the trader type (HFT / LFT). In several specifications, we also allow for the INET effect to vary in the cross-section of stocks with the idiosyncratic volatility of the security. We consider a 2 months window around the INET implementation. Volatility and volume measures are standardized to have mean zero and variance one. We use double-clustered standard errors (as in Petersen (2009)) and stock-specific FE. Variable LF T DEvent HF T DEvent
DHF T DEvent
(1)
(2)
(3)
(4)
(5)
0.189
0.184
0.176
−0.171
−0.212
−0.184
−0.255
−1.636∗∗∗
−1.576∗∗∗
−
−
−
1.25
−0.87
−1.22
−0.007 −0.01
0.167 1.11
1.13
−0.89
−6.74
1.14
−0.77
−6.53
σtM kt
0.277∗∗∗
0.328∗∗∗
0.273∗∗∗
σiID × DEvent
−
−0.032
−
log V oltrad
−0.487∗∗∗
−0.468∗∗∗
−0.452∗∗∗
1.13% 141853
0.67% 141853
R2 No. Obs.
2.44
2.98
−0.13
−4.6
1.06% 141853
1.1% 141853
−4.23
1.11% 141853
32
2.41
−4.29
33
end game
quotes s HFT-M not monitor (pays 0)
HFT-M
quotes s monitors (pays c) HFT-M
end game
Pre-Trading
HFT-B executes market order
HFT-M withdraws quotes
∆H News
LFT executes market order
∆H
∆H
LFT executes market order
News
News
Figure 1: Model Timing
(IV)
(III)
(II)
(I)
News
Trading Outcomes
HFT-M Utility 0.15 Cannot profitably undercut UN (competition proof) 0.10
0.05
0.2
0.4
0.6
0.8
1.0
Half Spread
Strategy UN is profitable at lower spreads than IR break-even spread.
-0.05
-0.10
IR Strategy UN Strategy
(a) Dealer’s Utility Functions - ∆H = 1.5
HFT-M Utility 0.05
Cannot profitably undercut IR (competition proof)
0.2
0.4
0.6
0.8
1.0
Half Spread
-0.05 Strategy IR is profitable at lower spreads than UN break-even spread.
-0.10
-0.15
-0.20
IR Strategy UN Strategy
(b) Dealer’s Utility Functions - ∆H = 0.5
Figure 2: Dealer’s utility functions for the UR and IR strategies. We take α = 0.2, µ = 0.65, σ = 1.4 and c = 0.07. Note that in the first panel, with34lower market speed (∆H = 1.5), the break-even spread from the informed strategy can always be profitably undercut by choosing not to acquire information. As the latency drops (second panel, ∆H = 0.5), the situation is reversed and the competition-proof spread becomes the break-even spread of the informed (IR strategy).
Optimal Half-Spread 0.7
0.6
0.5
Switch to monitoring (monopoly)
0.4
0.3 Switch to monitoring (competition) 0.2
Market breakdown (competition)
15
10
0.1
Market Latency HInverted ScaleL
5
Bertrand Competition Monopolist HFT-M
(a) Comparison of equilibrium half-spreads against market latency for competitive and monopoly market-makers (Parameter calibration: α = 0.2, µ = 0.25, σ = 1.25 and c = 0.025)
Welfare 0.5
0.4
Switch to monitoring (competition)
0.3 Switch to monitoring (monopoly)
0.2 Market breakdown (competition)
15
10
5
Market Latency HInverted ScaleL
Bertrand Competition Monopolist HFT-M
(b) Comparison of welfare against market latency for competitive and monopoly market-makers (Parameter calibration: α = 0.2, µ = 0.25, σ = 1.25 and c = 0.025))
35 Figure 3: Equilbrium Spreads, Welfare and Market Speed in the competitive and monopoly market-makers environments
Dealer Utility
0.20
0.15
Strategy Switch Threshold for HFT-M 0.10
0.05
15
10
5
2
Market Latency HInverted ScaleL
UN Strategy IR Strategy
(a) Low News Intensity (α = 0.2, µ = 0.25, σ = 1.25, c = 0.025)
Strategy Switch Threshold for HFT-M
Dealer Utility
0.20
0.15
0.10
0.05
15
10
5
2
Market Latency HInverted ScaleL
-0.05
-0.10
UN Strategy IR Strategy
(b) High News Intensity (α = 1.0, µ = 0.25, σ = 1.25, c = 0.025)
Figure 4: HFT-M’s choice between strategies in the monopoly case: expected utilities as a function of market latency. We also include different news intensity (α) regimes to illustrate the relationship between the strategy switching point and the frequency of news. 36
Half-Spread 0.62
0.60
Market-maker starts monitoring: spread drops 0.58
0.56
0.54
15
10
5
2
Market Latency HInverted ScaleL
alpha=0.2, c=0.025 alpha=0.3, c=0.025 alpha=0.2, c=0.05
(a) Optimal equilibrium spreads
Market-maker starts monitoring
Welfare
0.35
0.30
0.25
15
10
5
2
Market Latency HInverted ScaleL
alpha=0.2, c=0.025 alpha=0.3, c=0.025 alpha=0.2, c=0.05
(b) Welfare
Figure 5: Equilibrium spreads and welfare under monopolist HFT-M for various news intensities 37 and information cost parameters
Figure 6: Distribution of adverse selection averaged at stock-day-trader levels. The blue solid line corresponds to the distribution before February 8, 2010 whereas the red dotted line corresponds to the post-event distribution. Note the shift to the right of the probability mass following the INET event.
38
Figure 7: HFT Profiles and Trading Aggressiveness. Each dot stands for a trader account. HFT profiles (between 0 and 1) are measured as the proportion of the number of stock × days an account simultaneously passes the three Kirilenko, Kyle, Samadi, and Tuzun (2011) HFT criteria in the total number appearances in the sample. Aggressiveness is simply measured as the proportion of market order executed volume in total traded volume, for each trader account. The size of the dots is proportional to the Euro-denominated volume traded on NASDAQ OMX by each particular agent.
39
A
Proofs of Lemmas and Propositions
Lemma 1 Proof. It can be proven that the increments of M (t) and N (t) have the same distribution. From the properties of the Poisson processes, we can write for any s < t and k ∈ {0, 1, 2...}: µk (t − s)k P N 0 (t) − N 0 (s) = k = exp (−µ (t − s)) k!
(29)
Similarly, for M (t), we have: µk (t − s)k P {M (t) − M (s) = k} = P N 0 (t − ∆L ) − N 0 (s − ∆L ) = k = exp (−µ (t − s)) k!
(30)
The previous relations holds because the interval (s − ∆L , t − ∆L ) has the same length as (s, t):
µk (t − ∆L − s + ∆L )k P N 2 (t − ∆L ) − N 2 (s − ∆L ) = k = exp (−µ (t − ∆L − s + ∆L )) k!
(31)
Lemma 2: Part 1 Proof. If the bandit does not monitor the news (strategy UN) or monitors the asset value but does not submit a market order (strategy IN), he will earn an expected payoff of 0 (no trade and no monitoring cost). It is enough then to show that for s < σ, the expected payoff from strategy IR is larger than zero. If the market-maker also monitors the news, the payoff of HFT-B is given by the following expression: ΠHF T −B =
1 α × exp −µ∆H (σ − s) > 0 2 α+µ
(32)
The expression (σ − s) is the profit conditional on arriving first to the market. The probability of the HFT-B arriving first to the market is given by 3 components:
α α+µ
- the probability there is a
value jump before an LFT consumes the quote; exp −µ∆H - the probability that there are no LFTs arriving in the interval from observing the jump to market arrival and
1 2
- the probability HFT-B
arrives before HFT-M. If the market-maker does not monitor the news (or does not rush to market after a jump), the payoff of HFT-B is as follows: 40
ΠHF T −B =
α exp −µ∆H (σ − s) > 0 α+µ
(33)
The components of the profit are the same as before, except that the probability of HFT-B arrives before HFT-M is now equal to 1 rather than 12 . Hence, IR a strictly dominant strategy for the bandit. Part 2 If s > σ, we see from the proof of Part 1 that the profit from the strategy IR is negative, regardless of HFT-M’s strategy. Hence, it is optimal for the bandit to never act on information, as the spread is larger than the potential benefit from the stale quote.
Lemma 3: Part 1 Proof. With strictly positive monitoring costs, the strategy IN is strictly dominated by U N . This is natural, since paying for information and not acting contingent on the value jumps yields a lower payoff than not paying for information, with the difference being exactly the cost of obtaining the information. Formally: EUHF T −M [IN ] = EUHF T −M [U N ] − c Part 2 We consider 2 possible cases: s ∈ [1, σ]and s ≥ σ, making use of the fact that σ > 1. Note that for s ≥ 1, there are no liquidity traders willing to trade. If s ∈ [1, σ] and there are news, then the bandit will rush to the market. If the bandit reaches the market first, the dealer has a negative payoff s − σ < 0, whereas if the market-maker is first to the market, he gains 0. No trade with liquidity agents will occur now, and the market-maker is (weakly) worse off than ending the game. If s > σ, both the bandit and the liquidity traders will stay off the market and HFT-M’s payoff is 0. If s = 0, the gain he makes from liquidity traders is zero, whereas the losses he can incur from the bandit are maximised (−σ). Hence, in order for the dealer to earn a positive payoff, it should always set the half spread in the interval (0, 1), which is the conclusion required. Lemma 5.
41
Proof. We start with the optimal spread for the uninformed strategy (s∗U N (C) ). To show this function is decreasing in ∆H it is enough to show that ∂s∗U N (C) ∂∆H
=
∂s∗U N (C) ∂A(∆H ,·)
∂s∗U N (C) ∂A (∆H , ·)
> 0, since by the chain rule we have that:
×
∂A (∆H , ·) ∂∆H | {z } 0 is equivalent to showing: p 1 − 4A (1 − A) σ − 1 + 2 (1 − A) σ > 0
If the U N strategy yields positive utility, we have that: s∗U N (C) < 1 ⇐⇒ 2 (1 − A) > 1 −
p 1 − 4A (1 − A) σ
Thus, we have that: p p 1 − 4A (1 − A) σ + 2 (1 − A) σ > 1 − 4A (1 − A) σ (σ − 1) > 0 which is true given our parameter restrictions. Next, we turn to the IR equilibrium spread. Similarly, we need to prove only that
∂s∗IR(C) ∂A(∆H ,·)
that is: ∂s∗IR(C) ∂A
Proving
q
2 A − 2 + 8c (1 − A) + 4σ (1 − A) + 2 1 − A − 4 (1 − A) 2 q = 2 8 (1 − A)2 1− A − 4 (1 − A) A 2 2σ+c
∂s∗IR(C) (k) ∂A
A 2σ
+c
> 0 is equivalent to showing: s
A − 2 + 8c (1 − A) + 4σ (1 − A) + 2
A 1− 2
2
If the IR strategy yields positive utility, we have that:
42
− 4 (1 − A)
A σ+c >0 2
>0
> 0,
s∗IRN (C)
s < 1 ⇐⇒ A − 2 + 2
A 1− 2
2
− 4 (1 − A)
A σ+c >A−1 2
(34)
Then, after some algebraic manipulation, we only have to show that: 8c (1 − A) + 4 (σ − 1) (1 − A) ≥ 0
(35)
which again is true given our parameter restrictions (we know σ > 1 and A < 1).
Lemma 6. Proof. We note first that the slope of the IR utility function is always larger than the slope of U N . Simple algebraic manipulation of the first derivatives for the expected utility functions yields: SlopeIR − SlopeU N
A A = 1 − − 2s (1 − A) − [1 − 2s (1 − A)] = − < 0 2 2
(36)
If EUHF T −M (s = 0|U N ) > EUHF T −M (s = 0|IR) and since the U N utility grows faster than the IR utility, then the smallest solution of EUHF T −M (s) = 0 will be for the uninformed, U N strategy. Hence, the U N strategy is optimal. The condition EUHF T −M (s = 0|U N ) > EUHF T −M (s = 0|IR) is equivalent to −σA > − σ2 A − c, which can be written as: c > σ2 A. This completes the proof. Proposition 2. Proof. For IR to be optimal it needs to hold that s∗IR(C) ≤ s∗U N (C) - the first root of IR utility function is smaller than the first root of U N expected utility. Since in the proof of lemma 6 we have shown that the U N utility is growing faster on the increasing section, the condition s∗IRN (C) > s∗U N (C) is equivalent with the condition that the IR utility is always above U N in the negative quadrants: EUHF T −M (s|IR) > EUHF T −M (s|U N ), ∀s < s∗IR(C) .
(37)
This condition is equivalent to:
EUHF T −M (s|IR) − EUHF T −M (s|U N ) ≥ 0 ⇐⇒
A (σ − s) − c ≥ 0 , ∀s ∈ 0, s∗IR(C) 2
The inequality above is monotonically decreasing in s, so if it holds for the largest s in the domain - s∗IR(C) , it will hold for all lower values of the half spread. The sufficient condition is thus: 43
A σ − s∗IR(C) − c ≥ 0 2
(38)
As s∗IR(C) is increasing in c (see definition), it is easy to see the condition is tightened for higher values of the cost (the LHS is decreasing in the monitoring cost). The condition states that: C IROptimal (α, µ, ∆H , c, σ) =
A (α, µ, ∆H ) σ − s∗IR(C) − c ≥ 0 2
(39)
We will prove that the condition is monotonically relaxed as ∆H decreases, under the assumption ∗ sIR(C) exists and it is a feasible strategy (0 ≤ s∗IR(C) ≤ 1). If that is the case and there exists a ∗ ∆C H corresponding to a sIR(C) for which the condition (39) holds with equality, it will hold for all ∆H < ∆C H . The first derivative of (39) is given by: C ∂IROptimal
∂∆H
" # " # ∂s∗IR(C) ∂A ∂s∗IR(C) 1 1 = −σµA + s∗IR(C) µA − A = µA s∗IR(C) − σ + µA2 2 ∂A ∂∆H 2 ∂A (40)
Since µ and A are strictly positive, proving s∗IR(C)
C ∂IROptimal ∂∆H
−σ+A
< 0 is equivalent to showing that:
∂s∗IR(C) ∂A
0. This imposes a lower bound s∗IR(C) =
on the cost term in
∂s∗IR(C) ∂A :
A 2 8 (1 − A) c ≤ 2 1 − − 4σ (1 − A) A 2
(43)
Hence, it holds that: ∂s∗IR(C) ∂A
≤
A−2+2 1−
A 2 2
+ 4σ (1 − A)2 + 2B
8 (A − 1)2 B
44
(44)
The condition s∗IR(C) > 0 imposes that: 2B < 2 − A, so we can redefine the upper bound from above: ∂s∗IR(C) ∂A
≤
1 2
(2 − A)2 + 4σ (1 − A)2 2
8 (A − 1) B
1 1 ≤ σ 2 B
The condition s∗IR(C) < 1 imposes that: 2B > 3A − 2, equivalent to:
(45) 1 2B
≤
1 3A−2 .
Hence, putting
these results together (and σ > 1), we have that:
s∗IR(C)
−σ+A
∂s∗IR(C) ∂A
A ≤1−σ+ σ =1−σ 3A − 2
2A − 2 3A − 2
≤
A 2 ≤ 0 , ∀A ≤ 3A − 2 3
Since we prove in Lemma 9 that the market breaks down for A ≤ 12 , the condition
C ∂IROptimal ∂∆H
(46)
≤0
holds in any non-trivial equilibrium situation when the market maker posts any quotes at all.
Proposition 3 Proof. The HFT-M expected utility functions for both IR and UN strategies, evaluated at the optimal spreads s∗IR(M ) and s∗U N (M ) are given by: EUHF T −M (IR) =
4 (1 − A) + A2 + 16c (A − 1) + 8σA (A − 1) 16 (1 − A)
(47)
4 + A2 (16σ − 1) − 16Aσ 16 (1 − A)
(48)
EUHF T −M (U N ) =
Hence, we have that EUHF T −M (IR) ≥ EUHF T −M (U N ) if and only if the following holds: M IROptimal (α, µ, ∆H , c, σ) = 2A2 − 16c (1 − A) + 8σ (1 − A) A ≥ 0
(49)
We note that the derivative with respect to the cost is negative - larger monitoring costs tighten the condition under which the informed strategy becomes optimal: M ∂IROptimal
∂c
= −16 (1 − A) < 0
The derivative with respect to ∆H is given by: M ∂IROptimal
∂∆H
=
M ∂IROptimal ∂A ∂A = (4A + 16c + 8σ (1 − 2A)) ≤0 ∂A ∂∆H ∂∆H
45
(50)
The previous holds for any A ≤ ∂A ∂∆H
1 2
- when the market does not break down, since we know that
< 0.
For ∆H → ∞ we have that the uninformed strategy is always optimal: M lim IROptimal = −16c < 0
∆H →∞
For ∆H → 0 we have that: lim
∆H →0
M IROptimal
α µ−α =− 4 + 16c + 8σ α+µ α+µ
µ
α If it holds that 4 α+µ + 16c + 8σ µ−α α+µ > 0 (for α sufficiently large relative to µ), there exists a M latency threshold ∆M H such that for all ∆H < ∆H the market maker chooses to be informed and for
all ∆H > ∆M H the market maker chooses to remain uninformed. The monotonicity of the optimality condition assures the uniqueness of this threshold.
B
Econometric Methodology Details
The models are estimated by Panel Ordinary Least Squares, by introducing a dummy variable for each stock (to control for stock fixed effect) and for each trader (controlling thus for trader intercepts). Since in our sample we have 242 stocks and 262 traders, this is equivalent to introducing 504 new parameters. However, due to the fact that we have over 1 million observations over time, this estimation is feasible from the point of view of the sufficiency of the degrees of freedom. Compared to random effects models, fixed effects linear models have the primary advantage of not making a strong exogeneity assumption of a random factor relative to the included regressors (see for instance Hsiao (2003) and Baltagi (2008)). If the panel is long enough, this approach yields consistent and efficient estimates provided the model is true, with very little assumptions. The least squares dummy variable approach (LSDV) is not a parsimonious model, due to the large number of parameters, but it is computationally easier than a maximum likelihood optimisation over more than 500 parameters. On the other hand, for models that have a limited dependent variable (such as the volume share sent to entrant markets), the OLS approach might yield inconsistent and biased estimates. Hence, maximum likelihood approaches are necessary for this type of variables.
46
C
Variable Definitions and Measurement
The list of variable short-names used throughout this paper is presented in the table below. This section describes how these variables are computed and provides relevant literature to motivate the choice for different measurements. Variable definitions and short labels Variable
Short Name
Definition
Levels
Adverse Selection Volatility Range (Absolute) Trade Imbalance Average Trade Size Agency Profile Average Price Market Capitalisation Aggressive Trading Effective Spread
AS σ (A)TI AvTr AgPr P MktCap AGG ES
Adverse Selection component of the bid-ask spread intraday volatility using high/low price (absolute) scaled difference buys - sells Volume divided by number of trades Ratio client volume / total volume Volume-weighted trading price Shares outstanding x average price market order volume in total volume volume-weighted half-spread per stock-day
SD, SDT SD SDT SDT SDT SD SD SDT SD
Security specific measures To the fragmentation metrics defined before, we define average prices and market capitalisation for a particular stock. The average price traded during the day is computed by taking all trade prices and weighting them by transactions volume; for a certain day t, with effective trading times τ , we have: P Pt =
τ ∈t (P riceτ
× Quantityτ ) τ ∈t Quantityτ
P
(51)
The market capitalisation of a stock (M kCap) is defined on a daily basis, by multiplying the number of stocks outstanding with the average price during that day. Measurement for daily return volatility There are 3 main measures for the intraday return volatility in the finance literature, as reviewed in Patton (2011): the squared daily returns, the realised volatility measure (seeAndersen, Bollerslev, Diebold, and Labys (2003)) and the intra-daily range (first proposed by Parkinson (1980)). The squared daily returns is the most naive option of the three, since it does not take into account the variation in prices during the day - it is possible that, after wide fluctuations, the closing price is not very different from the opening price, which would incorrectly lead to an estimated volatility smaller than the real value. The realised volatility measure is defined as the sum of n squared returns computed from 47
transaction prices over the day (RVt =
Pn
2 i=1 rit ).
This is an unbiased estimator of the true volatility
if the stock price follows a geometric Brownian motion (Patton (2011)) and has a lower variance than the squared return. If the grid we are sampling prices from is fine enough, the realised volatility comes arbitrarily close to the true volatility. However though, in the presence of microstructure noise, the observed price is not equal to the true price process (there is a bid-ask bounce, since some trades take place at the bid quote, others at the ask quote) - and the realised volatility measure can thus overestimate the true volatility - see Alizadeh, Brandt, and Diebold (2002). The volatility measure we choose to use in this paper is the scaled intra-daily range, defined as:
1
σt = p ln 2 ln (2)
supi {pit } inf i {pit }
(52)
The intra-daily range is considerably easier to compute - since we only need 2 prices for each day: the highest and the lowest one. This is a unbiased estimator of the true volatility, just like the realised volatility (the scaling factor √1 ensures unbiasedness under a geometric brownian 2
ln(2)
motion DGP for the prices). The efficiency is considerably higher than for daily squared return, and close to the efficiency of realised volatility, computed with a 2 hour sampling interval (Andersen, Bollerslev, Diebold, and Labys (2003)). Alizadeh, Brandt, and Diebold (2002) show that this measure is more robust to the presence of microstructure noise, thus being potentially a superior estimate in real-world situation. This measure is computed both at individual stock level, from the EMCF dataset, as well as market wide, using the OMX Nordic 40 index as a proxy for the Scandinavian markets.
48
D
Appendix: Snapshot of the dataset
Sample Data Snapshot. There are 8 possible platforms: XHEL (Nasdaq OMX Helsinki), XSTO (Nasdaq OMX Stockholm), XCSE (Nasdaq OMX Copenhagen), CHIX (Chi-X Europe), NURO (Nasdaq Europe), BATS (BATE Euope), QMTF (Quote MTF) and BURG (Burgundy). AGNT stands for client trade, whereas PRCP stands for principal (own trade). Dates are formated as 1yymmdd, to facilitate comparison across decades (2009-2010). Quantity Trader ID
Origin
Buy/Sell Currency Symbol
Maker/Taker
1090901 9.59
8300
150002
AGNT
S
EUR
NOKI
1
90014
1090901 9.57
724
140001
PRCP
B
EUR
NOKI
-1
101014
1101003 9.81
901
300001
PRCP
S
DKK
MAERS 1
Platform Time
Date
XHEL ... CHIX ... BATE
90001
Price
References Alizadeh, Sassan, Michael W. Brandt, and Francis X. Diebold, 2002, Range-based estimation of stochastic volatility models, The Journal of Finance 57, 1047–1091. Andersen, Torben G., Tim Bollerslev, Francis X. Diebold, and Paul Labys, 2003, Modeling and forecasting realized volatility, Econometrica 71, 579–625. Baltagi, B., 2008, Econometric Analysis of Panel Data (John Wiley & Sons). Baron, M., J. Brogaard, and A. A. Kirilenko, 2012, The trading profits of high-frequency traders, working paper, Un. Biais, B., T. Foucault, and S. Moinas, 2013, Equilibrium fast trading, working paper, Toulouse School of Economics. Brogaard, J., T. Hendershott, and R. Riordan, 2012, High frequency trading and price discovery, working paper, University of Washington. Foucault, T., 2012, Market Microstructure: Confroning Many Viewpoints (Wiley Finance). , J. Hombert, and I. Rosu, 2012, News trading and speed, working paper.
49
Foucault, Thierry, Ailsa Roell, and Patrik Sandas, 2003, Market making with costly monitoring: An analysis of the soes controversy, Review of Financial Studies 16, 345–384. Hagstromer, B., and L. Norden, 2013, The diversity of high-frequency traders, Available at SSRN: http://ssrn.com/abstract=2153272. Hasbrouck, J., and G. Saar, 2010, Low-latency trading, AFA 2012 Chicago Meetings Paper; Johnson School Research Paper Series No. 35-2010. Hendershott, T., C. M. Jones, and A. J. Menkveld, 2011, Does algorithmic trading improve liquidity?, Journal of Finance 66, 1–33. Hendershott, T., and R. Riordan, 2011, High-frequency trading and price discovery, working paper. Hendershott, Terrence J., and Albert J. Menkveld, 2011, Price pressures, WFA 2010 paper. Ho, Thomas S. Y., and Hans R. Stoll, 1983, The dynamics of dealer markets under competition, The Journal of Finance 38, 1053–1074. Hoffmann, Peter, 2010, Adverse selection, transaction fees, and multi-market trading, working paper pp. –. Hsiao, C., 2003, Analysis of Panel Data (Cambridge University Press). Jovanovic, Boyan, and Albert J. Menkveld, 2011, Middlemen in limit-order markets, SSRN eLibrary. Kirilenko, Andrei A., Albert (Pete) S. Kyle, Mehrdad Samadi, and Tugkan Tuzun, 2011, The flash crash: The impact of high frequency trading on an electronic market, SSRN eLibrary. Pagnotta, Emiliano, and Thomas Philippon, 2011, Competing on speed, SSRN eLibrary. Parkinson, Michael, 1980, The extreme value method for estimating the variance of the rate of return, The Journal of Business 53, 61–65. Patton, Andrew J., 2011, Volatility forecast comparison using imperfect volatility proxies, Journal of Econometrics 160, 246–256. Petersen, Mitchell A., 2009, Estimating standard errors in finance panel data sets: Comparing approaches, Review of Financial Studies 22, 435–480.
50