6_tcash.tex 7.0 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172
  1. \section{Tornado Cash Heuristics}
  2. \label{sec:tornado}
  3. Focusing on the subset of Ethereum transaction data involving Tornado cash deposits and withdrawals, we discuss a five heuristics for identifying compromised deposits.
  4. \subsection{Address Match}
  5. \begin{figure}[h!]
  6. \centering
  7. \includegraphics[width=0.75\linewidth]{figures/tcash/h1.png}
  8. \caption{Address match -- The triangle represents a single address withdrawing and depositing to a TC pool.}
  9. \label{fig:tornado}
  10. \end{figure}
  11. Suppose the address making a deposit transaction to a Tornado Cash pool matches the address making a withdrawal transaction (from the same pool). In that case, the two transactions can be linked, and the corresponding deposit is compromised as the user identity may be revealed. These may be TORN yield farmers who deposit and withdraw to the same address and are only profit-motivated or clumsy Tornado cash users.
  12. \subsection{Unique Gas Price}
  13. \begin{figure}[h!]
  14. \centering
  15. \includegraphics[width=0.7\linewidth]{figures/tcash/h2.png}
  16. \caption{Unique gas price -- two addresses depositing and withdrawing with the same 27.4 gwei gas price.}
  17. \label{fig:tornado}
  18. \end{figure}
  19. Prior to EIP-1559, Ethereum users could specify the gas price when making a deposit or withdrawal to a Tornado Cash pool. Those who do so tend to specify gas prices that are identical for deposit and withdrawal transactions. User-specified gas prices follow common patterns such as being round numbers or being unique to an individual. Care must be taken to remove transactions made by a relayer service which may set gas prices as well. Relayer-specified prices also tend to be oddly specific and not round numbers. In practice, relayers can be filtered out by decoding the input code from Ethereum transaction data.
  20. \subsection{Linked ETH Addresses}
  21. \begin{figure}[h!]
  22. \centering
  23. \includegraphics[width=0.6\linewidth]{figures/tcash/h3.png}
  24. \caption{Linked ETH addresses -- the green arrows represent interactions between two addresses A and D outside of TC. Addresses A and D deposit and withdraw from the same Tornado Cash pool, respectively.}
  25. \label{fig:tornado}
  26. \end{figure}
  27. This heuristic aims to link withdraw and deposit transactions on Tornado Cash by inspecting ETH (non-Tornado Cash) interactions. This is done by constructing two sets, one corresponding to the unique Tornado Cash deposit addresses and one to the unique Tornado Cash withdraw addresses, to then make a query to reveal transactions between addresses of each set: when at least three such transactions are found, the withdraw and deposits addresses will be considered heuristically linked in Tornado Cash. The more transactions found, the more confident we are in the link.
  28. \subsection{Multiple Denomination}
  29. \begin{figure}[h!]
  30. \centering
  31. \includegraphics[width=0.6\linewidth]{figures/tcash/h4.png}
  32. \caption{Multi Denomination -- Addresses A and D deposit and withdraw the same number of times from the same three Tornado Cash pools, respectively.}
  33. \label{fig:tornado}
  34. \end{figure}
  35. Previous heuristics examine isolated pairs of deposit and withdraw transactions. This heuristic, however, studies an address' history of transactions. We compute the portfolio of an address' withdrawals across Tornado Cash pools, hence ``multi-denomination''. For instance, Alice may have withdrawn from the 1 ETH pool twice, the 0.1 ETH pool five times, the DAI pool once. Then we search for all addresses whose portfolio of deposit transactions is \textit{exactly the same} as Alice' withdrawal portfolio. An address depositing into the 1 ETH pool three times, the 0.1 ETH pool five times, and the DAI pool once would be a valid match. (Optionally, we can relax this to search for deposits with at least as many as Alice' withdrawals, though this risks more false positives). All Tornado Cash transactions under the matched deposit and withdrawal address are linked to each other.
  36. We ignore all addresses that make fewer than three transactions to Tornado Cash pools, and ignore all addresses that interact with only one pool. Additionally, to reduce the likelihood of false positives, we further constrain deposits within a multi-denomination reveal to occur within a 24 hour window; similarly we separately constrain withdraws within a multi-denomination reveal to occur within a 24 hour window. This intuition being that careless users will deposit or withdraw all at once.
  37. % Unlike prior heuristics, this one has a higher chance of false positives. To address this, we assign a confidence to clustered transactions based on the difference in count between deposit and withdrawal portfolios. For an address $v$, we assign:
  38. % \begin{equation*}
  39. % \kappa(v) = \frac{1}{\sum_{p \in \textup{pools}} |(\#D(v,p)) - (\#W(v,p))| }
  40. % \end{equation*}
  41. % where $\#D(v,p)$ is the number of deposit transactions made by address $v$ to pool $p$, and similarly $\#W(v, p)$ is the number of withdrawal transactions. We remove all transactions of addresses with confidence $\kappa(v) < 0.1$.
  42. \subsection{TORN Mining}
  43. \begin{figure}[h!]
  44. \centering
  45. \includegraphics[width=0.6\linewidth]{figures/tcash/h5.png}
  46. \caption{TORN Mining -- Address D was given 10 TORN upon withdrawing from the 1 ETH pool in return for anonymity points, linking address D to a deposit 100 blocks prior. Searching records, only address A deposited in the 1 ETH pool 100 blocks prior, compromising address D. Note that the numbers presented here are for explanatory purposes.}
  47. \label{fig:tornado}
  48. \end{figure}
  49. In February 2021, Tornado Cash introduced anonymity mining. It was an incentive scheme to encourage more deposits in Tornado Cash pools, thereby increasing their anonymity sets. Tornado Cash rewarded participants a fixed amount of anonymity points (AP) based on how long they left their assets in a pool.
  50. After withdrawing assets, users can claim Anonymity Points. The amount withdrawn is recorded in the transaction. If a user uses a single address to claim all of their anonymity points, one can calculate the exact number of Ethereum blocks that their assets were in the pool because the AP yields were public and fixed. If there is a unique deposit / withdrawal combination in a pool separated by this number of Ethereum blocks, the transactions are assumed linked. This is more likely when the deposit or withdrawal in the pair also claimed the AP. This heuristic is most effective if AP is being claimed for a single deposit and is harder to compute for multiple deposits.
  51. \subsection{Connection to Ethereum-based Clusters}
  52. Unlike the Ethereum-wide heuristics (i.e., DAR and NODE), which find clusters of compromised \textit{addresses}, Tornado Cash heuristics find clusters of compromised \textit{transactions}. However, aside from address matching, a subset of these Tornado Cash heuristics can also be applied on the address level: given a cluster of compromised transactions, find the sender address for each transaction, and compute the unique set, removing any addresses in our list of known addresses. These clusters can be added to the ones discussed in Section~\ref{sec:eth}. This in turn, will further enrich the anonymity score.