Making equations copyable in pdf

The PDF format supports a feature "ActualText" that is used for copy-paste instead of the actual typeset text. However, it is not supported by all PDF viewers, but Acrobat Reader does support it.

\documentclass{article}
\usepackage{accsupp}

\newcommand*{\copyable}[1]{%
  \BeginAccSupp{%
    ActualText=\detokenize{#1},%
    method=escape,
  }%
  #1%
  \EndAccSupp{}%
}

\begin{document}
\[
  \copyable{\int x\sin ax\;\mathrm{d}x = \frac{\sin ax}{a^2}-\frac{x\cos ax}{a}+C}
\]
\end{document}

Result

Copy-paste result (\detokenize adds spaces after command names):

\int x\sin ax\;\mathrm {d}x = \frac {\sin ax}{a^2}-\frac {x\cos ax}{a}+C

A more verbatim-like copy is much more complicated:

\documentclass{article}
\usepackage{accsupp}

\makeatletter
\newcommand*{\copyable}{%
  \begingroup
  \@sanitize
  \catcode`\%=14 % allow % as comment char, also needed for \%
  \@copyable
}
\newcommand*{\@copyable}[1]{%
  \endgroup
  \BeginAccSupp{%
    ActualText=\detokenize{#1},%
    method=escape,
  }%
  \scantokens{#1}%
  \EndAccSupp{}%
}
\makeatother

\begin{document}
\[
  \copyable{\int x\sin ax\;\mathrm{d}x = \frac{\sin ax}{a^2}-\frac{x\cos ax}{a}+C}
\]
\end{document}

Copy-paste result:

\int x\sin ax\;\mathrm{d}x = \frac{\sin ax}{a^2}-\frac{x\cos ax}{a}+C

However, the argument of \copyable is read with verbatim catcodes. Therefore this trick will not work, if \copyable is inside an argument of another macro (or in environments of package amsmath). In this case the \detokenize has a function to get the previous version with spaces after command names at least.

Discussion about areas

\copyable inserts whatsits that does not influence the mathematical spacing. Thus the whole equation can be split:

\[
  \copyable{\int x\sin ax\;\mathrm{d}x}
  \copyable{=}
  \copyable{\frac{\sin ax}{a^2}}
  \copyable{-}
  \copyable{\frac{x\cos ax}{a}}
  \copyable{+}
  \copyable{C}
\]

And where the mapping from formula to copy-paste text is correct, \copyable can be omitted:

\[
  \copyable{\int x\sin ax\;\mathrm{d}x}
  =
  \copyable{\frac{\sin ax}{a^2}}
  -
  \copyable{\frac{x\cos ax}{a}}
  + C
\]

The area that is shown in the PDF viewer for selecting depends on the formula and especially the PDF viewer.

Acrobat Reader usually only shows the area above the first symbol of the formula. In case of the integral sign, a small rectangle above it. The latest example with AR9/Linux:

AR selecting all

Evince 3.4.0 is a little better in the coverage of the rectangle. The first equation with the whole equation in \copyable:

Evince selecting the whole equation

But Evince has problems in the separation of the equations. Here I wanted to select the second equation only, but it gets mixed up with the third:

Evince selecting second equation

Okular 0.14.3 has a nicer selecting tool, but if the \copyable is split up into terms (second and third equations of the latest example), then the terms get mixed up. - Fatal for mathematicians.


It is also possible to use the Mathpix desktop app (for Mac and PC(beta)) to take a screenshot of the PDF and it will render the LaTeX instantly (and already have it copied to your clipboard, all you have to do is paste into your editor of choice).

Here is a brief demo using Mathpix to extract LaTeX from a PDF and pasting into Overleaf:

enter image description here