Skidmore Computational Physics & ML Lab A computational physics and machine-learning research group at Skidmore College

Preparing LaTeX files for arXiv submission

Submitting to the arXiv requires some preparation of your LaTeX files to ensure proper formatting and avoid common pitfalls. This guide walks through the essential steps, from bibliography management during writing to final file cleanup before upload.

During writing

Use \bibliographystyle{utphys} to format your bibliography. Jacques Distler wrote this BibTeX style with arXiv citations in mind, so eprint numbers are printed alongside the journal reference.

At the end of your LaTeX file, before \end{document}, add \bibliography{inspire,extra}. The inspire.bib file contains references from INSPIRE-HEP, while extra.bib holds references not available there.

Create the extra.bib file and add citations as needed. Many mathematics papers, for instance, are not indexed by INSPIRE and must be entered by hand.

Autogenerate inspire.bib using the INSPIRE bibliography generator. In your main .tex file, cite works using their INSPIRE key (preferred) or arXiv number, for example \cite{Apte:2024vwn} or \cite{2405.14830}. After finishing your draft, upload the main .tex file to the bibliography generator, which produces the contents of inspire.bib automatically. This approach ensures correct arXiv numbers and consistent citation formatting.
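As a minimal sketch of the setup described in this section, a main .tex file might look like the following. The article document class and the body sentence are placeholders assumed for illustration; only the \cite, \bibliographystyle, and \bibliography lines come from the steps above.

```latex
% main.tex: minimal skeleton (document class and body text are placeholders)
\documentclass{article}

\begin{document}

% Cite by INSPIRE key (preferred) or by arXiv number.
This result was obtained in \cite{Apte:2024vwn}.

% utphys.bst is assumed to sit in the same folder as main.tex.
\bibliographystyle{utphys}
% inspire.bib is autogenerated; extra.bib holds the hand-written entries.
\bibliography{inspire,extra}

\end{document}
```

Running latex, then bibtex, then latex twice more on a file like this should resolve the citations, provided both .bib files exist and contain the cited keys.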

Final draft

Clean both .bib files using bibtex-tidy. This tool merges or deletes duplicates. You should also use it to remove problematic fields like url, issn, and month that can disrupt reference formatting.

Comment out packages that you only needed while drafting, such as todonotes and similar development tools.
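One way to do this is sketched below, assuming the draft uses todonotes and its \todo command. Once the package line is commented out, any \todo calls left in the text will stop the file from compiling, so either delete them or define a temporary no-op stand-in. The stub definition here is a hypothetical illustration, not part of todonotes.

```latex
% Drafting-only package, disabled for the submission version.
% \usepackage{todonotes}

% Hypothetical stand-in: swallows \todo{...} and \todo[options]{...}
% so that leftover notes produce no output.
% Remove this line once all \todo calls have been deleted from the text.
\newcommand{\todo}[2][]{}
```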

Process your main .tex file through arxiv-latex-cleaner. It strips all comments from the source, which matters because arXiv makes source files publicly downloadable and personal notes would otherwise be visible to readers. You can install the tool locally with Python or run it in Google Colab: run !pip install arxiv-latex-cleaner, upload your files to the content folder, then execute !arxiv_latex_cleaner . (the period stands for the current directory). This creates a content_arXiv directory containing the cleaned files.

Format the cleaned .tex file using the LaTeX formatter. While not essential, this creates nicely indented, readable code.

Uploading to arXiv

Create a file named 00README.XXX in the base directory of your submission folder, containing the single word nohypertex. This tells arXiv not to apply its own hyperref setup, so the hyperref you load explicitly in your preamble is used instead.
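For this setting to be useful, the preamble of main.tex should load hyperref itself. A sketch of such a line follows; the option values are assumptions chosen for illustration, not a recommendation from arXiv.

```latex
% Preamble of main.tex: load hyperref explicitly,
% usually as one of the last packages.
\usepackage[colorlinks=true, linkcolor=blue, citecolor=blue, urlcolor=blue]{hyperref}
```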

Compile your file one final time so that main.bbl is up to date. Then delete everything except main.tex, main.bbl, utphys.bst, 00README.XXX, and any figure files your paper includes. Zip the files themselves (not the folder containing them) and upload the zip file as your submission.
