Preprints

What are preprints?

A preprint is a version of a document that precedes the formal peer review and publication stage in peer-reviewed journals. A preprint of a document may be available before, and remain available after, publication. As a result, preprints may contain content similar, or identical, to the documents they preceded, and could therefore influence the Similarity Report.

What to do with preprints?

iThenticate has identified a list of websites that contain preprint sources. As the administrator, you have the ability to decide how users of your account see these sources.

  1. The options for preprint sources are available from the Settings page, accessible from the sidebar.
  2. From the Preprints heading, you will have the option to do the following with preprint sources:
  3. Label preprint sources

    This will label any source in the Similarity Report that iThenticate has identified as a preprint. With this option selected, preprint sources will appear in the Sources panel of the Similarity Report. Your account users will be able to exclude preprint sources from the Similarity Report using the Similarity Report settings page or from within the report.

    Label and exclude preprint sources

    This will label any source in the Similarity Report that iThenticate has identified as a preprint. iThenticate will automatically exclude these preprint sources. They will appear in the Similarity exclusion area of the Similarity Report. Users will not be able to reinclude these sources in the Similarity Report.

    Don’t label preprint sources

    This option means if any preprint sources are found in the Similarity Report, they will be identified as regular sources. There will be no differentiation between preprint source matches and regular matches.

  4. Use the Save button to confirm any changes you’ve made.

How do we identify preprints?

When enabled, this feature will label and exclude sources (depending on the setting) that have been identified as preprints.

Harvested metadata or a source's URL are used by iThenticate to clearly differentiate a preprint from a published article. Identification is achieved through use of the harvested metadata or by using the source's URL.

iThenticate has identified repositories that only host preprints to ensure that published content is not erroneously labeled for exclusion. iThenticate has identified repositories that only host preprint content, and therefore can be used with certainty to identify preprint sources.

Included preprint repositories

Below is a list of the current repositories that iThenticate uses to correctly identify preprints:

RepositoryURL
APSA Preprintshttps://preprints.apsanet.org/
arXivhttps://arxiv.org/
bioRxivhttps://www.biorxiv.org/
Cryptology ePrint Archivehttps://eprint.iacr.org/
EarthArXivhttps://eartharxiv.org/
EasyChairhttps://easychair.org/publications/preprints
engrXivhttps://engrxiv.org/
Mathematical Physics Preprint Archivehttps://web.ma.utexas.edu/mp_arc/
medRxivhttps://www.medrxiv.org/
Optimization onlinehttp://www.optimization-online.org/
PeerJ PrePrintshttps://peerj.com/preprints/
Preprints.orghttps://www.preprints.org/
Research Squarehttps://www.researchsquare.com/
SciELO Preprintshttps://preprints.scielo.org/
WikiJournal preprintshttps://en.wikiversity.org/wiki/WikiJournal_Preprints/

Identified but not yet included preprint repositories

iThenticate is always working towards expanding our preprints repository. Below is a list of preprints repositories that we are aware of and are actively working on adding to our repository:

RepositoryURL
Arabixivhttps://arabixiv.org/
BioHackrXivhttps://biohackrxiv.org/
BodoArXivhttps://osf.io/preprints/bodoarxiv
ChemRxivhttps://chemrxiv.org/
EcoEvoRxivhttps://ecoevorxiv.org/
ECSarXivhttps://ecsarxiv.org/
EdArXivhttps://edarxiv.org/
FocUS Archivehttps://osf.io/preprints/focusarchive
FrenXivhttps://osf.io/preprints/frenxiv
INA-Rxivhttps://osf.io/preprints/inarxiv
IndiaRxivhttps://indiarxiv.in/
Jxivhttps://jxiv.jst.go.jp/
LawArXivhttps://osf.io/preprints/lawarxiv
LIS Scholarship Archivehttps://osf.io/preprints/lissa
MarXivhttps://osf.io/preprints/marxiv
MataArXivhttps://osf.io/preprints/metaarxiv
MediArXivhttps://mediarxiv.org/
MetaArXivhttps://osf.io/preprints/metaarxiv/
MindRxivhttps://mindrxiv.org/
NutriXivhttps://osf.io/preprints/nutrixiv
OSF Preprintshttps://osf.io/preprints/
PaleorXivhttps://paleorxiv.org/
PsyArXivhttps://psyarxiv.com/
RIN arxiv (formerly INArxiv)https://rinarxiv.lipi.go.id/
SocArXivhttps://osf.io/preprints/socarxiv/
SportRxivhttps://osf.io/preprints/sportrxiv
ARPHA Preprintshttps://preprints.arphahub.com/
TechRxivhttps://www.techrxiv.org/
ViXrahttps://vixra.org/

Unincluded preprint repositories

iThenticate is also aware of sites that host both preprint content and published content, as well as sites that are not accessible to iThenticate. As a result, content from these sites cannot be correctly labeled. Below is a list of repositories that we are either unable to access or accurately label:

RepositoryURL
Authoreahttps://www.authorea.com/preprints
Beilstein Archiveshttps://www.beilstein-archives.org/
Cell Sneak Peekhttps://www.ssrn.com/index.cfm/en/cell-press-sneak-peeks/
ChinaXivhttp://chinaxiv.org/
ESSOArhttps://www.essoar.org/
JMIR Preprintshttps://preprints.jmir.org/
Nature Precedingshttps://www.nature.com/npre/
NBER Working Papershttps://www.nber.org/papers
Preprints with The Lancet on SSRNhttps://www.ssrn.com/index.cfm/en/the-lancet/
Preprints.ruhttps://preprints.ru/
SSRNhttps://www.ssrn.com/
Therapoidhttps://therapoid.net/en/preprint/
Zenodohttps://zenodo.org/

If you are aware of any preprint repositories which are missing from this page, or you are able to assist us with accessing repositories we have been unable to access so far, then please content@turnitin.com.