.. include:: /includes.rst.txt .. comments - headings # with overline, for parts * with overline, for chapters = for sections - for subsections ^ for subsubsections " for paragraphs .. _importing_for_a_corpus_target: .. index:: pair: Importing; Documents for Corpus .. index:: pair: Corpus; Importing Documenets Importing for a Corpus ^^^^^^^^^^^^^^^^^^^^^^ **Import Single Document** is available under the *Imports* tab only for Corpora collections. It supports manually importing an external file into the corpus, rather than going through a connector or importing an existing RDF representation of the corpus. When selected, use the **Browse…** button to select a source file. Its text and metadata will be parsed by the Apache Tika content analysis toolkit, which can handle these `supported formats`_. The **Show Imported data** button on the next screen allows reviewing retrieved information. Most supported file formats will present three sections: 1. common Metadata Properties such as file name, `media type`_, title, creator; 2. Content, which is the actual document’s text (where applicable); 3. Other Properties, which include various ones the importer was unable to label and are therefore referred to with their URIs. .. _supported formats: http://tika.apache.org/1.14/formats.html .. _media type: https://www.iana.org/assignments/media-types/media-types.xhtml