Documented all of the guesswork Paperless does

This commit is contained in:
Daniel Quinn
2016-03-28 14:54:09 +01:00
parent aea9ea50e5
commit 54443fa808
5 changed files with 97 additions and 39 deletions

View File

@@ -52,9 +52,12 @@ for PDF files to parse and index. The process is pretty straightforward:
wait 10 seconds and try again.
2. Parse the PDF with Tesseract
3. Create a new record in the database with the OCR'd text
4. Encrypt the PDF and store it in the ``media`` directory under
4. Attempt to automatically assign document attributes by doing some guesswork.
Read up on the :ref:`guesswork documentation<guesswork>` for more
information about this process.
5. Encrypt the PDF and store it in the ``media`` directory under
``documents/pdf``.
5. Go to #1.
6. Go to #1.
.. _utilities-consumer-howto: