• How to recognize text using ABBYY FineReader: step-by-step instructions. How to use ABBYY FineReader

    This article covers ABBYY FineReader 12, the latest version of the program. Without looking too far afield, we chose ABBYY's best-known product, which, to its credit, is fully localized into Russian. Even at first glance, FineReader (FR) gives the impression of a program with good Russian-language support: everything in this respect, including the help documentation, is done to a very decent standard.

    First, a digression. The question of how to convert all or part of an archive into digital form (and what exactly “digital” means) is always relevant. Buying a scanner is unlikely to solve every problem. Scanners do often ship with one or more discs of bundled software. However, as early as the scanning stage it often turns out that the quality of the scanning program leaves much to be desired, or that the format it saves in is not suitable for storage. Why? Most graphic formats do not separate text from the non-text parts of a document, so it is impossible to copy a passage from such a file.

    It is in such cases that full-featured text recognition programs come to the rescue; among other things, they can extract text from an image.

    Getting to know ABBYY FineReader

    The ABBYY FineReader 12 package is an optical character recognition (OCR) system. It is designed both for automatically entering printed documents into a computer and for converting PDF documents and photographs into editable formats (from the program manual).

    The acronym “OCR” applies to all data recognition applications, not just text recognition. The source for data extraction can be a printed or an electronic document. Not so long ago, few people had heard of OCR in any form, and converting text into electronic form was pure drudgery, up to and including retyping the original by hand. Today, with a flatbed scanner (few people use a handheld scanner at home) and FineReader 12, you can be confident that scanning and recognition will cause no difficulties.

    Starting with version 6, FineReader supports import from and export to Adobe's PDF format. Many readers have probably had trouble converting from this format to another (DOC and so on), since there are not many genuinely useful programs in this area (the only one worth mentioning is a companion ABBYY product, PDF Transformer). The problem is that such programs perform text recognition in a single pass, so the fidelity of the result to the original is low (depending on the complexity of the document), and much of the document's formatting is lost.

    With FineReader, things are different. Version 9 of the program introduced a technology called Document OCR. It is based on the principle of recognizing a document as a whole: the document is analyzed and recognized as a single unit rather than page by page. Columns, headers, fonts, styles, footnotes and images are preserved or replaced with close equivalents of the originals.

    Installing the package

    A demo version of FineReader 12 can be downloaded from the Download section of the Abbyy.ru website; the full licensed version is distributed on CD. Purchasing options are described in the “Buy” section of the same site.


    ABBYY FineReader is distributed in several editions: Professional Edition, Corporate Edition, Site License Edition and so on. The other editions differ from Professional in that they are designed for use on a corporate network, with support for collaborative document recognition. Otherwise the differences are insignificant and come down to the terms of the license agreement.

    It is hard to imagine that 12 years ago there was FineReader 2.0, which occupied about 10 MB of disk space. Over time the package has grown roughly thirtyfold and now takes up to 300 MB when installed. Whether that is a lot or a little is for you to judge. The new FR supports 179 recognition languages, including little-known constructed languages (Ido, Interlingua, Occidental and Esperanto), programming languages, formulas and so on. Support for various formats and scenarios should not be forgotten either. So if you want to limit the space the package takes up, select only the components you will actually need during installation.

    The choice of components affects how long installation takes, although it should not take long in any case. During installation you will be shown the main features of FR. After activation (over the Internet, by e-mail, with a received code and so on), the program is fully functional. In demo mode you will run into various restrictions that prevent full use of the package.

    FineReader interface. Functionality

    The program's capabilities are accessible both through the built-in tasks (scenarios), which appear in the main menu immediately after installation, and through the main interface itself.


    FineReader splash screen at startup

    The program's appearance changes little from version to version: the developers see no point in changing it radically. Considerable attention is paid to ergonomics, which is noticeable in all ABBYY products (Lingvo, PDF Transformer, FlexiCapture...). In other words, the FineReader 12 interface is well thought out and suits all users, including beginners. The “get results in one click” principle will appeal to those who do not like configuring and changing things. More experienced users, on the other hand, can fine-tune FineReader through the settings dialog (Tools -> Options…). One caveat: for comfortable work in the application, it is advisable to set the screen resolution to 1280×800, so that all the tools are always at hand.

    After FineReader starts, a window appears with buttons for quick access to the program's functions. This window is also available through the Tools -> ABBYY FineReader menu, the “Main Scripts” button in the far right corner of the program, or the Ctrl+N key combination (as in Word, where this combination opens a new document).

    Scan to Microsoft Word: version 9 of FineReader added support for Microsoft Word 2007, which had not yet become widespread at the time. In turn, after FR is installed, a “branded” red icon appears in the add-ins section of the toolbar in Microsoft Office applications.


    Menu for exporting a recognized FineReader document
    Selecting languages for scanning and document recognition

    Besides Microsoft Office, FR integrates with Microsoft Outlook and can export recognition results to Microsoft Word, Excel, Lotus Word Pro, Corel WordPerfect and Adobe Acrobat. These features make working with the program somewhat easier and faster, especially if you use it regularly.

    PDF or images to Microsoft Word: recognizes data from a PDF or any other graphic file type supported by FineReader 12. Note that extracting text from a PDF file in FR is not just a matter of “peeling” the text content off the graphic content (a PDF may have no text layer at all). The recognition technology is in fact quite involved: after analyzing the content of the document, the program decides what to do with the text, whether to simply extract it or to recognize it, and it does so for each text fragment.

    Scan to Microsoft Excel: Scanning to XLS (Microsoft Excel format) may be justified if the scanned image contains tables.

    Scan to PDF: There are many reasons to scan to PDF. One of them is security: this is the only format supported by FR whose settings allow password protection. A password can be set not only for opening a document but also for printing it and for other operations. Three encryption levels are available: 40-bit RC4, 128-bit RC4 and 128-bit AES (Advanced Encryption Standard).

    Convert photo to Microsoft Word: converts a file from a graphic format (which may be a PDF or a multi-page image) to DOC/DOCX.

    Open in FineReader: opens a graphic file (PDF, BMP, PCX, DCX, JPEG, JPEG 2000, TIFF, PNG) for recognition in FineReader.
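    To put the three encryption levels listed under Scan to PDF in perspective, the following sketch compares the size of their key spaces. It is plain arithmetic on the key lengths (40 and 128 bits) and says nothing about how FineReader implements encryption; the function name is illustrative.

```python
def key_space(bits: int) -> int:
    """Number of distinct keys for a key of the given length in bits."""
    return 2 ** bits


# Key lengths of the encryption levels mentioned in the text.
levels = {
    "40-bit RC4": 40,
    "128-bit RC4": 128,
    "128-bit AES": 128,
}

for name, bits in levels.items():
    print(f"{name}: 2^{bits} = {key_space(bits):.3e} possible keys")

# How many times larger the 128-bit key space is than the 40-bit one.
ratio = key_space(128) // key_space(40)
print(f"128-bit vs 40-bit: 2^{128 - 40} times more keys")
```

    Key length is only part of the picture: RC4 and AES with 128-bit keys have the same key space but are different ciphers.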

    Working in FineReader

    Now, briefly, about working with the program. The whole process is divided into scanning, recognition and saving the results. After you have chosen the type of task and specified the file or the scanning device, FineReader carries out its job step by step; the job, incidentally, is quite demanding on the CPU.

    If you are the happy owner of a dual-core processor, then while working in FineReader 12 you can appreciate your computer's performance. Having detected a dual-core processor, FR recognizes not one but two pages of a document in parallel. It is a small thing, but a nice one.
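    The idea of recognizing several pages at once can be sketched in a few lines. This illustrates the general technique only, not FineReader's implementation: `recognize_page` is a hypothetical stand-in for the actual OCR step, and the worker count of two mirrors the dual-core case described above.

```python
from concurrent.futures import ThreadPoolExecutor


def recognize_page(page_number: int) -> str:
    """Hypothetical placeholder for the OCR of one page."""
    return f"text of page {page_number}"


def recognize_document(page_numbers, workers: int = 2):
    """Process pages concurrently, returning results in page order."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # map() preserves input order even if pages finish out of order.
        return list(pool.map(recognize_page, page_numbers))


results = recognize_document([1, 2, 3, 4])
print(results)
```

    Threads keep the sketch short; a genuinely CPU-bound workload in Python would more typically use processes (for example `ProcessPoolExecutor`).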

    First comes scanning, then recognition and export of a temporary document to the selected format.


    PDF document recognition process

    Scanning. You do not need to configure anything in FineReader before scanning (apart from selecting the scanning device). This is exactly what tasks were invented for: they simplify the execution of routine, repetitive actions.

    Recognition. Other details have been simplified as well. In previous versions of the program, the document language (or languages, if there were several) had to be changed manually. Now this happens automatically, although not always; when it does not, FR unobtrusively suggests checking the document language.

    Returning to FR's recognition technology: why does the program first analyze the entire document as a whole rather than page by page? As already mentioned, text is recognized on the basis of the entire content: fonts of similar size and typeface, tables and borders, indents and so on are identified.

    Do not be surprised if FineReader 12 displays a message saying that the page cannot be recognized because no text areas were found. As an experiment, we used a mobile phone to photograph part of a text document displayed on an LCD screen (knowing the result in advance). FineReader 12 did not recognize the text in the image, since its quality was clearly insufficient. On the second attempt, we photographed a page of text with a digital camera in normal lighting.

    FineReader recognized the passage without any problems, preserving the formatting and marking doubtful places and characters that could be read in more than one way.

    As the image shows, these are mainly periods, hyphens and commas, that is, small characters. It is also clearly visible that the program took into account the unevenness and curvature of the photographed page and straightened the lines of text. In short, FR did an excellent job with what was admittedly not a very difficult task.

    Occasionally FineReader may miss some minor details, but they are easily corrected by hand. Fortunately, the package has its own WYSIWYG editor, whose capabilities are quite sufficient for the final editing of a document. Spell checking is also available.

    How can recognition accuracy be improved so that less time is spent editing the text? First, you can connect a custom Microsoft Word dictionary. Admittedly, it is hard to judge the gain in accuracy, other than the larger vocabulary available to the spell checker (the module that checks spelling and grammar). Beyond that, to improve recognition it makes sense to look through the program settings (Tools -> Options) and select one of two modes:

    Thorough recognition - can be selected for documents of any “complexity”: tables without grid lines, text, graphs, tables on a colored background and so on. It can also help with low-quality source material.

    Fast recognition - recommended for processing large volumes of simply laid-out documents, or when time does not allow for thorough recognition. In most cases, when you have black printed text on a white background, fast recognition is enough.

    In general, improving the quality of work of FineReader is a separate topic for conversation, the details of which you can learn from the official help, namely in the section “How to improve the results obtained.”

    Saving the document. The last stage of work in FineReader 12 is saving the final result in a particular graphic or text format. Save settings can be specified in advance in the FR options: Tools -> Options, “Save” tab. Each format has its own settings. When saving in DOCX format, watch out for format compatibility (DOCX files are not recognized by Word 2003 and earlier). For TXT files, remember to check that the encoding is correct (especially for Cyrillic text).
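    The encoding check for exported TXT files can be illustrated with a short sketch. It assumes the file is either UTF-8 or Windows-1251, two common encodings for Cyrillic text; the helper name and the candidate list are illustrative and not part of FineReader.

```python
def decode_text(data: bytes, encodings=("utf-8", "cp1251")):
    """Try each candidate encoding in turn; return (text, encoding used)."""
    for encoding in encodings:
        try:
            return data.decode(encoding), encoding
        except UnicodeDecodeError:
            continue
    raise ValueError("none of the candidate encodings fit the data")


sample = "Договор"  # "Agreement" in Russian

# Bytes as they would be stored in a Windows-1251 text file.
raw = sample.encode("cp1251")
text, used = decode_text(raw)
print(used, text)

# Reading the same bytes with the wrong encoding yields mojibake.
garbled = raw.decode("latin-1")
print(garbled == sample)
```

    UTF-8 is tried first on purpose: it rejects most byte sequences that are not valid UTF-8, whereas Windows-1251 accepts almost any input and would mask a wrong guess.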

    ABBYY Screenshot Reader

    Developers of large packages often like to include small utility programs. For example, the well-known disc burning application Nero includes a set of 3-5 utilities that can do things even Nero itself cannot.

    As for FineReader, it includes one small application, Screenshot Reader. With it, you can quickly capture part of the screen and convert it to the desired format using FR. The program is available through the Start menu (Start -> All Programs -> ABBYY FineReader 12.0 -> ABBYY Screenshot Reader).

    The capabilities of Screenshot Reader are somewhat broader than they might seem at first glance (otherwise you could simply press the “PrintScreen” key on your keyboard). In addition to taking a screenshot (or, more precisely, capturing a selected area of the screen), Screenshot Reader is tightly integrated with FR.

    When you click the “Snapshot” button on the Screenshot Reader panel, the cursor changes shape and the screen area selection tool is activated. The selected area of the image is framed for subsequent text recognition, which starts automatically.

    In the drop-down list you can select the desired action: in effect, Screenshot Reader duplicates the FR quick tasks, except that the input is a screenshot rather than an image from the scanner.

    Note that the program, like the rest of the package, requires activation. When the product is registered, Screenshot Reader is provided with ABBYY FineReader 12 Professional Edition free of charge as a “bonus”.

    Conclusion

    FineReader is an indispensable program for scanning and recognizing graphic data. The Russian-language interface and the accessible settings will not scare off an inexperienced user. Support for the latest formats, innovative technologies and, as a result, high-quality recognition make the program the optimal choice, especially since ABBYY FineReader still has no competitors in this area.

    FineReader 12 hotkeys

    • Create a new ABBYY FineReader document - Ctrl+N
    • Open an ABBYY FineReader 12 document - Ctrl+Shift+N
    • Save pages - Ctrl+S
    • Save image to file - Ctrl+Alt+S
    • Recognize all pages of a document - Ctrl+Shift+R
    • Close the current page - Ctrl+F4
    • Recognize selected pages of an ABBYY FineReader document - Ctrl+R
    • Open Scenario Manager - Ctrl+T
    • Open the FineReader Options dialog - Ctrl+Shift+O
    • Open help - F1
    • Go to the Document window - Alt+1
    • Go to the Image window - Alt+2
    • Go to the Text window - Alt+3
    • Go to the Close-up window - Alt+4

    FineReader is one of the most popular tools for scanning and processing files of various types. This feature-rich software product was developed by the Russian company ABBYY; it can not only recognize documents but also process them (translate, change formats and so on). Many users manage to install it but cannot immediately work out how to use ABBYY FineReader. This article answers many of their questions.

    The program allows you to scan and recognize text - and more

    To understand what kind of program ABBYY FineReader 12 is, all of its capabilities need to be considered in detail. The first and simplest function is scanning a document. There are two scanning options: with and without recognition. If you scan a printed sheet without recognition, the scanned image is saved to the folder you specified on your computer.

    ATTENTION. The sheet must be placed straight on the scanning surface, along the guides marked on the device. Do not let the original lie skewed; this may result in a poor-quality final scan.

    Decide for yourself what you need FineReader for, since the utility offers extensive functionality. For example, you can choose the color mode of the image, and all images can be converted to black and white. In black and white, recognition is faster and the processing quality is higher.

    If you are interested in the text recognition function of ABBYY FineReader, you need to press a special button before scanning. In this case, there are several ways to obtain the result. By default, the recognized portion of the sheet is displayed on screen, where you can copy or edit it manually.

    If you select other functions, you can immediately receive the file as a Word document or an Excel spreadsheet. Selecting functions is very simple: the menu is intuitive and easy to customize, because all the buttons you need are right in front of you.

    IMPORTANT. Before recognizing text in ABBYY FineReader, you need to select the processing language correctly. Although the utility can work fully automatically, a low-quality source sometimes prevents it from determining which language the original was in. This greatly reduces the quality of the final result.

    Multiple operating modes

    To fully understand how to use ABBYY FineReader 12, try both operating modes: thorough and fast recognition. Fast mode is suited to high-quality images, and thorough mode to low-quality files. Thorough mode takes 3-5 times longer to process files.

    The illustration shows the result of the program - text recognition from an image

    What other functions are there?

    Text recognition in ABBYY FineReader is not the only useful feature. For greater user convenience, there is

    So, we have FineReader installed on our computer. We turn on the scanner and digitize a multi-page document. Let us call it "Agreement" for convenience.

    Place the first page of the document on the scanner glass and close the lid. Launch FineReader. Click the “Scan” button or press “Ctrl+K”. The "ABBYY FineReader Scanning" window opens. When digitizing an ordinary text page set in 11-12 point type, leave the default settings in the window and click the “Preview” button.

    The scanner runs, and after a few seconds we see our page in the preview window. Here we can change the size of the scan area if necessary. Then click the "Scan" button.

    FineReader starts the text recognition process, and within a minute the page image opens in the program window. The right side of the window is now divided into three sections. In the left section, "Image", we can edit the image. You can read more about image editing in the lesson "Scanning a book". In the right section, "Text", you can immediately make changes to the text, editing the content of the page even before saving it. This is very convenient when you need, for example, to quickly change dates, details and surnames in a document.

    An icon of the recognized page appears in the left part of the “Pages” window:

    If nothing needs editing, replace the first page on the scanner glass with the second and repeat the procedure. Having adjusted the scan area once in the "ABBYY FineReader Scanning" window in "Preview" mode for the first page, you can now click the "Scan" button straight away. The settings from the first page are kept, and subsequent pages are scanned without a preview. In this way we scan all the pages of our document.

    When we have finished, we open the pages by clicking on the icons one by one and check that they are in the correct order.

    After that, in the left part of the “Pages” window, select all the icons with the “Edit - Select all” command or with the keyboard shortcut “Ctrl+A”. Then, in the drop-down list next to the “Save” button, select the “Save as PDF document” command:


    Now click the button itself and save the document under the name “Agreement.pdf” in the “Agreement” folder:


    As a result, we get a multi-page text document in PDF format: an electronic version of our document, code-named “Agreement”.

    This is how text documents are digitized with FineReader.

    By changing the scanning mode to “color” in the “ABBYY FineReader Scanning” window, we can just as easily digitize color pictures and photographs.

    And choosing, for example, the “Save as Microsoft Word 2007 document” command from the context menu turns our project into a single multi-page editable Word document.

    In general, the program is easy to understand, intuitive and has pop-up tips everywhere.

    The history of Abbyy FineReader goes back more than 20 years. The company marked its 2013 anniversary with the release of a full-fledged (compared to the 2009 Express Edition) Abbyy FineReader Pro for Mac, and a couple of months later, in February 2014, Windows users received their “gift” as well: Abbyy FineReader 12 Professional and Corporate. Recall that the previous version appeared back in 2011, and two and a half years is a long time, so let us see how significant the changes are.

    General information

    The system requirements for the new version have hardly changed. The platform can be Windows or Windows Server, starting from XP and 2003 respectively. The hardware requirements are more than modest by today's standards: a processor of any bit width with a clock speed of 1 GHz or more, at least 1 GB of RAM plus 512 MB for each processor core, and so on. Only the disk space requirement has changed: installation now requires 850 MB rather than 700 MB (plus, as before, another 700 MB for working files).

    Naturally, these are minimum requirements; the full capabilities of Abbyy FineReader 12 Professional show only on relatively modern systems. In particular, recall that the program can efficiently parallelize the processing of individual pages, uses all processor cores and loads any processor to almost 100%. It is, however, not greedy when it comes to RAM, and even remains a 32-bit application.
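    The memory requirement quoted above is easy to turn into arithmetic. The sketch below takes the wording literally (a 1 GB base plus 512 MB for every core); if the vendor's requirement counts only additional cores beyond the first, the result would be 512 MB lower. The function name is illustrative.

```python
def min_ram_mb(cores: int, base_mb: int = 1024, per_core_mb: int = 512) -> int:
    """Minimum RAM in MB: a base amount plus a fixed amount per core."""
    if cores < 1:
        raise ValueError("cores must be at least 1")
    return base_mb + per_core_mb * cores


for cores in (1, 2, 4):
    print(f"{cores} core(s): {min_ram_mb(cores)} MB")
```

    Under this reading, a four-core processor calls for 3072 MB.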

    The installation procedure has not changed either: a minimum of questions and options. Abbyy FineReader 12 Professional still comes with Abbyy Screenshot Reader, which becomes operational only after user registration.

    After this, you will also have access to technical support.

    Even from this modest information we can assume that the new version is the result of evolution. Accordingly, in what follows I will concentrate on the changes compared to the previous version, which fall into two main groups: working with the program (interface, auxiliary tools, ease of use) and OCR (the quality and performance of recognition itself).

    Working with the program

    Abbyy FineReader 12 Professional shows some improvements in the user interface. This is immediately noticeable in the Tasks window, which opens by default when the program starts. It obviously imitates the tile concept of Windows 8.x and is adapted for finger control, especially since the program also supports basic gestures such as scrolling and zooming. In fact, the changes affect only the “facade”, and only partly: next to the tiles there are ordinary controls, and while setting up any task you will have to deal with standard dialog boxes. Working with them with your fingers is quite awkward, especially on the 8-10″ screens that are becoming popular on Windows tablets.

    It is certainly not hard to imagine that the user of such a tablet, equipped with a camera, might want to quickly capture a printed document “on the go.” Meanwhile, the entire history of Windows, starting with the first edition of Tablet PC, confirms the futility of adapting a standard desktop interface to touch control. For these purposes it is apparently much more sensible to create a special shell that follows all the Metro conventions but uses the same “engine”. Internet Explorer in Windows 8.x is an example of such a solution. Moreover, Abbyy even has some groundwork here in the form of Abbyy FineReader Touch for Windows 8, which uses the company's cloud service.

    Leaving touch input aside, there are still other changes of this kind, ranging from the expected update of the document open/save dialogs, which among other things provide easy access to cloud storage (if the corresponding agent and its folder are present on the system), to several more important and useful ones.

    Page processing in Abbyy FineReader 12 Professional is now done in the background. This means that the former modal window showing the status of operations is gone (its role is now played by the status bar at the bottom of the screen) and, accordingly, that the interface remains accessible. The user can therefore work with the program while recognition is in progress (provided, of course, that it takes long enough), for example by copying fragments of the recognized text or even adjusting the page layout; in the latter case the page is queued and processed again.

    Unlike in the previous version, pages are also no longer flipped through during recognition or during the initial loading of a document when automatic recognition is disabled. In Abbyy FineReader 12 Professional a document is loaded and split into pages almost instantly, and thumbnails are built only as you manually scroll through the left panel. Among other things, this saves computing resources, quite noticeably so on large multi-page documents.

    The remaining changes in this class are not so interesting, although they may be useful in some scenarios, so we will talk about them briefly.

    If you do not need to process the entire document, but only quote individual passages, then you can disable all automatic operations and select the necessary fragments of any type, immediately copying them to the clipboard - while analysis and recognition will be performed on the fly.

    To get a result with a simpler structure than the original, you can disable the recreation of headers, footers, and other layout elements. This can be useful, for example, when preparing e-books.

    On the subject of e-books, Abbyy FineReader 12 Professional supports the EPUB 2.0.1 and 3.0 formats.

    The options for conversion to XLSX have been expanded; for example, it is now possible to clear formatting or save images.

    When saving resulting documents to PDF with a text layer, you can now use Abbyy's new Precise Scan technology, which smooths the characters on the original page images. Incidentally, it is available only in color mode.

    The effect is quite noticeable, although not always, shall we say, “academic.” However, the readability of the smoothed characters should be higher in any case, and in this example the original is indeed of very low quality.


    OCR

    Now let's see what improvements have occurred in the recognition mechanisms themselves.

    The developers report the next stage in the improvement of ADRT technology, which, as a reminder, analyzes and recreates the logical structure of a document. It is claimed to work much more accurately now, especially with tables, lists and diagrams. Demonstrating this with suitable examples is not easy, but not impossible. Here, for example, are the recognition results (with default settings) for the same page in Abbyy FineReader 11 Professional (above) and Abbyy FineReader 12 Professional (below).


    The old version selected and processed only the main text block, perhaps treating the remaining elements as “garbage” because of the low quality of the original. The new one, by contrast, correctly identified the list and tried to recreate it. The result is not ideal, however: the fact that not all the list markers were recognized can again be put down to image quality, but the program apparently still did not understand that it was looking at a table of contents, otherwise it would not have interpreted the numbers as letters. Still, progress is obvious, and such complaints might not have arisen with higher-quality originals.

    And here is how an “implicit” table without dividing lines is processed - Abbyy FineReader 11 Professional (above) and Abbyy FineReader 12 Professional (below).


    It is clear that the old version, unlike the new one, saw no table structure here at all and limited itself to a set of unrelated text blocks. Take the time to compare the recognition results in the images: Abbyy FineReader 12 Professional is close to ideal.

    Unfortunately, this does not always happen, and on the neighboring pages Abbyy FineReader 12 Professional already showed results similar to those of Abbyy FineReader 11 Professional, even though it is precisely ADRT that should have noticed the identical table headers and understood that it was looking at a table continuing across pages.

    Still, it is clearly noticeable that the updated algorithms pay attention to more details than before. During testing, for example, Abbyy FineReader 12 Professional even attempted to interpret a picture containing neatly arranged text as a table. The new version also tries much more often to recreate diagrams and charts on the basis of a background image rather than from individual graphic and text blocks.

    Abbyy FineReader 12 Professional has several other new features designed to improve recognition quality. As is well known, one prerequisite is the quality of the original, especially if it was obtained with a camera rather than a scanner. That is why FineReader at one point introduced tools for pre-processing originals. In the new version the list has been extended with cropping at page edges, lightening and evening out of background brightness, and removal of colored elements. The last of these can be useful, for example, for processing documents with seals and stamps. In addition, the user can now enable the various methods individually.

    Language support has also been improved. First, Russian with stress marks has been added; second, improved recognition quality is claimed for Chinese, Japanese and Korean (up to 20%), Arabic (up to 60%) and Hebrew (up to 10%), apparently achieved by improving and further training the classifiers.

    And finally, one of the most burning questions for many readers: has the speed of the program increased? It is not so easy to answer this question substantively, especially with numbers - there are too many languages, each of which has its own nuances; the variety of originals is too great; There are too many unknown factors influencing the operation of algorithms. Therefore, even the developers themselves are quite restrained when talking about an increase in the performance of Abbyy FineReader 12 Professional by 10-15%.

    Such figures are usually obtained by processing fairly large sets of documents and therefore represent something like the “average temperature across the hospital.” So it is useful to look more closely at a few illustrative special cases, such as the following two:

    • 10 pages of a full-color A4 booklet, scanned in color at 300 dpi. The quality is good, the languages are Russian and English, the layout is complex;
    • an image-only PDF of a 138-page book containing a small number of color and black-and-white illustrations and several tables. The quality is low (starting, apparently, with the faint, “blind” print of the paper book), the languages are Ukrainian and Russian, the layout is simple.

    Both documents were recognized in color mode, and the second one was also recognized in black and white, to simulate preparing an e-book. All default settings were left unchanged, except for the set of languages and, accordingly, the operating modes. The test machine was a PC with an i5-3450 processor and 8 GB of memory. The results are presented in the following table:

    As you can see, for the PDF the speedup even exceeds the promised 15%; perhaps this is simply one of the special cases that suits the latest optimizations in the recognition algorithms well. Keep in mind that the two versions, generally speaking, did different amounts of work. Just look at the illustrations above for table processing: it is hard to say which version had the harder job.

    As for the number of errors, it was practically the same for both versions, although the two versions sometimes flagged different fragments and characters as uncertain; this is apparently evidence that the algorithms were retrained. In any case, most of the uncertainly recognized characters were identified correctly with the help of dictionaries, and the “gross” errors (incorrect interpretation of special and decorative symbols, text on graphics, etc.) were the same in both versions. So the difference can be considered vanishingly small.
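
    The role of dictionaries in resolving uncertain characters can be illustrated with a short Python sketch. This is only an illustration of the general idea, not FineReader's actual mechanism, which the source does not describe. It assumes that the recognizer reports several candidate characters for each doubtful position and that a word is accepted if some combination of candidates appears in a dictionary. The function name `resolve_with_dictionary` and the sample data are hypothetical.

```python
from itertools import product
from typing import List, Optional, Set


def resolve_with_dictionary(
    candidates: List[str], dictionary: Set[str]
) -> Optional[str]:
    """Pick the first candidate combination that forms a dictionary word.

    candidates[i] is a string of possible characters for position i;
    a confidently recognized position holds a single character.
    Returns None when no combination is found in the dictionary.
    """
    for combo in product(*candidates):
        word = "".join(combo)
        if word in dictionary:
            return word
    return None


# Hypothetical dictionary and a word with two doubtful positions:
# the second character is either 'c' or 'e', the fourth either 'n' or 'h'.
dictionary = {"scanner", "table", "text"}
reading = ["s", "ce", "a", "nh", "n", "e", "r"]

resolved = resolve_with_dictionary(reading, dictionary)
print(resolved)
```

    A word that cannot be matched this way stays uncertain and is left for the user to check, which is consistent with the observation that decorative symbols and text on graphics were misread by both versions.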

    Another question is how much such a performance improvement matters. A gain of half a minute on 138 pages that still have to be checked and possibly corrected is apparently not worth much. If jobs like the test tasks are performed only occasionally, you definitely do not have to worry about performance. It is a different matter when it comes to offline processing of large volumes of documents, which is available in ABBYY FineReader 12 Corporate. In that case, saving 15% of the time is already quite noticeable.
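
    The arithmetic behind this argument is easy to check. The Python sketch below takes the 15% time saving from the text and applies it to two hypothetical jobs: a single book that takes 200 seconds to recognize and a batch that takes 10 hours. Both baseline durations are assumptions chosen for illustration, not measurements from the test.

```python
def time_saved(baseline_seconds: float, time_reduction: float) -> float:
    """Seconds saved when processing time shrinks by the given fraction."""
    return baseline_seconds * time_reduction


def format_duration(seconds: float) -> str:
    """Format a duration in seconds as hours, minutes, and seconds."""
    minutes, secs = divmod(round(seconds), 60)
    hours, minutes = divmod(minutes, 60)
    return f"{hours}h {minutes:02d}m {secs:02d}s"


TIME_REDUCTION = 0.15  # the 15% saving mentioned in the text

# Hypothetical baselines: one book versus a large offline batch.
single_book = time_saved(200, TIME_REDUCTION)
large_batch = time_saved(10 * 3600, TIME_REDUCTION)

print("Single book:", format_duration(single_book))
print("Large batch:", format_duration(large_batch))
```

    Under these assumptions the single book gains about half a minute, while the batch gains an hour and a half, which is why the same percentage matters far more for bulk processing.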

    Summary

    Although the new ABBYY FineReader 12 Professional did not promise anything revolutionary, at least a few of its changes deserve high praise. First of all, these are the improvements to ADRT technology in recognizing tables, diagrams, and the logical structure of pages in general, which in some cases gives radically better results, as well as background processing, which opens up new possibilities for interactive work with large documents.

    There are also many other changes, although they are less significant. The move toward supporting touch control is certainly justified today, but the chosen path is flawed: it is hardly possible to make a single interface equally convenient for both a mouse and fingers. However, Windows tablets are only just trying to break into the market, so the developers at ABBYY still have time.

    Prices for ABBYY FineReader 12 Professional:

    • boxed version: 4,990 RUB;
    • download version: 4,490 RUB;
    • upgrade: 2,690 RUB.

    As usual, the answer to the question “is it worth switching from the old version to the new one?” depends on the situation. In any case, keep in mind that FineReader has a fairly long life cycle, and if any of the described improvements plays a significant role for you, then over 2-3 years the cost of the upgrade will certainly pay off - if not financially, then morally. Solving this question for yourself will finally help.

      To use ABBYY FineReader, which is designed to recognize text in non-editable and graphic formats, you first need to download it and install it on your computer, and then watch the video below, which describes the program in detail.

      This program is designed for scanning text, recognizing it, and working with it.

      Of course it can be used. Without leaving FineReader, you can recognize the text of a file and then convert the scanned copy of the document into a standard Word document that is ready for you to work with.

      FineReader is a program for scanning and text recognition that exports the results to popular office packages. The principle of working with it can be described in a nutshell as follows: take a sheet of paper with printed text, scan it with a scanner, and get a raster image file. Then, without leaving FineReader, recognize the text of the file and, as the next step, turn the scanned copy into a Word document. Before that, the recognized text can be viewed and edited. The resulting Word document can then be supplemented and edited further.

      ABBYY FineReader is undoubtedly the leader among similar programs.

      It has very broad capabilities for recognizing text from non-editable and graphic formats.

      The program can recognize text in common formats such as non-editable PDF, JPEG/JPG, DjVu, GIF, PNG, and others.

      Also, ABBYY FineReader works well with almost all scanner models.

      The main functions of the program are:

      Scanning documents to Microsoft Word, Microsoft Excel, or PDF; scanning and saving images; converting a PDF or image to Microsoft Word; converting photos to Microsoft Word.

      ABBYY FineReader work area:

      To add a new task, you must click on the **new task** button, which is located in the upper left part of the program work area.

      The new task window will open.

      In the window that opens, you need to select the task you want to perform.

      Let's say we have a photo of a document that we want to convert into a Microsoft Word document. To do this, find the clickable label Convert photo to Microsoft Word in the new task window and click it. A file browser window with a preview will open:

      In the window that opens, select the photo of the text that needs to be recognized and converted into the format you need.

      A window with a recognition progress bar will open:

      After the program processes the photo and tries to recognize the text, you will see the following:

      Here you can select the area of your photo for text recognition.

      After selecting the area, click the recognize button, which is located in the program's top menu. The program will begin converting the selected photo into text. After the image has been processed, click the arrow next to the save button and select the required format for the text document:

      ABBYY FineReader is a powerful, feature-rich program designed for high-quality scanning and accurate recognition (accuracy depends on the resolution set during scanning) of various paper media with printed text (books, magazines, newspapers, etc.), as well as digital images.

      The program supports various recognition languages and can save results to Microsoft Word, PDF, image formats, and other formats. Because the program has an intuitive interface, it is convenient to work with.

      So, the first thing to do is configure the settings and scan the document; this gives an image whose text the program then needs to recognize. After recognition, you can correct the text (if there are any inaccuracies) and save it in the desired format.