Exploded vector text to OCR text


See Also: OCR settings


Whilst classically OCR recognition is normally performed on raster images, Scan2CADv8.3a and later has introduced a new ‘exploded vector text’ to OCR ‘ASCII text ‘option.


What is exploded text.

Exploded text is text made up from a connected combination of either line/bezier/polygon vectors and typically for large/emboldened text a series of connected solid polygon vectors.

Examples of exploded text:-

· Text made from connected line and polyline vectors. (for clarity the vector end points are displayed with small squares)

· Bold text made from a series of connected solid polygons. (polygons in gray color in this example and for clarity the polygon edges are highlighted in black)

The above types of text representation are typically found in PDF documents and DXF files that have been produced from CAD packages.

Where a user wishes to change the contents of this type of exploded text they would need to delete all vectors used to represent the text and completely retype the new contents.


Exploded vector text to OCR text.

This feature is provided via 3 Scan2CAD OCR Menu options used in the following way.

· 1. First set the Character Size for the required OCR identification using OCR Menu > Char Size > Vector.

You will now be prompted to draw a rectangle closely around a group of vectors representing a single character as illustrated below with the dotted line surrounding the G character. Where the choice allows, select an upper-case character, preferably one of the larger characters such as W or M.


· 2. Now window around the required exploded vector text string(s) for conversion.



· 3. Now initiate the OCR conversion using OCR Menu > OCR Run > Vector or press the F5 function key.

The exploded text will now be converted to editable text as displayed below in the Scan2CAD default mauve text color. The original exploded text will have been removed.

· 4. If the OCR conversion was sufficiently inaccurate or for some other reason you wish to return to the original exploded text then use OCR Menu > UnDo OCR > Vector or press the Ctrl+F5 function key. The OCR text will be removed and the original exploded vector text returned.
· Other Considerations
· Whilst exploded text to OCR conversion can be done on the whole vector image, it is advisable to select only those exploded text strings that require conversion for updating as it is possible to unwittingly convert other drawing elements to OCR text thus affecting the drawing integrity.
· When moving from a block of exploded text to another of a significantly different text height then reset the character size as described in 1 above before proceeding with 2 and 3. Failure to do this will affect OCR recognition accuracy.
· Where text is not in a horizontal plane then set the appropriate Character Rotation in the OCR Settings dialog.
· If a string contains words of differing colors then before using OCR Menu > Run > Vector, select OCR Menu > Split Strings on Space
· The OCR Menu > UnDo OCR > Vector applies to the last actioned OCR Menu > Run > Vector only. If the last OCR Menu > Run > Vector produced more than 512 strings then only the last 512 strings of that Run will be UnDone.
· If a series of widely separated exploded vector text strings have been encapsulated as a single poly-polygon exploded text object then it may be necessary to manually delete the poly-polygon object after the OCR conversion.
· All other OCR Menu options as used for raster OCR apply equally to vector OCR.

Have questions on this topic? Talk to us