pdf-toolbox-content-0.1.1: A collection of tools for processing PDF files
Safe HaskellSafe-Inferred
LanguageHaskell2010

Pdf.Content.UnicodeCMap

Description

Unicode CMap defines mapping from glyphs to text

Synopsis

Documentation

data UnicodeCMap Source #

Unicode character map

Font dictionary can contain "ToUnicode" key -- reference to a stream with unicode CMap

Instances

Instances details
Show UnicodeCMap Source # 
Instance details

Defined in Pdf.Content.UnicodeCMap

parseUnicodeCMap :: ByteString -> Either String UnicodeCMap Source #

Parse content of unicode CMap

unicodeCMapNextGlyph :: UnicodeCMap -> ByteString -> Maybe (Int, ByteString) Source #

Take the next glyph code from string, also returns the rest of the string

unicodeCMapDecodeGlyph :: UnicodeCMap -> Int -> Maybe Text Source #

Convert glyph to text

Note: one glyph can represent more then one char, e.g. for ligatures