No
|
# of pdf Pages
|
Original Document (PDF)
|
Logical Structure Extracted (XML)
|
Generated Schema (XSD)
|
Transforming (XSLT)
|
DocBody without Headings (TXT)
|
Unique Tokens in Doc Body
|
Headings without Numbers
|
Unique Tokens in Headings
|
Heading Numbers Used in Linking
|
Headings Used in Cross Referencing
|
# of html Pages
|
Final Interface (HTML)
|
HTokens' Position among DTokens
|