Skip to main content

Home/ Open Web/ Group items tagged NOOXML

Rss Feed Group items tagged

Gary Edwards

Bricolage Structured Prediction Algorithm - 0 views

  •  
    I was surprised to learn that Florian's native document parser is a JSON like ripper of OpenXML visual objects.  He doesn't wrestle with structured objects, but simply treats everything as a visual object.  NOOXML might be closer to a virtual print driver than a OpenXML ripper.   So this has me rethinking the OCR/Scan methods used to rip paper documents to create Tagged PDF "structured object" versions.  Structured objects can easily be converted to interactive HTML-CSS or SVG.  Today Google released an OCR enhanced Android gDOCS app.  Not sure if it uses the Bricolage/Bento algorithm, but that would be an interesting approach. excerpt: the Bricolage algorithm for transferring design and content between Web pages. Bricolage employs a novel, structured-prediction technique that learns to create coherent mappings between pages by training on human-generated exemplars. The produced mappings are then used to automatically transfer the content from one page into the style and layout of another. We show that Bricolage can learn to accurately reproduce human page mappings, and that it provides a general, efficient, and automatic technique for retargeting content between a variety of real Web pages.
Gary Edwards

Mars:FAQ - Adobe Labs - 0 views

    • Gary Edwards
       
      Sounds like docubase "layers" to me.
  • auxiliary content
  • document assembly and disassembly b
  • ...5 more annotations...
    • Gary Edwards
       
      The Acrobat 8 Reader can read Tagged PDF, MARS and Flash.  Flash uses SWF-FLA, a proprietary version of SVG.  Funny they would use SVG (with namespace customization) for MARS.
  • Anyone over the age of 18, or minors with parental permission, can
  • ocument.
  • create a Mars d
    • Gary Edwards
       
      Wow, anyone can create a MARS document.  Even OpenOffice?  How about Florian's NOOXML Trellis?
1 - 2 of 2
Showing 20 items per page