Skip to main content

Home/ Diigo Community/ Group items tagged html

Rss Feed Group items tagged

Graham Perrin

Mediactive » Two Tools for Archiving Web Pages - 15 views

  • Tools for Archiving Web Pages
  • Diigo
  • ...13 more annotations...
  • Internet research
  • many interesting things
  • collaborators
  • snapshot
  • What Diigo calls a screenshot
  • October 7th, 2009
  • the user can still search and highlight the text
    • Graham Perrin
       
      Maybe some confusion?

      A single snapshot comprises an HTML file and a PNG (screen shot), both cached.

      The snapshot page defaults to an HTML version.

      Texts within HTML cached copies are indexed, and can be searched; and cached HTML content is used for the underlying part of an annotated view, but:

      * it is not (or should not be) possible to draw new Diigo highlights across cached copies.
  • is a cached version
  • PositivePress
  • priced high for those only interested in page archival
  • impressive
  • a one-button approach
  • searchable, cached
Graham Perrin

ODF versus OOXML: Don't forget about HTML! - O'Reilly XML Blog - 0 views

  • Don't forget about HTML
  • February 25, 2007
  • HTML’s potential and actual suitability for much document interchange
  • ...27 more annotations...
  • HTML is the format to consider first
  • validated, standards compliant XHTML in particular
  • HTML at one end (simple WP documents)
  • PDF at the other end (full page fidility but read-only)
  • W3C versus ISO

  • HTML, ODF, OOXML, PDF
  • Lie adopts an extreme view towards overlap of standards:
  • overlap at all brings nothing but misery and bloat.
  • The next dodgy detail is to make blanket comparisons between HTML and ODF/OOXML.
  • ODF and OOXML deal with many issues that HTML/CSS simply does not.
  • the W3C argument might be to say that every part should have a URL
  • a strange theory that MS wants ODF and OOXML to both fail
  • being pro-ODF does not mean you have have to be anti-OOXML
  • HTML is the format of choice for interchange of simple documents
  • ODF will evolve to be the format of choice for more complicated documents
  • OOXML is the format of choice for full-fidelity dumps from MS Office
  • PDF is the format of choice for non-editable page-faithful documents
  • all have overlap
  • we need to to encourage a rich library of standard technologies,
  • widely deployed,
  • free,
  • unencumbered,
  • explicit,
  • awareness of when each is appropriate
  • an adequate set of profiles and profile validators
  • using ISO Schematron
  • Plurality
  •  
    Relevance to Diigo Community: of the four formats (HTML, ODF, OOXML, PDF) mentioned in this 2007 post, HTML is clearly most suitable for services/software such as Diigo.
1 - 2 of 2
Showing 20 items per page