[Home] [By Thread] [By Date] [Recent Entries]

  • From: "Liam R. E. Quin" <liam@f...>
  • To: bryan rasmussen <rasmussen.bryan@g...>, xml-dev<xml-dev@l...>
  • Date: Thu, 25 Jun 2020 00:38:05 -0400

On Wed, 2020-06-24 at 10:52 +0200, bryan rasmussen wrote:
> I'm looking for either a tool that when given a complete html
> document
> or a fragment thereof will generate an rdf description of the tree.
> If not a tool documents/studies showing how to do it.

I seem  to remember Henry Thompson (or Richart Tobin) at Edinburgh
wrote a tool to convert from a post-schema-validation "infoset" into
rdf. The results are spectacularly impractical - for example, as i
recall, each c h a r a c t e r becomes a separate information item and
hence is the subject of a triple describing its location, considerably
complicating string search. A quick search found an XML infoset to RDF
script in Python.

There isn't a single tree.

Maybe you want an RDF representation of the HTML 5 DOM that would be
built?

For example, should namespaces be normalised as per HTML 5, and should
missing "tbody: elements be inserted?

What is your goal?

Liam


-- 
Liam Quin, https://www.delightfulcomputing.com/
Available for XML/Document/Information Architecture/XSLT/
XSL/XQuery/Web/Text Processing/A11Y training, work & consulting.
Barefoot Web-slave, antique illustrations:  http://www.fromoldbooks.org



[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index]


Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member