Paper with an order of magnitude speed increase for parsing JSON!

From: Rick Jelliffe <rjelliffe@allette.com.au>
To: xml-dev <xml-dev@lists.xml.org>
Date: Wed, 2 Jan 2019 22:00:17 +1100

Play the video

Happy New Year everyone!

Readers of this list may be amused by the paper at https://blog.acolyer.org/2018/08/20/filter-before-you-parse-faster-analytics-on-raw-data-with-sparser/ which promises to "deliver an order-of-magnitude speed-up" on JSON parsing! Gosh! Now JSON parsing speed is such a problem!

Except, of course, the paper does no such thing. It filters out uninteresting files, so that they don't need to be parsed in the first place. (It gives a pre- filter that uses SIMD parallel n-grams (2, 4, 8) similar to Bloom filters with various neat twiddles so that JSON documents that don't include some n-grams can be rapidly excluded from parsing. ) It does not speed up parsing at all, it just excludes more documents from parsing. (Isn't it bait-and-switch when you promise something but it turns out to be something diifferent?)

Anyway, of course, the technique is general and can be equally applied to (canonicalized or standalone) XML documents. But I wonder whether this adds some light to the problem of XML parsing speed, for situations where you are looking through lots of records: has the old answer of preprocessing files through grep (etc) to find candidates now respectable again?

Rick

Follow-Ups:
- Re: Paper with an order of magnitude speed increase forparsing JSON!
  - From: "Liam R. E. Quin" <liam@fromoldbooks.org>

[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index]

PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Subscribe in XML format

RSS 2.0
Atom 0.3

Stylus Studio has published XML-DEV in RSS and ATOM formats, enabling users to easily subcribe to the list from their preferred news reader application.

Stylus Studio Sponsored Links are added links designed to provide related and additional information to the visitors of this website. they were not included by the author in the initial post. To view the content without the Sponsor Links please click here.

XML Editor - Download a 15 Day Free Trial Now >

See What's New in Stylus Studio >

Buy Stylus Studio - XML Editor - Now >