|
[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message] Re: What is the general direction you are seeing these daysto
On Sun, Mar 8, 2015 at 3:00 PM, Ihe Onwuka <ihe.onwuka@gmail.com> wrote:
Actually I can concretize that. It describes most of the databases behind most public movie API's, e.g http://omdbapi.com/. Typically such databases consist of pieces of information cobbled together from various places usually they run on a LAMPish/Java platform. There are research papers detailing the challenges encountered in such an endeavour Matching of MovieLens and ImDb titles. Extraction and integration of MovieLens and ImDb data I would expect similar problems (matching to support correct aggregation, deduplication, disambiguation) to manifest in many Big Data projects simply because data integration has the potential to add so much value. A "Small Data" project may morph into a Big Data one once you have assembled information from enough sources/silos. I've been allowed to approach this problem with NXD technology. Schemalessness - being able to ingest data from anywhere in any format - and the ability to defer decisions on data management due to a superior extract and transform capability, has been key to dodging many of the problems that the MovieLens group experienced. It has also allowed the focus to shift to more ambitious higher value problems, although challenges remain.
[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] |
PURCHASE STYLUS STUDIO ONLINE TODAY!Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced! Download The World's Best XML IDE!Accelerate XML development with our award-winning XML IDE - Download a free trial today! Subscribe in XML format
|
|||||||||

Cart








