FrontPage

Saved by Dan Zambonini
on May 22, 2008 at 5:20:22 pm

hoard.it

hoard.it is a prototype system for scraping granular, semantic data from existing (template-driven) HTML pages. To date, the prototype has been used to aggregate museum object data and 'museum listings' data, but it can be used for any other type of data.
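As a rough illustration of what scraping granular data from template-driven pages can look like, the sketch below maps CSS class names to semantic field names and collects the text inside matching elements. It is not hoard.it's actual implementation: the class names (`obj-title`, `obj-date`, `obj-maker`), the field names, and the `scrape` helper are all hypothetical, and a real template would need its own mapping.

```python
from html.parser import HTMLParser

# Hypothetical mapping: CSS classes a site's template might emit -> field names.
FIELD_CLASSES = {"obj-title": "title", "obj-date": "date", "obj-maker": "creator"}

# Elements that never have a closing tag, so they must not affect nesting depth.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "source", "track", "wbr"}


class TemplateScraper(HTMLParser):
    """Collects the text of elements whose class matches a known field."""

    def __init__(self, field_classes):
        super().__init__()
        self.field_classes = field_classes
        self.record = {}
        self._current = None  # name of the field being captured, if any
        self._depth = 0       # nesting depth inside the captured element

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        if self._current is not None:
            self._depth += 1
            return
        classes = (dict(attrs).get("class") or "").split()
        for cls in classes:
            if cls in self.field_classes:
                self._current = self.field_classes[cls]
                self._depth = 1
                self.record[self._current] = ""
                break

    def handle_endtag(self, tag):
        if tag in VOID_TAGS or self._current is None:
            return
        self._depth -= 1
        if self._depth == 0:
            # The captured element has closed: tidy the text and stop capturing.
            self.record[self._current] = self.record[self._current].strip()
            self._current = None

    def handle_data(self, data):
        if self._current is not None:
            self.record[self._current] += data


def scrape(html, field_classes=FIELD_CLASSES):
    """Return a dict of field name -> text for one template-driven page."""
    parser = TemplateScraper(field_classes)
    parser.feed(html)
    parser.close()
    return parser.record
```

Because pages generated from one template share the same markup, a single mapping of this kind could in principle be reused across every page of a site. The sketch assumes reasonably well-formed HTML.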

The prototype can be found here.

Please check out the Frequently Asked Questions first. Any questions after that? Contact Us.

Latest Updates

2008-05-22 Added 'gallery' output format, for quick overview

(e.g. http://feeds.boxuk.com/museums/xmlfeed/keyword/parachute/format/gallery)

2008-05-22 Added a temporary limit of 1,000 records per API response, to prevent anyone from bringing down the server

2008-05-22 Added 'keyword' parameter to feed; searches ALL fields

(e.g. http://feeds.boxuk.com/museums/xmlfeed/record.type/object/keyword/contraceptive/format/html)

2008-05-22 Added local caching of thumbnails to improve performance

2008-05-21 Started crawling some National Portrait Gallery objects (about 2,000 to date)
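The two example URLs above suggest that feed parameters are passed as alternating name/value path segments after `/museums/xmlfeed`. The helper below is a minimal sketch built on that assumption only: `build_feed_url` is a hypothetical name, the pattern is inferred from the examples rather than from any API documentation, and it is not known whether segment order matters or how the server expects special characters to be encoded.

```python
from urllib.parse import quote

# Base path taken from the example URLs in the updates above.
BASE_URL = "http://feeds.boxuk.com/museums/xmlfeed"


def build_feed_url(params, base_url=BASE_URL):
    """Build a feed URL from a dict of parameters.

    Assumes (from the example URLs only) that each parameter is written
    as /name/value, in the order given.
    """
    segments = []
    for name, value in params.items():
        # Percent-encode each segment so that a "/" in a value cannot
        # be mistaken for a segment separator.
        segments.append(quote(str(name), safe=""))
        segments.append(quote(str(value), safe=""))
    return "/".join([base_url.rstrip("/")] + segments)


if __name__ == "__main__":
    # Reproduces the two example URLs from the updates.
    print(build_feed_url({"keyword": "parachute", "format": "gallery"}))
    print(build_feed_url({"record.type": "object",
                          "keyword": "contraceptive",
                          "format": "html"}))
```

The function only builds the URL string and makes no request, so it does not depend on the prototype server being reachable.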
