Content Reprocessing in Bulk

At last week’s 2007 Mark Logic User Conference, I talked about bulk reprocessing of XML content. The problem is simple to state: you have anywhere from 100 GB to 100 TB of XML, and you need to make a small change to each and every document. The solution is not so simple.

As part of the talk, I demonstrated and released a tool for this, called Corb (or “CoRB”, if you prefer). Hopefully this will save someone else from re-inventing the wheel.

Oh, and I talked about scalability, too. How many TB of XML would you like?
