Ambercite's Ben Palmer, our Director of Software Development, presented on 'Big Data" to the Churchill Club this week (a group focused on IT and startups). Ben was standing in for our Managing Director Doris Spielthenner, who was unavoidably detained elsewhere working on another big data project for the Australian government.
Ben was one of three excellent speakers. Ian Holsman, consultant & Former Chair of the Apache Hadoop project, spoke about the work he had done on data mining for a large consumer news website. This at one stage included real time feedback from the newspieces they were posting on the website, allowing the website to fine tune articles of interest for their audience.
Peter Buckingham from Spectrum Analysis Australia spoke the work he did in predicitng the best location for new franchise outlets, based on sophisticated data analysis. He also discussed a recent court case in Australia, where a franchisor was successfully sued by a franchisee over misleading representation made by the former over likely sales for a new franchise. When asked on what basis these predictions were made, the franchisor admitted to 'scientific guessing'. The judge was not impressed.
Ben spoke about some earlier work he and his team had done for the Australian government on predicting air traffic movement all over Australia, and in particularly deviations from expected movements, in real time, i.e. while these planes were in the air. Ben also previewed recent developments in our patent searching software.
Despite the variation in areas of interest, some common themes emerged from all three speakers and the followup discussion:
- The real value in big data is not the data itself, but what we do with it. Visual presentation of the resulting analysis can help end users make the most of this value.
- Data quality, even from reported reliable sources, can be very patchy. There will be lots of hole. This in turn creates opportunities to identify this missing data, which can be very important.
- Timely analysis can be very important
All of these concepts have been taken on board in our development program:
- We have been told that our visualisations are the best in the business, but we aim to go beyond anything we have done before.
- Like others, we also need to clean up the data we are using. We are tackling the problem of missing data in ways that we think will surprise and delight many people. Certainly our internal panel of patent attorney testers are more than excited by what we are showing them, and the additional information being made available when compared to other patent data sources.
- And while we don't necessarily need to process patent data faster than planes can fly, we understand that clients want results in real time when use patent analysis software, and that new patent data is published every day that our clients will want to know about.
Interested in knowing more? Come back to us, and we will keep you fully informed about future product releases.