I was recently part of a team of volunteer data scientists from DataKind UK who helped the NGO Global Witness to analyse the UK corporate ownership register. The results have been published in a variety of places.
Blog: How we mined the world’s first open data register of company control | Global Witness
Full report: The Companies We Keep | Global Witness
PyData London talk: Searching for Shady Patterns: Shining a light on UK corporate ownership - Adam Hill - YouTube
Hey Adam - thanks for sharing. I'm a big fan of your work on this!
I'd love to chat with you sometime about the deduplication problem you mention, as this comes up a lot.
I played around with this dataset (as well as property ownership data) at NICAR last year: https://www.ire.org/events-and-training/event/3189/3680/
Leila Haddou from the Times is looking at similar data using Neo4j as well.
Thanks for submitting!
I’ve added a tag that allows your blog to be displayed on the community home page!
Great work. Here in Belgium we have http://openthebox.be leading a similar effort. @William_Lyon we really should start looking into doing this at a European level.
Great work. In Italy, we have built some solutions for this kind of problem (using Neo4j since 2012).
You can find more on