It's the sources, stupid
The Economist famously stated in 2017 that data was then more valuable than oil. Whether or not you were initially – or have been subsequently – convinced by this provocative claim, there is widespread acceptance that data is a highly valuable resource, particularly for businesses. Data has become a key driver for business decision-making at the strategic, operational and tactical levels.¹
However, it is refreshing that, since the Economist article was published in 2017, the conversation has moved beyond ‘big data’ as an end in itself. It is of course relatively easy for companies to accumulate huge amounts of data, but if they then have to employ vast teams to clean, analyse and extract insights from it, the value of the exercise may be questionable. Instead, a competitive edge can be achieved by companies that draw on sophisticated analytics and high-quality datasets, which allow insights and conclusions to be drawn more readily.
Data quality has always been central to Diligencia’s mission – not just in the authenticity of the data and how it is sourced (from official sources only) but also in the way we then structure and curate our information. One approach is to scrape and compile data from multiple sources, with varying degrees of freshness and accuracy, and then leave users to draw their own conclusions. We believe that clearly sourced, reliable data that is consistent, clean and connected is ultimately more valuable to our clients.
For example, we have around 40 tests built into our platform, which all company and individual profiles must pass before being published on ClarifiedBy. These rules have been designed to ensure:
- Completeness: we establish that key fields such as directors, shareholders, and company identifiers are fully populated before publishing each profile
- Integrity: ranging from the simple (e.g. shareholdings cannot exceed 100%) through to the more technical (e.g. sole proprietorships cannot have more than one shareholder), a number of these tests ensure our information satisfies the demands of our discerning clients
- De-duplication: not always easy given the challenges of Arabic to English transliteration and translation, but we dream of a world without false positives, and do our utmost to ensure that companies and individuals are not recorded twice in our database. This is also the key to producing our network diagrams
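For readers who like to see rules of this kind written down, here is a minimal sketch in Python of what three such pre-publication checks could look like. It is purely illustrative and not a description of how ClarifiedBy is built: the `Profile` and `Shareholder` classes, the field names, the error messages and the crude name normalisation in `dedup_key` are all assumptions made for the example.

```python
import re
import unicodedata
from dataclasses import dataclass, field


@dataclass
class Shareholder:  # hypothetical record shape
    name: str
    percent: float


@dataclass
class Profile:  # hypothetical record shape
    company_id: str
    name: str
    legal_form: str
    directors: list = field(default_factory=list)
    shareholders: list = field(default_factory=list)


def check_completeness(profile):
    """Key fields must be populated before a profile is published."""
    errors = []
    if not profile.company_id.strip():
        errors.append("missing company identifier")
    if not profile.directors:
        errors.append("no directors recorded")
    if not profile.shareholders:
        errors.append("no shareholders recorded")
    return errors


def check_integrity(profile):
    """Simple and more technical consistency rules."""
    errors = []
    total = sum(s.percent for s in profile.shareholders)
    if total > 100.0 + 1e-9:  # small tolerance for rounding
        errors.append("shareholdings exceed 100%")
    if profile.legal_form == "sole proprietorship" and len(profile.shareholders) > 1:
        errors.append("sole proprietorship with more than one shareholder")
    return errors


# Assumed list of tokens to ignore when comparing names; a real system
# would need far more careful handling of transliteration variants.
NOISE_TOKENS = {"al", "el", "llc", "co", "ltd"}


def dedup_key(name):
    """Reduce a transliterated name to a rough comparison key."""
    text = unicodedata.normalize("NFKD", name)
    text = "".join(c for c in text if not unicodedata.combining(c))
    text = re.sub(r"[^a-z0-9 ]", " ", text.lower())
    return " ".join(t for t in text.split() if t not in NOISE_TOKENS)


def validate(profile, seen_keys):
    """Run all checks; an empty list means the profile passes."""
    errors = check_completeness(profile) + check_integrity(profile)
    if dedup_key(profile.name) in seen_keys:
        errors.append("possible duplicate of an existing profile")
    return errors
```

In this sketch a profile would only be published when `validate` returns an empty list, and `seen_keys` would hold the `dedup_key` of every name already in the database. Exact-match keys like this are only a first pass: they will catch "Al-Noor Trading LLC" against "El Noor Trading Co." but not every spelling variant, which is why de-duplication remains the hardest of the three.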
As our database continues to expand, data quality becomes ever more important for us at Diligencia – particularly as we look to build tools and additional datasets that bring the relevant information and insights to the surface. To extend the oil analogy, why accept crude when you can have the refined product?
We post blogs like this one on the Diligencia website and on our platform ClarifiedBy.com. Through collecting and analysing vast quantities of data, we have a unique perspective on a range of topics, from due diligence best practice to doing business in the MEA region – a perspective we feel compelled to share.
To find out how to become a member of our platform, or if you have due diligence questions relating to Africa and the Middle East, please send us an email.