Tracking and Understanding Data

Hackathon problem statement
Tracking Data
Data that can assist in tracking corruption is often not clean, whether because of mistakes made during manual entry, the low quality of the original source, or errors introduced when scanning non-digital documents. Cleaning it requires an online platform that can import generic data sets, make them editable and taggable, and track the full change history to preserve integrity.

No online platform currently solves this problem at the scale required (data sets often contain hundreds of thousands of records). Such a platform would let users easily import a large data set and then share permissions with a large group of trusted volunteers to help clean up that data. Integrity is important, so every change and its source (the author or authors) needs to be recorded and tracked for every field and row. Finally, the platform also needs a tagging facility so that users can easily tag rows for various uses, such as flagging categorization issues, either by data or by the type of information to be extracted.
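The per-field change tracking and row tagging described above can be sketched as a small in-memory model. This is a minimal illustration rather than a proposed design: the names `Change`, `TrackedTable`, `edit`, `tag`, and `cell_history`, as well as the sample row and author, are all hypothetical, and a real platform would also need persistent storage and permission checks.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone


@dataclass(frozen=True)
class Change:
    """One immutable audit record for a single cell edit (hypothetical schema)."""
    row_id: int
    column: str
    old_value: str
    new_value: str
    author: str
    timestamp: datetime


@dataclass
class TrackedTable:
    """Rows keyed by id, plus an append-only history and per-row tags."""
    rows: dict
    history: list = field(default_factory=list)
    tags: dict = field(default_factory=dict)

    def edit(self, row_id, column, new_value, author):
        """Change one cell and record who changed it and what it was before."""
        old_value = self.rows[row_id][column]
        if old_value == new_value:
            return  # nothing changed, so nothing is logged
        self.history.append(
            Change(row_id, column, old_value, new_value, author,
                   datetime.now(timezone.utc))
        )
        self.rows[row_id][column] = new_value

    def tag(self, row_id, label):
        """Attach a free-form tag to an existing row."""
        if row_id not in self.rows:
            raise KeyError(row_id)
        self.tags.setdefault(row_id, set()).add(label)

    def cell_history(self, row_id, column):
        """Return every recorded change for one cell, oldest first."""
        return [c for c in self.history
                if c.row_id == row_id and c.column == column]


# Illustrative sample data only; not taken from any real data set.
table = TrackedTable(rows={1: {"department": "Ministry of Wroks"}})
table.edit(1, "department", "Ministry of Works", author="volunteer_a")
table.tag(1, "government-project")
```

Because the history is append-only and each record stores the previous value, any cell can be audited or rolled back without trusting the current state of the table.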

It needs to be user-friendly enough that ordinary users can import the data (often information from spreadsheets or database table dumps) and manage communities of users who clean up this information. They should also be able to easily export the cleaned-up data for others to use. A solution to this problem can help the government clean up data before releasing it to the public. Citizens can also use it to clean and organize data to improve transparency or research corruption issues.

Additional information:

Some of the types of information we can extract from an unclean database to detect corruption:

If we can clean up this data to be more accurate while preserving integrity by logging every change (for example, classifying government departments, correcting misspellings, and tagging names of politicians or government versus private sector projects), we will be able to extract and map out corruption and abuse of power in Malaysian construction projects.
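As one illustration of the misspelling cleanup mentioned above, a volunteer-facing tool could suggest a canonical department name for a raw entry using fuzzy string matching from Python's standard `difflib` module. This is only a sketch under stated assumptions: the function name `suggest_department`, the `cutoff` of 0.8, and the list of department names are hypothetical sample values, not part of the problem statement.

```python
import difflib

# Illustrative canonical list; a real deployment would load its own reference data.
CANONICAL_DEPARTMENTS = [
    "Ministry of Works",
    "Ministry of Finance",
    "Ministry of Education",
]


def suggest_department(raw, canonical=CANONICAL_DEPARTMENTS, cutoff=0.8):
    """Return the closest canonical name for a raw entry, or None if nothing is close.

    The result is a suggestion for a human reviewer, not an automatic correction.
    """
    # Compare case-insensitively while keeping the original spelling for output.
    lookup = {name.lower(): name for name in canonical}
    # Collapse repeated whitespace, which is common in scanned or hand-typed data.
    cleaned = " ".join(raw.split()).lower()
    matches = difflib.get_close_matches(cleaned, list(lookup), n=1, cutoff=cutoff)
    return lookup[matches[0]] if matches else None


suggestion = suggest_department("ministry  of Wroks")
no_match = suggest_department("Unknown Agency XYZ")
```

Returning a suggestion rather than overwriting the value keeps a person in the loop, so each accepted correction can still be logged with its author.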


Additional hackathon problem statements

Electoral advertising

- While Colombian citizens have access to information on election campaign finances reported via the 'Cuentas Claras' website, they can't compare real spending with official ...

On Corruption Verdicts

- This index is designed to answer questions of how the public views corruption verdicts. It is a tool to measure the public perception of corruption verdict fairness nationally, as ...

Youth Community

- Based on a 2012 study by TI Hungary, young people (aged 15-29) don't feel they have the tools, equipment or knowledge they can use against corruption. Given corruption is a ...