Characterizing Data Ecosystems to Support Official Statistics with Open Mapping Data for Reporting on Sustainable Development Goals
<p>An integrated framework to characterize data ecosystems.</p> "> Figure 2
<p>Political economy analysis of water supply in Malawi. The dimensions of the data ecosystem are shown in bold. There is a large overlap between the actors involved in the WASH sector and those that have data, but some alternative data providers are outside the group of actors directly involved and these are not depicted. CSO Civil Society Organization, MoAIWD Ministry of Agriculture, Irrigation and Water Development; DoIWD Department of Irrigation and Water Development. The diagram is developed by the authors and builds on insights from [<a href="#B41-ijgi-07-00456" class="html-bibr">41</a>].</p> "> Figure 3
<p>Duplicates in the dataset of DoIWD (left) and duplicates when comparing dataset DoIWD (red bullet) and Madzi Alipo (green bullet). Source: drone imagery Madzi Alipo.</p> "> Figure 4
<p>Spatial coverage of the data sets of the main actors in the WASH data ecosystem. NSO is not included as the data was only available at national level.</p> "> Figure 5
<p>Spatial coverage of attribute information available in all datasets combined.</p> "> Figure 6
<p>(<b>a</b>) Water points in Malawi extracted from the Madzi Alipo, WPDx, OSM, DoIWD, DoS, MCRS, 510, PCI and CJF data sources; (<b>b</b>) Water points in relation to population density in Southern Malawi (source: MASDAP, 2014).</p> ">
Abstract
:1. Introduction
Background Data for Sustainable Development Goals
2. Literature Review
An Integrated Data Ecosystem Framework
- Guardian is responsible for facilitating distribution of datasets and information products (in emergencies for example).
- Sponsor is responsible for identifying and liaising with relevant sources to analyse, collate, clean and achieve consensus around a specific dataset or information product.
- Source: Designated source or owner of a dataset, fully responsible for the development, maintenance and metadata associated with a dataset and control distribution restrictions.
3. Materials and Methods
3.1. Case Study Selection
3.2. Data Collection
4. Results
4.1. Political Economy Analysis of Water Supply Policies and Programming in Malawi
4.2. Actors and Roles
4.3. Data Supply
4.3.1. Quality
4.3.2. Cost of Data Extraction
4.4. Data Infrastructure
4.5. Data Demand
4.6. Data Ecosystem Governance
5. Discussion
6. Conclusions
- To lessen the overall fragmentation, we recommend that an NSO takes on the coordinating role of characterizing the data ecosystem on a continuous basis as actors will come and go and data supply and demand will fluctuate;
- To increase data adoption and awareness, we recommend that efforts are taken to eliminate the duplication of data across multiple platforms and to increase the quality and usefulness of the data by supplying more metadata;
- To stimulate the growth of data supply in the data ecosystem, we recommend that mechanisms are put in place (1) to empower multilateral donors to enforce the opening of data collected during projects and (2) to incentivize data sharing among stakeholders by offering value in return;
- To support the development and evolution of the data ecosystem, we recommend fostering data expertise and capabilities among local actors, as opposed to international actors, to obtain and integrate diverse data sources for SDG monitoring;
Author Contributions
Funding
Acknowledgments
Conflicts of Interest
References
- Hancioglu, A. The Multiple Indicator Cluster Surveys, the Role of Household Surveys in Support of the Sustainable Development Agenda. 2016. Available online: https://unstats.un.org/sdgs/files/meetings/iaeg-sdgs-meeting-04/UNICEF_The MICS Program_Household Surveys Side Event_4th IAEG-SDG.pdf (accessed on 1 November 2018).
- Kitchin, R.; Lauriault, T.P. Small data in the era of big data. GeoJournal 2015, 80, 463–475. [Google Scholar] [CrossRef]
- UN Inter-Agency Expert Group on the Data Revolution for Sustainable Development. A World that Counts—Mobilising the Data Revolution for Sustainable Development; UN Inter-Agency Expert Group on the Data Revolution for Sustainable Development: New York, NY, USA, 2014. Available online: http://www.undatarevolution.org/wp-content/uploads/2014/12/A-World-That-Counts2.pdf (accessed on 1 November 2018).
- GRID3. 2018. Available online: https://unstats.un.org/unsd/statcom/49th-session/side-events/20180307-1M-GRID3-Launch-Concept%20Note.pdf (accessed on 1 November 2018).
- Thinyane, M.; Goldkind, L.; Lam, H.I. Data Collaboration and Participation for Sustainable Development Goals—A Case for Engaging Community-Based Organizations. J. Hum. Rights Soc. Work 2018, 3, 44–51. [Google Scholar] [CrossRef]
- Thinyane, M. Small data and sustainable development—Individuals at the center of data-driven societies. In Proceedings of the IEEE ITU Kaleidoscope: Challenges for a Data-Driven Society (ITU K), Nanjing, China, 27–29 November 2017; pp. 1–8. [Google Scholar]
- Clark, C.; Maron, M.; Patel, D.; Radford, T.; Soden, R.; Uithol, P. Open Mapping for the SDGs: A Practical Guide to Launching and Growing Open Mapping Initiatives at the National and Local Levels. 2016. Available online: https://opendri.org/resource/open-mapping-for-the-sdgs-a-practical-guide/ (accessed on 1 November 2018).
- Liu, W.; Dugar, S.; McCallum, I.; Thapa, G.; See, L.; Khadka, P.; Shakya, P. Integrated Participatory and Collaborative Risk Mapping for Enhancing Disaster Resilience. ISPRS Int. J. GeoInf. 2018, 7, 68. [Google Scholar] [CrossRef]
- See, L.; Mooney, P.; Foody, G.; Bastin, L.; Comber, A.; Estima, J.; Rutzinger, M. Crowdsourcing, Citizen Science or Volunteered Geographic Information? The Current State of Crowdsourced Geographic Information. ISPRS Int. J. GeoInf. 2016, 5, 55. [Google Scholar] [CrossRef]
- Mooney, P.; Winstanley, A. Is VGI Big Data? In Proceedings of GISRUK 2015, Leeds, UK, 15–17 April 2015; pp. 448–453. [Google Scholar]
- Maaroof, A. Big Data and the 2030 Agenda for Sustainable Development. Available online: http://www.unescap.org/sites/default/files/Final Draft_ stock-taking report_For Comment_301115.pdf (accessed on 1 November 2018).
- Moorosi, N.; Thinyane, M.; Marivate, V. A Critical and Systemic Consideration of Data for Sustainable Development in Africa. In International Conference on Social Implications of Computers in Developing Countries; Springer: Cham, Switzerland, 2017; pp. 232–241. [Google Scholar]
- Parsons, M.A.; Godøy, Ø.; Ledrew, E.; De Bruin, T.F.; Danis, B.; Tomlinson, S.; Carlson, D. A conceptual framework for managing very diverse data for complex, interdisciplinary science. J. Inf. Sci. 2011, 37, 555–569. [Google Scholar] [CrossRef] [Green Version]
- Susha, I.; Janssen, M.; Verhulst, S. Data Collaboratives as a New Frontier of Cross-Sector Partnerships in the Age of Open Data: Taxonomy Development. In Proceedings of the 50th Hawaii International Conference on System Sciences, Hilton Waikoloa Village, HI, USA, 4–7 January 2017; pp. 2691–2700. [Google Scholar]
- Nardi, B.; O’Day, V.L. Information Ecologies: Using Technology with Heart; MIT Press: Cambridge, MA, USA, 1999. [Google Scholar]
- Dawes, S.S.; Vidiasovab, L.; Parkhimovichc, O. Planning and designing open government data programs: An ecosystem approach. Gov. Inf. Q. 2016, 33, 15–27. [Google Scholar] [CrossRef]
- Heimstädt, M.; Saunderson, F.; Heath, T. From toddler to teen: Growth of an open data ecosystem. JeDEM-eJ. eDemocr. Open Gov. 2014, 6, 123–135. [Google Scholar] [CrossRef]
- Harrison, T.M.; Pardo, T.A.; Cook, M. Creating open government ecosystems: A research and development agenda. Future Internet 2002, 4, 900–928. [Google Scholar] [CrossRef]
- Zuiderwijk, A.; Janssen, M.; Davis, C. Innovation with open data: Essential elements of open data ecosystems. Inf. Polity 2014, 19, 17–33. [Google Scholar] [CrossRef]
- Oliveira, M.I.S.; Lóscio, B.F. What Is a Data Ecosystem? In Proceedings of the 19th Annual International Conference on Digital Government Research: Governance in the Data Age (dg.o ’18), Delft, The Netherlands, 30 May–1 June 2018. Article No. 74. [Google Scholar]
- Thinyane, M.; Marcovecchio, I. Small Data for Sustainable Development Goals. 2015. Available online: http://cs.unu.edu/wp-content/uploads/ProjectBriefing_SDGsSD.pdf (accessed on 1 November 2018).
- UN Inter-Agency and Expert Group on SDG Indicators. Guiding Principles of Data Reporting and Data Sharing for the Global Monitoring of the 2030 Agenda for Sustainable Development. 2017. Available online: https://unstats.un.org/sdgs/files/meetings/iaeg-sdgs-meeting-06/2017-10-04_CCSA Guiding Principles data flows.pdf (accessed on 1 November 2018).
- Klievink, B.; Neuroni, A.; Fraefel, M.; Zuiderwijk, A. Digital Strategies in Action: A Comparative Analysis of National Data Infrastructure Development. In Proceedings of the 18th Annual International Conference on Digital Government Research, Staten Island, NY, USA, 7–9 June 2017; pp. 129–138. [Google Scholar]
- Makanga, P.; Smit, J. A review of the status of Spatial Data Infrastructure Implementation in Africa. S. Afr. Comput. J. 2010, 45, 18–25. [Google Scholar] [CrossRef]
- United Nations Development Program (UNDP). Data Ecosystems for Sustainable Development: An Assessment of Six Pilot Countries; 2017. Available online: http://www.undp.org/content/dam/undp/library/Sustainable%20Development/Data%20Ecosystems%20for%20Sustainable%20Development.pdf (accessed on 12 June 2018).
- Development Initiatives. Nepal’s Emergent Data Revolution. 2017. Available online: http://devinit.org/wp-content/uploads/2017/04/Nepals-emerging-data-revolution.pdf (accessed on 12 June 2018).
- Data Pop Alliance. Opportunities and Requirements for Leveraging Big Data for Official Statistics and the Sustainable Development Goals in Latin America, 2016. Available online: http://datapopalliance.org/wp-content/uploads/2016/05/Data-Pop-Alliance-LAC-NSO-EN.pdf (accessed on 1 November 2018).
- Feiring, B.; Thornberry, F.; Hassler, A. Building a pluralistic ecosystem of data to leave no one behind: A human rights perspective on the challenge of monitoring the Sustainable Development Goals. Stat. J. IAOS 2017, 33, 919–942. [Google Scholar] [CrossRef]
- Haak, E.; Ubacht, J.; Van den Homberg, M.; Cunningham, S.; Van den Walle, B. A framework for strengthening data ecosystems to serve humanitarian purposes. In Proceedings of the 19th Annual International Conference on Digital Government Research: Governance in the Data Age, Delft, The Netherlands, 30 May–1 June 2018; p. 85. [Google Scholar]
- Van den Homberg, M.; Monné;, R.; Spruit, M. Bridging the information gap of disaster responders by optimizing data selection using cost and quality. Comput. Geosci. 2018, 120, 60–72. [Google Scholar] [CrossRef]
- Inter-agency Standing Committee (IASC) Guidelines. Common Operational Datasets (CODs) in Disaster Preparedness and Response. 2010. Available online: http://fscluster.org/sites/default/files/documents/IASC_Guidelines_ on_Common_Operational_Datasets_in_Disaster_Preparedness_and_Response_1_Nov._ 2010%5B1%5D.pdf (accessed on 15 May 2018).
- US Intelligence Best Practices. Available online: http://wikivisually.com/wiki/Intelligence_collection_ management#Collection_department_ratings (accessed on 31 May 2017).
- United Nations Office for Disaster Risk Reduction (UNISDR). Technical Guidance for Monitoring and Reporting on Progress in Achieving the Global Targets of the Sendai Framework for Disaster Risk Reduction Collection of Technical Notes on Data and Methodology; UNISDR: Geneva, Switzerland, 2018; Available online: https://www.unisdr.org/files/54970_techguidancefdigitalhr.pdf (accessed on 1 November 2018).
- Zuiderwijk, A.; Janssen, M.; Dwivedi, Y.K. Acceptance and use predictors of open data technologies: Drawing upon the unified theory of acceptance and use of technology. Gov. Inf. Q. 2015, 32, 429–440. [Google Scholar] [CrossRef]
- Kitchin, R. The Data Revolution: Big Data, Open Data, Data Infrastructures and Their Consequences; Sage Publications Ltd.: London, UK, 2014. [Google Scholar]
- Steudler, D.; Rajabifard, A.; Williamson, I. Evaluation and performance indicators to assess spatial data infrastructure initiatives. In A Multi-View Framework to Assess SDIs; Crompvoets, J., Rajabifard, A., van Loenen, B., Fernández, T.D., Eds.; Space for Geo-Information, Wageningen University: Wageningen, The Netherlands and Centre for SDIs and Land Administration, Department of Geomatics, The University of Melbourne: Melbourne, VIC, Australia 2008; p. 193. [Google Scholar]
- Secretariat Consultative Committee for Space Data Systems (CCSDS). Reference Model for an Open Archival Information System (OAIS); CCSDS: Washington DC, USA, 2002. [Google Scholar]
- United Nations Statistics Division (UNSD). Work Plans for Tier III Indicators. 2017. Available online: https://unstats.un.org/sdgs/files/meetings/iaeg-sdgs-meeting-05/TierIII_Work_Plans_03_03_2017.pdf (accessed on 1 November 2018).
- National Statistics Office (NSO). Country Report on Support for Statistics Malawi; Paris 21; NSO: Zomba, Malawi, June 2015. [Google Scholar]
- United Nations Statistics Division (UNSD). Metadata Indicator SDG 6.1.1. 2018. Available online: https://unstats.un.org/sdgs/metadata/files/Metadata-06-01-01.pdf (accessed on 1 November 2018).
- Chowns, E.E. The Political Economy of Community Management: A Study of Factors Influencing Sustainability in Malawi’s Rural Water Supply Sector. Master’s Thesis, University of Birmingham, Birmingham, UK, 2014. [Google Scholar]
- Franks, T.; Cleaver, F. Water governance and poverty. Prog. Dev. Stud. 2007, 7, 291–306. [Google Scholar] [CrossRef] [Green Version]
- Government of Malawi. Malawi Growth and Development Strategy III (2017–2022). 2017. Available online: https://www.afidep.org/resource-centre/downloads/policy-guidelines/malawi-growth-development-strategy-mgds-iii-2017-2022/ (accessed on 1 November 2018).
- Ministry of Irrigation and Water Development. National Water Policy. 2005. Available online: http://www.300in6.org/wp-content/uploads/2014/07/3.-National-Water-Policy-by-GoM-2005.pdf (accessed on 1 November 2018).
- Battle and Mambulu. Strengthening SWA Engagement at Country Level: Malawi Case Study. Wateraid under the Leadership of Government of Malawi. 2017. Available online: https://washmatters.wateraid.org/sites/g/files/jkxoof256/files/Strengthening_SWA_engagement_at_country_level_Malawi_case_study.pdf (accessed on 15 May 2018).
- Ministry of Irrigation and Water Development. Implementation Guidelines for Rural Water Supply and Sanitation. 2010. Available online: http://www.rural-water-supply.net/en/resources/details/389 (accessed on 1 November 2018).
- Malawi Economic Justice Network. Analysis of the Water Supply Sector Financing in Malawi; Malawi Economic Justice Network: Lilongwe, Malawi, 2010. [Google Scholar]
- Gutierrez, E. Delivering pro-poor water and sanitation services: The technical and political challenges in Malawi and Zambia. Geoforum 2007, 38, 886–900. [Google Scholar] [CrossRef]
- Danert, K.; Carter, R.C.; Adekile, D.; MacDonald, A. Cost-effective boreholes in sub-Saharan Africa. In Sustainable Groundwater Resources in Africa: Water Supply and Sanitation Environment; Xu, Y., Braune, E., Eds.; CRC Press: Boca Raton, FL, USA, 2009; pp. 187–203. [Google Scholar]
- Chambers, R. Paradigm Shifts and the practice of Participatory Research and Development; Institute of Development Studies: Brighton, UK, 1994. [Google Scholar]
- Cleaver, F. Development through Bricolage: Rethinking Institutions for Natural Resource Management; Routledge: London, UK, 2012. [Google Scholar]
- Scott, O. Supporting institutionalized monitoring systems for rural water supply and sanitation in Malawi. Waterlines 2012, 31, 272–279. [Google Scholar] [CrossRef]
- ICF. 2018. Available online: https://dhsprogram.com/What-We-Do/GPS-Data-Collection.cfm (accessed on 1 November 2018).
- Department of Survey (DoS). 2018. Available online: http://www.lands.gov.mw/index.php/computerisation-of-records-and-spatial-data-sharing.html (accessed on 1 November 2018).
- Rural Water Supply Network (RWSN). Code of Practice for Cost Effective Boreholes; Rural Water Supply Network: Saint Gallen, Switzerland, 2010. [Google Scholar]
- OSMstats. 2018. Available online: https://osmstats.neis-one.org/?item=countries&country=Malawi (accessed on 1 November 2018).
- Ministry of Agriculture, Irrigation and Water Development. Malawi Water Sector Investment Plan. 2012. Available online: http://documents.worldbank.org/curated/en/764081468044978635/Final-report-volume-two (accessed on 1 November 2018).
- Akvo. 2017. Available online: https://akvo.org/blog/wpdx-improving-water-service-evidence-based-decision-making/ (accessed on 1 November 2018).
- Government of Malawi, Malawi OGP National Action Plan 2016-2018, Available online:. Available online: https://www.opengovpartnership.org/sites/default/files/Malawi_NAP_2016-2018.pdf (accessed on 1 November 2018).
- Lundell, B.; Forssten, B.; Gamalielsson, J.; Gustavsson, H.; Karlsson, R.; Lennerholt, C.; Lings, B.; Mattsson, A.; Olsson, E. Exploring health within OSS ecosystems. In First International Workshop on Building Sustainable Open Source Communities (OSCOMM 2009); University of Skövde: Skövde, Sweden, 2009; pp. 1–5. [Google Scholar]
- Kaika, M. Don’t call me resilient again: The New Urban Agenda as immunology … or … what happens when communities refuse to be vaccinated with ‘smart cities’ and indicators. Environ. Urban 2017, 29, 89–102. [Google Scholar] [CrossRef]
- Andrews, M.; Pritchett, L.; Woolcock, M. Escaping capability traps through problem driven iterative adaptation (PDIA). World Dev. 2013, 51, 234–244. [Google Scholar] [CrossRef]
- Butterworth, I.; Giles-Corti, B.; Whitzman, C. Perspective 1: Setting the Scene for the North and West Melbourne Data Integration and Demonstrator Projects; Spatial Data Access and Integration to Support Liveability; The University of Melbourne: Parkville, VIC, Australia, 2013; p. 11. [Google Scholar]
- Cho, A.; Webb, R.; Levy, M.; Kim-blanco, P.; Fischer, A. The Value of Water Information; Report 20; Xylem: New York, NY, USA.
- Marcovecchio, I.; Thinyane, M.; Estevez, E.; Fillottrani, P. Capability maturity models towards improved quality of the sustainable development goals indicators data. In Proceedings of the 2017 ITU Kaleidoscope Academic Conference: Challenges for a Data-Driven Society (ITU K), Nanjing, China, 27–29 November 2017; pp. 1–8. [Google Scholar]
- See, L.; Fritz, S.; Moorthy, I.; Danylo, O.; van Dijk, M.; Ryan, B. Using Remote Sensing and Geospatial Information for Sustainable Development. In From Summits to Solutions: Innovations in Implementing the Sustainable Development Goals; Desai, R., Kato, H., Kharas, H., McArthur, J.W., Eds.; Brookings Institution Press: Washington, DC, USA, 2018. [Google Scholar]
- Stevens, F.R.; Gaughan, A.E.; Linard, C.; Tatem, A.J. Disaggregating Census Data for Population Mapping Using Random Forests with Remotely-Sensed and Ancillary Data. PLoS ONE 2015, 10, e0107042. [Google Scholar] [CrossRef] [PubMed]
- Lang, S.; Kienberger, S.; Tiede, D.; Hagenlocher, M.; Pernkopf, L. Geons-domain-specific regionalization of space. Cartogr. Geogr. Inf. Sci. Geons Domain Specif. Reg. Space 2014, 41, 214–226. [Google Scholar] [CrossRef]
- Ahasanul, H. 2016. Available online: http://ahasanulhoque.com/geodash-a-new-approach-to-opengeospatial-data-for-bangladesh/ (accessed on 1 November 2018).
- Verplanke, J.; Georgiadou, Y. Wicked Water Points: The Quest for an Error Free National Water Point Database. ISPRS Int. J. GeoInf. 2017, 6, 244. [Google Scholar] [CrossRef]
- Van den Homberg, M.; Visser, J.; van der Veen, M. Unpacking Data Preparedness from a humanitarian prioritization perspective: Towards an assessment framework at subnational level. In Proceedings of the 14th ISCRAM Conference, Albi, France, 21–24 May 2017. [Google Scholar]
Dimensions and Their Characteristics | Description |
---|---|
Actors and roles | |
Diversity of data providers (producers) | Which organizations/entities produce and provide the data? One or multiple providers, from same or different sectors |
Target user group (consumers) | What kind of organizations can or do use the data? Academic, Commercial, Governmental, Non-Profit, Citizens. Global, national, local level. |
Facilitation (intermediaries) | Who facilitates the exchange if applicable? Self-facilitated, Intermediary with data-related functions, Intermediary with organizational functions |
Data supply | |
Costs of data extraction | |
Structuredness of data | The format of the data; how easy it is to use it. |
Degree of access to data | How much of the data is opened? Real-time direct access to (a copy of) raw data, access to modified or enriched data, access to outcomes of processed data, data shared as open data |
Quality | |
Timeliness | A combination of when the data set was last updated and how long a data set remains representative of the reality (retention period). |
Content accuracy | Is the content confirmed by other independent sources, logical in itself and consistent with other information on the subject? |
Source reliability | Is it a reliable source, where there is no doubt of authenticity, trustworthiness or competency? Does the source have a history of complete reliability? |
Granularity and spatial coverage | Up to which administrative level is a data set available (granularity) and at which spatial coverage (for the whole country at admin-level 3? Or only for a part of the country?) |
Content of data | What themes does the data cover? Demographic, economic, social and environmental for example. |
Data infrastructure | |
Classification of the infrastructure | Data holder, data archive, catalogue, single-site repository, multi-site repository or cyber-infrastructure. |
Technical architecture | What software uses the platform/infrastructure? Are there clear data and technical procedures in place? |
Functionalities | Uploading, downloading, possible to give and receive feedback, analysis possibilities |
Ease of use | To what extent it is easy to use the functionalities? |
Adoption | Number of users, data sets uploaded and downloaded. |
Data demand | |
Research or policy problem | Which problem does the data address? Specified, Unspecified |
Expected outcome of data use | Which desired outcome is in focus of the data use? Policy intervention (prediction and alerts, needs-based planning, capacity building, monitoring), Data science, Data-driven innovation |
Purpose of data use | To what extent does the purpose of the data use differ from the purpose for which the data was initially collected? Primary, Secondary, Tertiary, End use |
Data ecosystem governance | |
Participatory capacity | For all actors, suppliers, users and intermediaries: technical expertise on how to use data infrastructure, data management knowledge of aspects such as data quality and operational knowledge of how to harness the data ecosystem for decision making. |
Continuity of collaboration between users and suppliers | Which organization is responsible for the data infrastructure and does it have long-term commitment and resources available for continued collaboration? When do users and suppliers work together? On demand, Event-based, Continuous. |
Communication | How is a collaborative and interactive environment created? What is the transparency and feedback mechanism? |
Incentive to share data | Which incentives do data producer or intermediaries have to share data? Closely related to incentive to use data. For example, funding, legal or for social good reasons. |
User selection | How is access to data provided? On agreement or application basis, open. |
Incentive to use data | Which incentives do data users have to use the data? Tangible, intangible. |
Collaboration among data users | To what extent the users collaborate with one another in data analysis? One user, self-selected analysis by several users, collaborative analysis by several users. |
Characteristic | Score | Explanation |
---|---|---|
Costs of data extraction | ||
Structuredness of data | 1 | Data is provided ready to use |
2 | Little pre-processing required to make data ready for use | |
3 | Much pre-processing required to make data ready for use | |
4 | Data is not usable | |
Degree of access to data | 1 | Open data/unrestricted access |
2 | Restricted access, but access granted after registration | |
3 | Restricted access, but access can be requested, not always granted | |
4 | There is no access to downloadable data from this source | |
Quality | ||
Timeliness | 1 | Report date of data falls within retention period, or no functionality characteristic |
2 | Report date of data does not fall within retention period | |
Content accuracy | 1 | Confirmed; Confirmed by other independent sources; logical in itself; consistent with other information on the subject |
2 | Probably true; Not confirmed; logical in itself; consistent with other information on the subject | |
3 | Possibly true; Not confirmed; reasonably logical in itself; agrees with some other information on the subject | |
4 | Doubtfully true; Not confirmed; possible but not logical; no other information on the subject | |
5 | Improbable; Not confirmed; not logical in itself; contradicted by other information on the subject | |
6 | Cannot be judged; no basis exists. | |
Source reliability | 1 | Reliable; No doubt of authenticity, trustworthiness or competency; has a history of complete reliability. Based on extensive consultation of and shared, coordinated and used by national institutions. Clear responsibilities for decision-making, planning and storing data. |
2 | Usually reliable; Minor doubt about authenticity, trustworthiness or competency; has a history of valid information most of the time. Based on consultation of and shared, coordinated and used by national institutions. Some clear responsibilities decision-making, planning and storing data. | |
3 | Fairly reliable; Doubt of authenticity, trustworthiness or competency; but has provided valuable information in the past. Some consultation, sharing, coordination or usage by national institutions. Few responsibilities for decision-making, planning and storing data. | |
4 | Not usually reliable; Significant doubt about authenticity, trustworthiness or competency; but has provided valuable information in the past. Very limited consultation, sharing, coordination or usage by national institutions. Very limited responsibilities decision-making, planning and storing data. | |
5 | Not reliable; Lacking in authenticity, trustworthiness and competency; history of invalid information. No consultation, sharing, coordination or usage by national institutions. No clear responsibilities for decision-making, planning and storing data. | |
6 | Cannot be judged; no basis exists. | |
Granularity | 1 | Admin level 4 |
2 | Admin level 3 | |
3 | Admin level 2 | |
4 | Admin level 1 | |
5 | National level | |
Spatial coverage | 1 | Whole area of interest covered (country) |
2 | One or more Admin level 1 covered | |
3 | One or more Admin 2 covered | |
4 | One or more Admin 3 covered | |
Content of data | 1 | 9–11 attributes |
2 | 7–8 attributes | |
3 | 5–6 attributes | |
4 | 3–4 attributes | |
5 | 0–2 attributes |
Actors | Madzi Alipo | WPDx | DoIWD | PCI | NSO | DoS | MRCS | 510 | OSM | CJF on mWater |
---|---|---|---|---|---|---|---|---|---|---|
Diversity of data providers | Multiple data providers (initiative of one organization but includes data from 29 actors). Local level. | Multiple data providers (initiative of one organization but includes data from 8 other actors and some but not all Madzi Alipo data). Global and national level. | Only one provider. National and local level. | Only one provider. Local level. | One provider (DHS). | Multiple providers within government (MoU with six departments). For water points only one provider. | Only one provider | Only one provider | Multiple OSM users mapped utilities | Multiple data providers |
Target user group | Non-Profit/Local stakeholders | Non-Profit/Local stakeholders | Government: MoAIWD and DoIWD | Non-Profit partners | Government, donors and NGOs. | Focus government, but also shares via MASDAP. | Non-Profit, within own organization. | Unspecified | Government | |
Facilitation (by an intermediary) | Intermediary with data-related and organizational functions (Madzi Alipo participates in sector M&E/information systems meeting) | Intermediary with data-related functions. Organizational functions mostly towards global level (part of global working groups). | Self-facilitated, but with active role in convening WASH actors. | Intermediary with organizational functions (PCI involved in public private partnerships with other parties through GDA) | Intermediary with data and organizational functions: ICF (sponsored by USAID) | Self-facilitated in terms of water point data set (not on MASDAP) | Self-facilitated | Intermediary with data-related functions, no direct link to WASH groups | Intermediary with data-related functions |
Actors | Quality | Overall Quality (1–15) | ||||||
---|---|---|---|---|---|---|---|---|
Timeliness | Source Reliability (1–6) | Content Accuracy (1–6) | Granularity and Spatial Coverage | Content of Data (1–5) | ||||
Date of Source | Retention (1–2) | Granularity (1–5) | Spatial Coverage (1–4) | |||||
NSO | 2015–2016 | 2 | 1 | 1 | 5 | 1 | 5 | 15 |
Madzi Alipo | Multiple | 2 | 1 | 1 | 1 | 1 | 2 | 8 |
WPDx | Multiple | 2 | 2 | 1 | 1 | 1 | 2 | 9 |
DoIWD | 2002–2004 | 2 | 1 | 3 | 1 | 1 | 2 | 10 |
OSM | Daily (14/6/2018) | 1 | 1 | 1 | 1 | 2 | 5 | 11 |
MRCS | February–April 2108 | 1 | 1 | 1 | 1 | 3 | 4 | 11 |
PCI | 2003, 2016 | 2 | 1 | 1 | 1 | 4 | 3 | 12 |
510 | August 2017 | 1 | 1 | 2 | 1 | 4 | 4 | 13 |
DoS | 2012–2015 | 2 | 1 | 1 | 1 | 3 | 5 | 13 |
CJF | Unknown | 2 | 2 | 2 | 1 | 2.5 | 5 | 14.5 |
Actors | Costs | ||
---|---|---|---|
Level of Structuredness (1–4) | Degree of Access to Data (1–4) | Overall Costs (1–8) | |
NSO | 4 | 1 | 5 |
Madzi Alipo | 2 (csv) | 2 | 4 |
WPDx | 2 (csv) | 1 | 3 |
DoIWD | 2 (shapefile) | 3 | 5 |
OSM | 1 (shapefile) | 1 | 2 |
MRCS | 1 (shapefile) | 3 | 4 |
PCI | 1 (shapefile) | 3 | 4 |
510 (NLRC) | 2 (GeoJSON) | 3 | 5 |
DoS | 2 (csv) | 3 | 5 |
CJF | 2 (shapefile) | 3 | 5 |
Definitions Water Point Sustainability | Related Attributes in Datasets |
---|---|
Components | |
Functionality at time of survey | Functionality, visit time, reporter |
Frequency of breakdown | |
Duration of breakdown | |
Days operational since installation | |
Quality of water | Quality of water |
Quantity of water | |
Proximate variables | |
Design and installation factors | |
Type of Technology | Type of waterpoint, Installer/funder |
Quality of Installation | |
User numbers | GPS location, Access (located on premises or not). |
System age | Install year |
Post-construction factors | |
Frequency of maintenance | |
Availability of spare parts | |
Availability of maintenance and repair skills | Management of the water point |
Availability of funds for maintenance and repair | Whether the water point is a free service or users have to pay. |
Availability of external support | |
Incidence of theft |
Actor | Madzi Alipo | WPDx | DoIWD | PCI | DoS |
---|---|---|---|---|---|
Classification of the infrastructure | Multi-site repository | Multi-site repository | Data archive | Data holder | Data archive; although some data on MASDAP |
Technical architecture/software | Madzi Alipo app to collect data, database that contains data, website to access data, API available | Data gathered using various collection methods, database that contains the data, website to access the data, API available | Government has data in their own database, dataset in SHP format, obtained via USB transfer | Dataset in SHP format obtained via USB transfer | Government has data in their own database, dataset in CSV format, obtained via USB transfer |
Functionalities | App: report water points, look for closest water point. Website: make reports, select data based on multiple characteristics, download data in CSV format, visualize data. | Website: download data for specific country in CSV format, or use ‘data playground’ | Data can be loaded into a GIS and analysed/visualized | ||
Ease of use | Registration required to download data, website and app easy to understand, CSV can be opened in a GIS | Everyone can download data, however ‘data playground’ on the website is quite cluttered and unclear, CSV can be opened in a GIS | Not easy to obtain data, data cleaning required before data is usable in GIS | Data not accessible for everyone, dataset consists of four separate shapefiles | Data not accessible for everyone, can be opened in a GIS |
Adoption | Around 300 users | Large number of users worldwide (users shared 300.000 water points in over 30 countries). No user data for Malawi. | Few users (because data is not open data and not distributed widely) | ||
Actor | NSO | MCRS | 510 | OSM | CJF on mWater |
Classification of the infrastructure | Single-site repository | Data holder | Data holder | Multi-site repository | Data archive (as not yet completely accessible on multi-site repository) |
Technical architecture/software | DHS program website with data download and recoding options. | Data owned by and in database of MRCS, dataset in SHP format, obtained via USB transfer | Data in database of 510, obtained via email transfer | Data gathered remotely through OSM, extracted through QGIS and Overpass query, also API available | Data collected through app AkvoFlow or mWater app, published in database mWater, online data portal mWater, also API available |
Functionalities | Several online tools to work with the survey data and support as to how to interpret and analyse them. | Data can be loaded into a GIS and analysed/visualized | Data can be loaded into a GIS and analysed/visualized | Multiple options to extract data from OSM (for example through QGIS, or through Overpass-turbo) | mWater portal and app offers different dashboards, consoles and indicator library. Includes several functionalities per waterpoint and two-level approval mechanism. |
Ease of use | Data not accessible for everyone, only after screening. No dashboard, analysis should be done by user. | Data not accessible for everyone, dataset consists of six separate shapefiles | Dataset consists of two shapefiles and a GeoJSON file and contains other points of interest besides water points, so data cleaning required | Everyone can access data, can be opened in a GIS, data extraction is quite easy, but some knowledge of GIS is required | Easy to use. |
Adoption | No data available. | Few users (because data is not open data and not distributed widely) | OSM community in Malawi around 100 members, at peak level 125 nodes per day mapped. OSM contributors from outside Malawi can come from the 4 million OSMers worldwide. | No data available. |
Actor | NSO | Madzi Alipo | WPDx | DoIWD | DoS | PCI/MCRS/510 | OSM | CJF on mWater |
---|---|---|---|---|---|---|---|---|
Participatory Capacity | High level of data (statistical) expertise. Translation into operational knowledge through cooperation with responsible ministries. | Madzi Alipo and WPDx have high technical and data management expertise enabling them to not only participate in the data ecosystem but also to grow it by enabling actors to contribute even with low participatory capacity (easy to use app, manuals). Madzi Alipo also translates data to operational knowledge. WPDx is less tailored to operationalization in Malawi context. | DoIWD and DoS have less advanced technical knowledge, given lack of ICT infrastructure and limited data literacy among government employees, but they are directly incorporating data activities into government practice. | Medium to high levels of technical and data management expertise. MRCS and PCI directly implement data activities into their project management. | High/average: every citizen can learn how to use and contribute to OSM and download all the data. OSM developer community has high level of technical expertise. | High technical and data management expertise but not participating in data ecosystem outside the government database. Data activities directly embedded into government practice. | ||
Continuity of collaboration between users and suppliers | Mostly event-based, for example after a survey or census. | Continuous | DoS continuous; DoIWD mostly on demand | Event-based and on demand | Continuous | Event-based and on demand | ||
Communication | Mostly within the government and a few key development actors (such as UNICEF) via regular meetings and working groups. Communication to other actors less active. | Trainings on the app, easy to share feedback via the website, regular blogs. | Easy to share feedback via the website. Regular articles although not specifically for Malawi. | DoIWD plays a key role in organizing WASH meetings | DoS is in the steering committee of MASDAP but lacks resources to organize regular awareness meetings | Only within own organization. | The continuity of collaboration between users and suppliers is especially for OSM well developed and feedback mechanisms (in terms of for example, validation protocols of items mapped) are in place. | Website on the Water Futures Programme and the mWater platform have blogs, newsletters. |
Incentive to share and/or use data | Intangible: NSO has the mandate to compile statistical data also of other government bodies and to promote use of it for, for example, policy formulation. NSO does not directly use the data themselves. | Intangible. Share data to align efforts in the WASH sector through better monitoring. Tangible: use data for improving operation of water points. | Intangible. Guidelines for sharing might become part of future Land Survey bill. DoS is not directly using the data. | Intangible. Share data to create synergy or goodwill with other NGOs. Tangible: use data for project interventions. | Intangible, such as share data for recognition by OSM community. They usually do not directly use the data themselves. | No incentive to share data. Not requested by donor; government prefers not to share for accountability reasons. Incentive to use data for government interventions and development of investment plan. | ||
User selection | High level data is open. More detailed data on application basis. | On application basis | Open/on application basis | On agreement basis | On agreement basis | Open | Open | On application basis (to use the portal) and on demand (to get the data, but only sample set possible). |
Collaboration among data users | Self-selected analysis. | Self-selected analysis by several users | Self-selected analysis by several users | One user | One user | One user | Self-selected analysis by several users | Self-selected analysis by several users |
© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Share and Cite
Van den Homberg, M.; Susha, I. Characterizing Data Ecosystems to Support Official Statistics with Open Mapping Data for Reporting on Sustainable Development Goals. ISPRS Int. J. Geo-Inf. 2018, 7, 456. https://doi.org/10.3390/ijgi7120456
Van den Homberg M, Susha I. Characterizing Data Ecosystems to Support Official Statistics with Open Mapping Data for Reporting on Sustainable Development Goals. ISPRS International Journal of Geo-Information. 2018; 7(12):456. https://doi.org/10.3390/ijgi7120456
Chicago/Turabian StyleVan den Homberg, Marc, and Iryna Susha. 2018. "Characterizing Data Ecosystems to Support Official Statistics with Open Mapping Data for Reporting on Sustainable Development Goals" ISPRS International Journal of Geo-Information 7, no. 12: 456. https://doi.org/10.3390/ijgi7120456
APA StyleVan den Homberg, M., & Susha, I. (2018). Characterizing Data Ecosystems to Support Official Statistics with Open Mapping Data for Reporting on Sustainable Development Goals. ISPRS International Journal of Geo-Information, 7(12), 456. https://doi.org/10.3390/ijgi7120456