Alohomora: Unlocking data quality causes through event log context

, , , & (2020) Alohomora: Unlocking data quality causes through event log context. In Proceedings of the 28th European Conference on Information Systems (ECIS2020). Association for Information Systems, United States of America, pp. 1-16.

View at publisher

Description

Big data’s rise has amplified the role of information systems in process management. Process mining, a branch of data science, provides analytical tools and methods which can distil insights about process behaviour from big process-related data. Yet challenges remain, including dealing with the quality of big data and the impact of poor quality data on event logs as the input to process mining analyses. We show, through an analysis of 152 case studies, that despite researchers raising concerns about event log data quality, the event log preparation (data pre-processing) phase of process mining case studies is generally handled in a naive manner (as opposed to informed), focusing on fixing symptoms rather than uncovering the root causes of event log data quality issues.This paper considers event log data quality problems from a new angle. We introduce the Odigos (Greek for ‘guide’) framework, adapted from Mingers and Willcocks (2014), based on semiotics and Peircean abductive reasoning, that explains the notion of process mining context at a conceptual level. From a practical perspective, the Odigos framework facilitates an informed way of dealing with data quality issues in event logs through supporting both prognostic (foreshadowing potential quality issues) and diagnostic (identifying root causes of discovered quality issues) approaches. From a theoretical perspective, the work provides a foundation for the development of a process mining methodology for data pre-processing and for further IS theory development in the area of data analytics.

Impact and interest:

Search Google Scholar™

Citation counts are sourced monthly from Scopus and Web of Science® citation databases.

These databases contain citations from different subsets of available publications and different time periods and thus the citation count from each is usually different. Some works are not in either database and no count is displayed. Scopus includes citations from articles published in 1996 onwards, and Web of Science® generally from 1980 onwards.

Citations counts from the Google Scholar™ indexing service can be viewed at the linked Google Scholar™ search.

Full-text downloads:

490 since deposited on 21 Jul 2020
115 in the past twelve months

Full-text downloads displays the total number of times this work’s files (e.g., a PDF) have been downloaded from QUT ePrints as well as the number of downloads in the previous 365 days. The count includes downloads for all files if a work has more than one.

ID Code: 199828
Item Type: Chapter in Book, Report or Conference volume (Conference contribution)
ORCID iD:
Emamjome, Fahameorcid.org/0000-0001-9450-9999
Andrews, Robertorcid.org/0000-0001-7743-5772
ter Hofstede, Arthurorcid.org/0000-0002-2730-0201
Measurements or Duration: 16 pages
Additional URLs:
Keywords: process mining, data quality, semiotic
Pure ID: 49125275
Divisions: Past > Institutes > Institute for Future Environments
Past > QUT Faculties & Divisions > Science & Engineering Faculty
?? 3233 ??
Current > Research Centres > Centre for Tropical Crops and Biocommodities
Copyright Owner: Consult author(s) regarding copyright matters
Copyright Statement: This work is covered by copyright. Unless the document is being made available under a Creative Commons Licence, you must assume that re-use is limited to personal use and that permission from the copyright owner must be obtained for all other uses. If the document is available under a Creative Commons License (or other specified license) then refer to the Licence for details of permitted re-use. It is a condition of access that users recognise and abide by the legal requirements associated with these rights. If you believe that this work infringes copyright please provide details by email to qut.copyright@qut.edu.au
Deposited On: 21 Jul 2020 12:24
Last Modified: 29 Feb 2024 14:57