I suggest the following actions to address the big health data challenges described in “Big Health Data and Cardiovascular Diseases: A Challenge for Research, an Opportunity for Clinical Care”:
Missing Data
To address this challenge, I propose standardizing terminology and variables across all systems to improve interoperability. Data quality would also be improved by embedding a structured data system with defined key variables. Regular data audits would be performed, and a missing data report would be extracted from the system together with an action plan. Appropriate advocacy and orientation would be provided to all relevant stakeholders in collaboration with local authorities.
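As a rough illustration of what a missing data report could look like, the sketch below counts missing values for each key variable and flags variables that exceed a threshold, so they can be picked up in the action plan. The field names, the 10% threshold, and the rule for what counts as "missing" are all assumptions made for this example and are not taken from the article.

```python
def is_missing(value):
    """Treat None and blank or placeholder strings as missing (an assumed convention)."""
    if value is None:
        return True
    return isinstance(value, str) and value.strip().lower() in {"", "na", "unknown"}


def missing_data_report(records, key_variables, threshold_pct=10.0):
    """Summarize missingness per key variable and flag those needing action."""
    total = len(records)
    report = {}
    for var in key_variables:
        # A variable absent from a record counts as missing, same as an empty value.
        missing = sum(1 for rec in records if is_missing(rec.get(var)))
        pct = 100.0 * missing / total if total else 0.0
        report[var] = {
            "missing": missing,
            "percent": round(pct, 1),
            "needs_action": pct > threshold_pct,
        }
    return report


# Hypothetical records; field names are illustrative only.
records = [
    {"patient_id": 1, "systolic_bp": 132, "smoking_status": "never"},
    {"patient_id": 2, "systolic_bp": None, "smoking_status": "current"},
    {"patient_id": 3, "systolic_bp": 118, "smoking_status": ""},
    {"patient_id": 4, "systolic_bp": 141},
]

report = missing_data_report(records, ["systolic_bp", "smoking_status"])
for variable, stats in report.items():
    print(variable, stats)
```

In practice the same summary could be produced at each scheduled audit, so that changes in missingness over time are visible to the stakeholders responsible for data entry.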
Selection Bias
I would develop a predefined data analysis plan that identifies which variables are mandatory. Applying advanced data analysis techniques would also help to address this challenge.
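One possible way to make such a plan concrete is to declare the mandatory variables before any analysis and then report how many records the plan excludes, since a high exclusion rate is a warning sign that the analyzed sample may not represent the source population. This is only a sketch: the plan structure, variable names, and helper functions are hypothetical.

```python
# Hypothetical predefined analysis plan; variable names are illustrative only.
ANALYSIS_PLAN = {
    "mandatory": ["age", "sex", "diagnosis_code"],
    "optional": ["bmi"],
}


def split_by_plan(records, plan):
    """Separate records that have every mandatory variable from those that do not."""
    included, excluded = [], []
    for rec in records:
        complete = all(rec.get(var) is not None for var in plan["mandatory"])
        (included if complete else excluded).append(rec)
    return included, excluded


def exclusion_rate(included, excluded):
    """Fraction of records dropped because a mandatory variable is absent."""
    total = len(included) + len(excluded)
    return len(excluded) / total if total else 0.0


# Hypothetical records for demonstration.
cohort = [
    {"age": 61, "sex": "F", "diagnosis_code": "I21", "bmi": 27.4},
    {"age": 55, "sex": "M", "diagnosis_code": "I50", "bmi": None},
    {"age": None, "sex": "F", "diagnosis_code": "I10", "bmi": 31.0},
    {"age": 70, "sex": "M", "diagnosis_code": "I25"},
]

included, excluded = split_by_plan(cohort, ANALYSIS_PLAN)
print(f"included={len(included)} excluded={len(excluded)} "
      f"rate={exclusion_rate(included, excluded):.2f}")
```

Comparing the included and excluded groups on baseline characteristics would be a natural next step, but that comparison is outside this sketch.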
Data Analysis and Training
I would form multidisciplinary teams of clinicians, statisticians, and data scientists. Together with these teams, I would develop models and share reusable data analysis pipelines. In addition, I would provide training, in collaboration with academic and industry experts, to ensure that results are interpreted correctly.
Interpretation and Translational Applicability of Results
I propose developing clinically interpretable models supported by explainability tools, and involving clinicians to check whether the outputs align with real-world workflows.
Privacy and Ethical Issues
The data governance system will be strengthened. Societal benefits and individual rights need to be balanced.
