Computational Methods for Discovering and Analyzing Causal Relationships in Health Data

Liang, Yiheng

Physical Description

vii, 113 pages : illustrations (some color)

Creation Information

Liang, Yiheng August 2015.

Context

This dissertation is part of the collection entitled: UNT Theses and Dissertations and was provided by the UNT Libraries to the UNT Digital Library, a digital repository hosted by the UNT Libraries. It has been viewed 165 times. More information about this dissertation can be viewed below.

Author

Liang, Yiheng

Chair

Mikler, Armin (Major Professor)

Publisher

University of North Texas
Publisher Info: www.unt.edu

Place of Publication: Denton, Texas

Rights Holder

Liang, Yiheng

Provided By

UNT Libraries

The UNT Libraries serve the university and community by providing access to physical and online collections, fostering information literacy, supporting academic research, and much, much more.

Degree Information

Department: Department of Computer Science and Engineering
Discipline: Computer Science
Level: Doctoral
Name: Doctor of Philosophy
Grantor: University of North Texas
PublicationType: Doctoral Dissertation

Description

Publicly available datasets in health science are often large and observational, in contrast to experimental datasets, where a small amount of data is collected in controlled experiments. The causal relationships among variables in an observational dataset are yet to be determined. However, there is significant interest in health science in discovering and analyzing causal relationships from health data, since identified causal relationships help medical professionals prevent diseases or mitigate their negative effects. Recent advances in computer science, particularly in Bayesian networks, have renewed interest in causality research. Causal relationships can potentially be discovered by learning network structures from data. However, the number of candidate graphs grows at a super-exponential rate as the number of variables increases. Exact learning of the optimal structure is thus computationally infeasible in practice. As a result, heuristic approaches are needed to reduce the computational burden. This research provides effective and efficient learning tools for local causal discovery and novel methods for learning causal structures in combination with background knowledge. Specifically, in the direction of constraint-based structural learning, polynomial-time algorithms for constructing causal structures are designed using first-order conditional independence. Algorithms for efficiently discovering non-causal factors are developed and proved. In addition, when the background knowledge is partially known, methods of graph decomposition are provided to reduce the number of conditioned variables. Experiments on both synthetic data and real epidemiological data indicate that the provided methods are applicable to large-scale datasets and scalable for causal analysis in health data.
Following the research methods and experiments, this dissertation discusses the reliability of causal discoveries in computational health science research, their complexity, and their implications for health science research.
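
The description's remark that the number of candidate graphs grows faster than exponentially can be made concrete with a short sketch. The code below is an illustration and is not taken from the dissertation: it assumes the candidate structures are labeled directed acyclic graphs (the usual search space for Bayesian network structure learning) and counts them with Robinson's recurrence. The function name `count_dags` is a hypothetical helper introduced here.

```python
from functools import lru_cache
from math import comb


@lru_cache(maxsize=None)
def count_dags(n: int) -> int:
    """Number of labeled DAGs on n nodes (Robinson's recurrence).

    Hypothetical helper for illustration only; the dissertation's own
    algorithms are not reproduced here.
    """
    if n == 0:
        return 1  # the empty graph
    # Inclusion-exclusion over k, the size of a chosen set of nodes with no
    # incoming edges; each of them may point to any of the other n - k nodes.
    return sum(
        (-1) ** (k + 1) * comb(n, k) * 2 ** (k * (n - k)) * count_dags(n - k)
        for k in range(1, n + 1)
    )


if __name__ == "__main__":
    # Print the size of the search space for small numbers of variables.
    for n in range(1, 8):
        print(n, count_dags(n))
```

Even for a handful of variables the count becomes too large to enumerate, which is why the description turns to heuristic and constraint-based approaches rather than exact search.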

Language

English

Item Type

Thesis or Dissertation

Identifier

Unique identifying numbers for this dissertation in the Digital Library or other systems.

Archival Resource Key: ark:/67531/metadc804966

Collections

This dissertation is part of the following collection of related materials.

UNT Theses and Dissertations

Theses and dissertations represent a wealth of scholarly and artistic content created by masters and doctoral students in the degree-seeking process. Some ETDs in this collection are restricted to use by the UNT community.

Creation Date

August 2015

Added to The UNT Digital Library

March 4, 2016, 4:14 p.m.

Description Last Updated

May 10, 2017, 11:16 a.m.

Usage Statistics

Total Uses: 165

Liang, Yiheng. Computational Methods for Discovering and Analyzing Causal Relationships in Health Data, dissertation, August 2015; Denton, Texas. (https://digital.library.unt.edu/ark:/67531/metadc804966/: accessed June 7, 2024), University of North Texas Libraries, UNT Digital Library, https://digital.library.unt.edu.
