figshare
Browse
Basak_cmu_0041E_10183.pdf (2.29 MB)

Scalable Bayesian Network Learning and its Applications

Download (2.29 MB)
thesis
posted on 2018-08-13, 00:00 authored by Aniruddha BasakAniruddha Basak
The Bayesian network is a powerful tool for modeling of cause effect and other uncertain relations between variables in a domain of interest. Probabilistic reasoning with a Bayesian network offers prediction of one or more unobserved variables of interest, given evidence. To use a Bayesian network in a real-world problem, one may need to learn the structure, the parameters, or both from data. However, learning Bayesian networks from high dimensional
and large datasets is a computationally challenging
problem. Parameter learning from large datasets demands
considerable computational and memory resources. Moreover, the runtime of theoretically correct structure learning algorithms (such as Hill Climbing, PC) are super-linear in the number of data dimensions.
This research develops scalable techniques for both structure learning and parameter learning of Bayesian networks from data. For the parameter learning task, we proposed a novel decomposition of the Expectation Maximization algorithm in the MapReduce
framework, where computation is performed in parallel across partitions of the data records. This learning method can handle both complete and incomplete data. Complete data means that all features have values in all records in a dataset, while this is not the case in the incomplete data case. For the Bayesian network structure learning task, a novel score-based method is developed. Score-based structure learning may seems inherently sequential, due to its use of iterative improvement steps. However, we bring parallelism to the scorebased structure learning paradigm.. This is done by organizing the candidate updates for a given structure in a matrix, partitioning the matrix in blocks, and computing scores for each block in parallel. Moreover, we maintain an archive of potential structures, which appear in the search path and use them as starting points of restarts. This mechanism helps preventing the search from getting trapped in local optimal solutions. We apply the proposed techniques to several datasets including two real-world engineering problems: smart building optimization
and next-generation air trac control. For smart building
optimization, we study the isolation of candidate causes of
adverse events in a building heating-cooling system. We propose a novel scalable causal learning (SCL) method using Bayesian network structure learning as a central piece. Experimental results on a dataset collected from a Building Automation System (BAS) show improved prediction accuracy and reduced computation
time of SCL compared to existing algorithms. For next generation air traffic control, we improve on the prediction of
aircraft taxi times in airports via Bayesian network uncertainty modeling of surface traffic. We apply both Bayesian network structure learning and parameter learning on dataset obtained from surface trac simulations for three airports in the New York city multiplex: JFK, LGA, and EWR. We use junction tree inference on the trained model to obtain the posterior distribution of transit time variables. The uncertainly model of the transit times are used by a scheduler to minimize the overall delay of departures
while maximizing runway throughput. Existing approaches
heavily rely on subject matter expert models and therefore limited in scope. However, the scalable structure learning approach relies on data, and thereby enables the use of Bayesian networks in any arbitrary airport where prior expert knowledge is limited or unavailable.

History

Date

2018-08-13

Degree Type

  • Dissertation

Department

  • Electrical and Computer Engineering

Degree Name

  • Doctor of Philosophy (PhD)

Advisor(s)

Ole J. Mengshoel

Usage metrics

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC