Bachelorarbeit, 2016
57 Seiten, Note: 1
1. Introduction
1.1. Motivation
1.2. Outline
1.3. The Goal
1.4. Methodology
2. Theory Section
2.1. Gaussian Mixture Model
2.2. Dirichlet Process GMM
2.3. Hierarchical Clustering
3. Implementation
3.1. Software Specification
3.2. Data and Sensors
3.3. Data Visualisation and Inspection
3.4. Sensor values Discretisation and Extraction
3.5. Unsupervised Event Extraction
3.6. Data Structure for Event Analysis
3.7. Data Quality and Quantity
3.8. Predictive Analysis
3.8.1. Hourly Binning Analysis
3.8.2. Clustering Analysis
3.8.2.1. Hierarchical Clustering Analysis
3.8.2.2. Dirichlet Process Gaussian Mixture Model Clustering Analysis
4. Summary
4.1. Results
4.2. Discussion
This thesis investigates unsupervised methods for predicting and clustering user behavior based on time-series data collected from infrared temperature sensors in an ambient assisted living (AAL) environment, with the aim of reliably identifying cooking activities.
3.4 Sensor Values Discretisation and Extraction
As noted earlier IIR sensor values need discretisation. The idea is to implement some kind of sampling on continuous values of the IIR cooking sensor. This is needed to be able to extract the cookings from the rest of the sensor events from the dataset.
Having a discrete signal from infrared temperature sensor, two approaches for cooking event extraction from a dataset are going to be discussed. The goal is to have possibilities of reliable and correct cooking events detection in a single dataset.
The first approach is the sequencing of temperature rises that belong together. The second approach is unsupervised event extraction using a clustering algorithm for grouping temperature increases that belong together. The latter is discussed in the next section.
Three big peaks from Figure 3.1 need to be discretized since only “heating in progress” on the cooking plate is of interest. This gives an idea that positive temperature increases should be inspected and leads to conclusion that the first step of temperature signal transforming should be a difference operation on temperature values.
1. Introduction: Presents the motivation for AAL systems and defines the goal of detecting elderly cooking patterns without supervised intervention.
2. Theory Section: Discusses the theoretical foundations of Gaussian Mixture Models, Dirichlet Process GMM, and Hierarchical Clustering.
3. Implementation: Describes the practical setup, sensor data formats, signal discretization, and the execution of predictive analysis using different clustering and binning approaches.
4. Summary: Evaluates the performance of the implemented methods and discusses findings regarding data quality and future improvements for adaptive systems.
AAL, Data Cluster Analysis, Predictive Analysis, Gaussian Mixture Model, DPGMM, Hierarchical Clustering, Hourly Binning, Time-Series Data, Infrared Sensors, User Behaviour, Unsupervised Learning, Activity Monitoring, Sensor Data Discretisation, Event Extraction, Smart Homes
The research focuses on analyzing time-series sensor data from private households to identify and predict cooking activities of older people to support independent living.
The work combines ambient assisted living (AAL) technologies, machine learning for clustering, and data preprocessing for predictive modeling.
The thesis aims to determine how reliable and unsupervised prediction of daily user behavior can be achieved using various clustering and binning algorithms on raw infrared sensor signals.
The author evaluates Hourly Based Binning against more complex unsupervised clustering techniques, specifically Hierarchical Clustering and Dirichlet Process Gaussian Mixture Models (DPGMM).
It details the software environment (Python/IPython), data cleansing of noisy CSV inputs, event isolation, sequence naming, and the comparison of different algorithms against training and testing datasets.
The methods are evaluated based on prediction hit rates, computational efficiency, robustness regarding sparse/noisy data, and the ability to find a realistic number of clusters.
DPGMM shows robustness with smaller, sparse datasets, though it often requires parameter tuning to outperform the simpler Hourly Binning approach.
It was chosen as an unsupervised grouping method that requires fewer parameters than other models and serves as a baseline to pre-estimate cluster counts for more complex models like DPGMM.
The study concludes that prediction performance is directly correlated with data density and recording duration, with smaller or noisier datasets leading to lower accuracy and higher miss rates.
Der GRIN Verlag hat sich seit 1998 auf die Veröffentlichung akademischer eBooks und Bücher spezialisiert. Der GRIN Verlag steht damit als erstes Unternehmen für User Generated Quality Content. Die Verlagsseiten GRIN.com, Hausarbeiten.de und Diplomarbeiten24 bieten für Hochschullehrer, Absolventen und Studenten die ideale Plattform, wissenschaftliche Texte wie Hausarbeiten, Referate, Bachelorarbeiten, Masterarbeiten, Diplomarbeiten, Dissertationen und wissenschaftliche Aufsätze einem breiten Publikum zu präsentieren.
Kostenfreie Veröffentlichung: Hausarbeit, Bachelorarbeit, Diplomarbeit, Dissertation, Masterarbeit, Interpretation oder Referat jetzt veröffentlichen!

