Current Search: Deep learning Machine learning (x)
View All Items
Pages
- Title
- Parallel Distributed Deep Learning on Cluster Computers.
- Creator
- Kennedy, Robert Kwan Lee, Khoshgoftaar, Taghi M., Florida Atlantic University, College of Engineering and Computer Science, Department of Computer and Electrical Engineering and Computer Science
- Abstract/Description
-
Deep Learning is an increasingly important subdomain of arti cial intelligence. Deep Learning architectures, arti cial neural networks characterized by having both a large breadth of neurons and a large depth of layers, bene ts from training on Big Data. The size and complexity of the model combined with the size of the training data makes the training procedure very computationally and temporally expensive. Accelerating the training procedure of Deep Learning using cluster computers faces...
Show moreDeep Learning is an increasingly important subdomain of arti cial intelligence. Deep Learning architectures, arti cial neural networks characterized by having both a large breadth of neurons and a large depth of layers, bene ts from training on Big Data. The size and complexity of the model combined with the size of the training data makes the training procedure very computationally and temporally expensive. Accelerating the training procedure of Deep Learning using cluster computers faces many challenges ranging from distributed optimizers to the large communication overhead speci c to a system with o the shelf networking components. In this thesis, we present a novel synchronous data parallel distributed Deep Learning implementation on HPCC Systems, a cluster computer system. We discuss research that has been conducted on the distribution and parallelization of Deep Learning, as well as the concerns relating to cluster environments. Additionally, we provide case studies that evaluate and validate our implementation.
Show less - Date Issued
- 2018
- PURL
- http://purl.flvc.org/fau/fd/FA00013080
- Subject Headings
- Deep learning., Neural networks (Computer science)., Artificial intelligence., Machine learning.
- Format
- Document (PDF)
- Title
- OPTIMIZED DEEP LEARNING ARCHITECTURES AND TECHNIQUES FOR EDGE AI.
- Creator
- Zaniolo, Luiz, Marques, Oge, Florida Atlantic University, Department of Computer and Electrical Engineering and Computer Science, College of Engineering and Computer Science
- Abstract/Description
-
The recent rise of artificial intelligence (AI) using deep learning networks allowed the development of automatic solutions for many tasks that, in the past, were seen as impossible to be performed by a machine. However, deep learning models are getting larger, need significant processing power to train, and powerful machines to use it. As deep learning applications become ubiquitous, another trend is taking place: the growing use of edge devices. This dissertation addresses selected...
Show moreThe recent rise of artificial intelligence (AI) using deep learning networks allowed the development of automatic solutions for many tasks that, in the past, were seen as impossible to be performed by a machine. However, deep learning models are getting larger, need significant processing power to train, and powerful machines to use it. As deep learning applications become ubiquitous, another trend is taking place: the growing use of edge devices. This dissertation addresses selected technical issues associated with edge AI, proposes novel solutions to them, and demonstrates the effectiveness of the proposed approaches. The technical contributions of this dissertation include: (i) architectural optimizations to deep neural networks, particularly the use of patterned stride in convolutional neural networks used for image classification; (ii) use of weight quantization to reduce model size without hurting its accuracy; (iii) systematic evaluation of the impact of image imperfections on skin lesion classifiers' performance in the context of teledermatology; and (iv) a new approach for code prediction using natural language processing techniques, targeted at edge devices.
Show less - Date Issued
- 2021
- PURL
- http://purl.flvc.org/fau/fd/FA00013822
- Subject Headings
- Artificial intelligence, Deep learning (Machine learning), Neural networks (Computer science)
- Format
- Document (PDF)
- Title
- IMAGE QUALITY AND BEAUTY CLASSIFICATION USING DEEP LEARNING.
- Creator
- Golchubian, Arash, Nojoumian, Mehrdad, Florida Atlantic University, Department of Computer and Electrical Engineering and Computer Science, College of Engineering and Computer Science
- Abstract/Description
-
The field of computer vision has grown by leaps and bounds in the past decade. The rapid advances can be largely attributed to advances made in the field of Artificial Neural Networks and more specifically can be attributed to the rapid advancement of Convolutional Neural Networks (CNN) and Deep Learning. One area that is of great interest to the research community at large is the ability to detect the quality of images in the sense of technical parameters such as blurriness, encoding...
Show moreThe field of computer vision has grown by leaps and bounds in the past decade. The rapid advances can be largely attributed to advances made in the field of Artificial Neural Networks and more specifically can be attributed to the rapid advancement of Convolutional Neural Networks (CNN) and Deep Learning. One area that is of great interest to the research community at large is the ability to detect the quality of images in the sense of technical parameters such as blurriness, encoding artifacts, saturation, and lighting, as well as for its’ aesthetic appeal. The purpose of such a mechanism could be detecting and discarding noisy, blurry, dark, or over exposed images, as well as detecting images that would be considered beautiful by a majority of viewers. In this dissertation, the detection of various quality and aesthetic aspects of an image using CNNs is explored. This research produced two datasets that are manually labeled for quality issues such as blur, poor lighting, and digital noise, and for their aesthetic qualities, and Convolutional Neural Networks were designed and trained using these datasets. Lastly, two case studies were performed to show the real-world impact of this research to traffic sign detection and medical image diagnosis.
Show less - Date Issued
- 2022
- PURL
- http://purl.flvc.org/fau/fd/FA00014029
- Subject Headings
- Deep learning (Machine learning), Computer vision, Aesthetics, Image Quality
- Format
- Document (PDF)
- Title
- NETWORK INTRUSION DETECTION AND DEEP LEARNING MECHANISMS.
- Creator
- Chatterjee, Suvosree, Cardei, Ionut, Florida Atlantic University, Department of Computer and Electrical Engineering and Computer Science, College of Engineering and Computer Science
- Abstract/Description
-
Cyber attack is a strong threat to the digital world. So, it’s very essential to keep the network safe. Network Intrusion Detection system is the system to address this problem. Network Intrusion Detection system functions like a firewall, and monitors incoming and outgoing traffic like ingress and egress filtering fire wall. Network Intrusion Detection System does anomaly and hybrid detection for detecting known and unknown attacks. My thesis discusses about the several network cyber attacks...
Show moreCyber attack is a strong threat to the digital world. So, it’s very essential to keep the network safe. Network Intrusion Detection system is the system to address this problem. Network Intrusion Detection system functions like a firewall, and monitors incoming and outgoing traffic like ingress and egress filtering fire wall. Network Intrusion Detection System does anomaly and hybrid detection for detecting known and unknown attacks. My thesis discusses about the several network cyber attacks we face nowadays and I created several Deep learning models to detect accurately, I used NSL-KDD dataset which is a popular dataset, that contains several network attacks. After experimenting with different deep learning models I found some disparities in the training accuracy and validation accuracy, which is a clear indication of overfitting. To reduce the overfitting I introduced regularization and dropout in the models and experimented with different hyperparameters.
Show less - Date Issued
- 2023
- PURL
- http://purl.flvc.org/fau/fd/FA00014128
- Subject Headings
- Deep learning (Machine learning), Cyberterrorism, Intrusion detection systems (Computer security)
- Format
- Document (PDF)
- Title
- A UNIFIED SOFT SENSING FRAMEWORK FOR COMPLEX DYNAMICAL SYSTEMS.
- Creator
- Huang, Yu, Tang, Yufei, Florida Atlantic University, Department of Computer and Electrical Engineering and Computer Science, College of Engineering and Computer Science
- Abstract/Description
-
In the past few years, the development of complex dynamical networks or systems has stimulated great interest in the study of the principles and mechanisms underlying the Internet of things (IoT). IoT is envisioned as an intelligent network infrastructure with a vast number of ubiquitous smart devices present in diverse application domains and have already improved many aspects of daily life. Many overtly futuristic IoT applications acquire data gathered via distributed sensors that can be...
Show moreIn the past few years, the development of complex dynamical networks or systems has stimulated great interest in the study of the principles and mechanisms underlying the Internet of things (IoT). IoT is envisioned as an intelligent network infrastructure with a vast number of ubiquitous smart devices present in diverse application domains and have already improved many aspects of daily life. Many overtly futuristic IoT applications acquire data gathered via distributed sensors that can be uniquely identified, localized, and communicated with, i.e., the support of sensor networks. Soft-sensing models are in demand to support IoT applications to achieve the maximal exploitation of transforming the information of measurements into more useful knowledge, which plays essential roles in condition monitoring, quality prediction, smooth control, and many other essential aspects of complex dynamical systems. This in turn calls for innovative soft-sensing models that account for scalability, heterogeneity, adaptivity, and robustness to unpredictable uncertainties. The advent of big data, the advantages of ever-evolving deep learning (DL) techniques (where models use multiple layers to extract multi-levels of feature representations progressively), as well as ever-increasing processing power in hardware, has triggered a proliferation of research that applies DL to soft-sensing models. However, many critical questions need to be further investigated in the deep learning-based soft-sensing.
Show less - Date Issued
- 2022
- PURL
- http://purl.flvc.org/fau/fd/FA00013993
- Subject Headings
- Dynamical systems, Dynamics, Sensor networks, Deep learning (Machine learning)
- Format
- Document (PDF)
- Title
- AN ARTIFICIAL INTELLIGENCE DRIVEN FRAMEWORK FOR MEDICAL IMAGING.
- Creator
- Sanghvi, Harshal A., Agarwal, Ankur, Florida Atlantic University, Department of Computer and Electrical Engineering and Computer Science, College of Engineering and Computer Science
- Abstract/Description
-
The major objective of this dissertation was to create a framework which is used for medical image diagnosis. In this diagnosis, we brought classification and diagnosing of diseases through an Artificial Intelligence based framework, including COVID, Pneumonia, and Melanoma cancer through medical images. The algorithm ran on multiple datasets. A model was developed which detected the medical images through changing hyper-parameters. The aim of this work was to apply the new transfer learning...
Show moreThe major objective of this dissertation was to create a framework which is used for medical image diagnosis. In this diagnosis, we brought classification and diagnosing of diseases through an Artificial Intelligence based framework, including COVID, Pneumonia, and Melanoma cancer through medical images. The algorithm ran on multiple datasets. A model was developed which detected the medical images through changing hyper-parameters. The aim of this work was to apply the new transfer learning framework DenseNet-201 for the diagnosis of the diseases and compare the results with the other deep learning models. The novelty in the proposed work was modifying the Dense Net 201 Algorithm, changing hyper parameters (source weights, Batch Size, Epochs, Architecture (number of neurons in hidden layer), learning rate and optimizer) to quantify the results. The novelty also included the training of the model by quantifying weights and in order to get more accuracy. During the data selection process, the data were cleaned, removing all the outliers. Data augmentation was used for the novel architecture to overcome overfitting and hence not producing false absurd results the computational performance was also observed. The proposed model results were also compared with the existing deep learning models and the algorithm was also tested on multiple datasets.
Show less - Date Issued
- 2023
- PURL
- http://purl.flvc.org/fau/fd/FA00014274
- Subject Headings
- Diagnostic imaging, Artificial intelligence, Deep learning (Machine learning)
- Format
- Document (PDF)
- Title
- FEATURE REPRESENTATION LEARNING FOR ONLINE ADVERTISING AND RECOMMENDATIONS.
- Creator
- Gharibshah, Zhabiz, Zhu, Xingquan, Florida Atlantic University, Department of Computer and Electrical Engineering and Computer Science, College of Engineering and Computer Science
- Abstract/Description
-
Online advertising [100], as a multi-billion dollar business, provides a common marketing experience when people access online services using electronic devices, such as desktop computers, tablets, smartphones, and so on. Using the Internet as a means of advertising, different stakeholders take actions in the background to provide and deliver advertisements to users through numerous platforms, such as search engines, news sites, and social networks, where dedicated spots of areas are used to...
Show moreOnline advertising [100], as a multi-billion dollar business, provides a common marketing experience when people access online services using electronic devices, such as desktop computers, tablets, smartphones, and so on. Using the Internet as a means of advertising, different stakeholders take actions in the background to provide and deliver advertisements to users through numerous platforms, such as search engines, news sites, and social networks, where dedicated spots of areas are used to display advertisements (ads) along with search results, posts, or page content. Online advertising is mainly based on dynamically selecting ads through a real-time bidding (or auction) mechanism. Predicting user responses like clicking ads in e-commerce platforms and internet-based advertising systems, as the first measurable user response, is an essential step for many digital advertising and recommendation systems to capture the user’s propensity to follow up actions, such as purchasing a product or subscribing to a service. To maximize revenue and user satisfaction, online advertising platforms must predict the expected user behavior of each displayed advertisement and maximize the user’s expectations of clicking [28]. Based on this observed feedback, these systems are tailored to user preferences to decide the order in that ads or any promoted content should be served to them. This objective provides an incentive to develop new research by using ideas derived from different domains like machine learning and data mining combined with models for information retrieval and mathematical optimization. They introduce different machine learning and data mining methods that employ deep learning-based predictive models to learn the representation of input features with the aim of user response prediction. Feature representation learning is known as a fundamental task on how to input information is going to be represented in machine learning models. A good feature representation learning method that seeks to learn low-dimensional embedding vectors is a key factor for the success of many downstream analytics tasks, such as click-through prediction and conversion prediction in recommendation systems and online advertising platforms.
Show less - Date Issued
- 2023
- PURL
- http://purl.flvc.org/fau/fd/FA00014269
- Subject Headings
- Internet advertising, Deep learning (Machine learning), Internet marketing
- Format
- Document (PDF)
- Title
- DATA AUGMENTATION IN DEEP LEARNING.
- Creator
- Shorten, Connor, Khoshgoftaar, Taghi M., Florida Atlantic University, Department of Computer and Electrical Engineering and Computer Science, College of Engineering and Computer Science
- Abstract/Description
-
Recent successes of Deep Learning-powered AI are largely due to the trio of: algorithms, GPU computing, and big data. Data could take the shape of hospital records, satellite images, or the text in this paragraph. Deep Learning algorithms typically need massive collections of data before they can make reliable predictions. This limitation inspired investigation into a class of techniques referred to as Data Augmentation. Data Augmentation was originally developed as a set of label-preserving...
Show moreRecent successes of Deep Learning-powered AI are largely due to the trio of: algorithms, GPU computing, and big data. Data could take the shape of hospital records, satellite images, or the text in this paragraph. Deep Learning algorithms typically need massive collections of data before they can make reliable predictions. This limitation inspired investigation into a class of techniques referred to as Data Augmentation. Data Augmentation was originally developed as a set of label-preserving transformations used in order to simulate large datasets from small ones. For example, imagine developing a classifier that categorizes images as either a “cat” or a “dog”. After initial collection and labeling, there may only be 500 of these images, which are not enough data points to train a Deep Learning model. By transforming these images with Data Augmentations such as rotations and brightness modifications, more labeled images are available for model training and classification! In addition to applications for learning from limited labeled data, Data Augmentation can also be used for generalization testing. For example, we can augment the test set to set the visual style of images to “winter” and see how that impacts the performance of a stop sign detector. The dissertation begins with an overview of Deep Learning methods such as neural network architectures, gradient descent optimization, and generalization testing. Following an initial description of this technology, the dissertation explains overfitting. Overfitting is the crux of Deep Learning methods in which improvements to the training set do not lead to improvements on the testing set. To the rescue are Data Augmentation techniques, of which the Dissertation presents an overview of the augmentations used for both image and text data, as well as the promising potential of generative data augmentation with models such as ChatGPT. The dissertation then describes three major experimental works revolving around CIFAR-10 image classification, language modeling a novel dataset of Keras information, and patient survival classification from COVID-19 Electronic Health Records. The dissertation concludes with a reflection on the evolution of limitations of Deep Learning and directions for future work.
Show less - Date Issued
- 2023
- PURL
- http://purl.flvc.org/fau/fd/FA00014228
- Subject Headings
- Deep learning (Machine learning), Artificial intelligence, Data augmentation
- Format
- Document (PDF)
- Title
- Computer-aided diagnosis of skin cancers using dermatology images.
- Creator
- Gilani, Syed Qasim, Marques, Oge, Florida Atlantic University, Department of Computer and Electrical Engineering and Computer Science, College of Engineering and Computer Science
- Abstract/Description
-
Skin cancer is a prevalent cancer that significantly contributes to global mortality rates. Early detection is crucial for a high survival rate. Dermatologists primarily rely on visual inspection to diagnose skin cancers, but this method is inaccurate. Deep learning algorithms can enhance the diagnostic accuracy of skin cancers. However, these algorithms require substantial labeled data for effective training. Acquiring annotated data for skin cancer classification is time-consuming,...
Show moreSkin cancer is a prevalent cancer that significantly contributes to global mortality rates. Early detection is crucial for a high survival rate. Dermatologists primarily rely on visual inspection to diagnose skin cancers, but this method is inaccurate. Deep learning algorithms can enhance the diagnostic accuracy of skin cancers. However, these algorithms require substantial labeled data for effective training. Acquiring annotated data for skin cancer classification is time-consuming, expensive, and necessitates expert annotation. Moreover, skin cancer datasets often suffer from imbalanced data distribution. Generative Adversarial Networks (GANs) can be used to overcome the challenges of data scarcity and lack of labels by automatically generating skin cancer images. However, training and testing data from different distributions can introduce domain shift and bias, impacting the model’s performance. This dissertation addresses this issue by developing deep learning-based domain adaptation models. Additionally, this research emphasizes deploying deep learning models on hardware to enable real-time skin cancer detection, facilitating accurate diagnoses by dermatologists. Deploying conventional deep learning algorithms on hardware is not preferred due to the problem of high resource consumption. Therefore, this dissertation presents spiking neural network-based (SNN) models designed specifically for hardware implementation. SNNs are preferred for their power-efficient behavior and suitability for hardware deployment.
Show less - Date Issued
- 2023
- PURL
- http://purl.flvc.org/fau/fd/FA00014233
- Subject Headings
- Deep learning (Machine learning), Diagnostic imaging, Skin--Cancer--Diagnosis
- Format
- Document (PDF)
- Title
- OCR2SEQ: A NOVEL MULTI-MODAL DATA AUGMENTATION PIPELINE FOR WEAK SUPERVISION.
- Creator
- Lowe, Michael A., Khoshgoftaar, Taghi M., Florida Atlantic University, Department of Computer and Electrical Engineering and Computer Science, College of Engineering and Computer Science
- Abstract/Description
-
With the recent large-scale adoption of Large Language Models in multidisciplinary research and commercial space, the need for large amounts of labeled data has become more crucial than ever to evaluate potential use cases for opportunities in applied intelligence. Most domain specific fields require a substantial shift that involves extremely large amounts of heterogeneous data to have meaningful impact on the pre-computed weights of most large language models. We explore extending the...
Show moreWith the recent large-scale adoption of Large Language Models in multidisciplinary research and commercial space, the need for large amounts of labeled data has become more crucial than ever to evaluate potential use cases for opportunities in applied intelligence. Most domain specific fields require a substantial shift that involves extremely large amounts of heterogeneous data to have meaningful impact on the pre-computed weights of most large language models. We explore extending the capabilities a state-of-the-art unsupervised pre-training method; Transformers and Sequential Denoising Auto-Encoder (TSDAE). In this study we show various opportunities for using OCR2Seq a multi-modal generative augmentation strategy to further enhance and measure the quality of noise samples used when using TSDAE as a pretraining task. This study is a first of its kind work that leverages converting both generalized and sparse domains of relational data into multi-modal sources. Our primary objective is measuring the quality of augmentation in relation to the current implementation of the sentence transformers library. Further work includes the effect on ranking, language understanding, and corrective quality.
Show less - Date Issued
- 2023
- PURL
- http://purl.flvc.org/fau/fd/FA00014367
- Subject Headings
- Natural language processing (Computer science), Deep learning (Machine learning)
- Format
- Document (PDF)
- Title
- ADVANCING ONE-CLASS CLASSIFICATION: A COMPREHENSIVE ANALYSIS FROM THEORY TO NOVEL APPLICATIONS.
- Creator
- Abdollah, Zadeh Azadeh, Khoshgoftaar, Taghi M., Florida Atlantic University, Department of Computer and Electrical Engineering and Computer Science, College of Engineering and Computer Science
- Abstract/Description
-
This dissertation explores one-class classification (OCC) in the context of big data and fraud detection, addressing challenges posed by imbalanced datasets. A detailed survey of OCC-related literature forms a core part of the study, categorizing works into outlier detection, novelty detection, and deep learning applications. This survey reveals a gap in the application of OCC to the inherent problems of big data, such as class rarity and noisy data. Building upon the foundational insights...
Show moreThis dissertation explores one-class classification (OCC) in the context of big data and fraud detection, addressing challenges posed by imbalanced datasets. A detailed survey of OCC-related literature forms a core part of the study, categorizing works into outlier detection, novelty detection, and deep learning applications. This survey reveals a gap in the application of OCC to the inherent problems of big data, such as class rarity and noisy data. Building upon the foundational insights gained from the comprehensive literature review on OCC, the dissertation progresses to a detailed comparative analysis between OCC and binary classification methods. This comparison is pivotal in understanding their respective strengths and limitations across various applications, emphasizing their roles in addressing imbalanced datasets. The research then specifically evaluates binary and OCC using credit card fraud data. This practical application highlights the nuances and effectiveness of these classification methods in real-world scenarios, offering insights into their performance in detecting fraudulent activities. After the evaluation of binary and OCC using credit card fraud data, the dissertation extends this inquiry with a detailed investigation into the effectiveness of both methodologies in fraud detection. This extended analysis involves utilizing not only the Credit Card Fraud Detection Dataset but also the Medicare Part D dataset. The findings show the comparative performance and suitability of these classification methods in practical fraud detection scenarios. Finally, the dissertation examines the impact of training OCC algorithms on majority versus minority classes, using the two previously mentioned datasets in addition to Medicare Part B and Durable Medical Equipment, Prosthetics, Orthotics and Supplies (DMEPOS) datasets. This exploration offers critical insights into model training strategies and their implications, suggesting that training on the majority class can often lead to more robust classification results. In summary, this dissertation provides a deep understanding of OCC, effectively bridging theoretical concepts with novel applications in big data and fraud detection. It contributes to the field by offering a comprehensive analysis of OCC methodologies, their practical implications, and their effectiveness in addressing class imbalance in big data.
Show less - Date Issued
- 2024
- PURL
- http://purl.flvc.org/fau/fd/FA00014387
- Subject Headings
- Classification, Big data, Deep learning (Machine learning), Computer engineering
- Format
- Document (PDF)
- Title
- PRESERVING KNOWLEDGE IN SIMULATED BEHAVIORAL ACTION LOOPS.
- Creator
- St.Clair, Rachel, Barenholtz, Elan, Hahn, William, Florida Atlantic University, Center for Complex Systems and Brain Sciences, Charles E. Schmidt College of Science
- Abstract/Description
-
One basic goal of artificial learning systems is the ability to continually learn throughout that system’s lifetime. Transitioning between tasks and re-deploying prior knowledge is thus a desired feature of artificial learning. However, in the deep-learning approaches, the problem of catastrophic forgetting of prior knowledge persists. As a field, we want to solve the catastrophic forgetting problem without requiring exponential computations or time, while demonstrating real-world relevance....
Show moreOne basic goal of artificial learning systems is the ability to continually learn throughout that system’s lifetime. Transitioning between tasks and re-deploying prior knowledge is thus a desired feature of artificial learning. However, in the deep-learning approaches, the problem of catastrophic forgetting of prior knowledge persists. As a field, we want to solve the catastrophic forgetting problem without requiring exponential computations or time, while demonstrating real-world relevance. This work proposes a novel model which uses an evolutionary algorithm similar to a meta-learning objective, that is fitted with a resource constraint metrics. Four reinforcement learning environments are considered with the shared concept of depth although the collection of environments is multi-modal. This system shows preservation of some knowledge in sequential task learning and protection of catastrophic forgetting in deep neural networks.
Show less - Date Issued
- 2022
- PURL
- http://purl.flvc.org/fau/fd/FA00013896
- Subject Headings
- Artificial intelligence, Deep learning (Machine learning), Reinforcement learning, Neural networks (Computer science)
- Format
- Document (PDF)
- Title
- FINANCIAL TIME-SERIES ANALYSIS WITH DEEP NEURAL NETWORKS.
- Creator
- Rimal, Binod, Hahn, William Edward, Florida Atlantic University, Department of Mathematical Sciences, Charles E. Schmidt College of Science
- Abstract/Description
-
Financial time-series data are noisy, volatile, and nonlinear. The classic statistical linear models may not capture those underlying structures of the data. The rapid advancement in artificial intelligence and machine learning techniques, availability of large-scale data, and increased computational capabilities of a machine opens the door to developing sophisticated deep learning models to capture the nonlinearity and hidden information in the data. Creating a robust model by unlocking the...
Show moreFinancial time-series data are noisy, volatile, and nonlinear. The classic statistical linear models may not capture those underlying structures of the data. The rapid advancement in artificial intelligence and machine learning techniques, availability of large-scale data, and increased computational capabilities of a machine opens the door to developing sophisticated deep learning models to capture the nonlinearity and hidden information in the data. Creating a robust model by unlocking the power of a deep neural network and using real-time data is essential in this tech era. This study constructs a new computational framework to uncover the information in the financial time-series data and better inform the related parties. It carries out the comparative analysis of the performance of the deep learning models on stock price prediction with a well-balanced set of factors from fundamental data, macroeconomic data, and technical indicators responsible for stock price movement. We further build a novel computational framework through a merger of recurrent neural networks and random compression for the time-series analysis. The performance of the model is tested on a benchmark anomaly time-series dataset. This new computational framework in a compressed paradigm leads to improved computational efficiency and data privacy. Finally, this study develops a custom trading simulator and an agent-based hybrid model by combining gradient and gradient-free optimization methods. In particular, we explore the use of simulated annealing with stochastic gradient descent. The model trains a population of agents to predict appropriate trading behaviors such as buy, hold, or sell by optimizing the portfolio returns. Experimental results on S&P 500 index show that the proposed model outperforms the baseline models.
Show less - Date Issued
- 2022
- PURL
- http://purl.flvc.org/fau/fd/FA00014009
- Subject Headings
- Neural networks (Computer science), Deep learning (Machine learning), Time-series analysis, Stocks, Simulated annealing (Mathematics)
- Format
- Document (PDF)
- Title
- DEEP LEARNING REGRESSION MODELS FOR LIMITED BIOMEDICAL TIME-SERIES DATA.
- Creator
- Hssayeni, Murtadha D., Behnaz Ghoraani, Behnaz, Florida Atlantic University, Department of Computer and Electrical Engineering and Computer Science, College of Engineering and Computer Science
- Abstract/Description
-
Time-series data in biomedical applications are gaining an increased interest to detect and predict underlying diseases and estimate their severity, such as Parkinson’s disease (PD) and cardiovascular diseases. This interest is driven by advances in wearable sensors and deep learning models to a large extent. In the literature, less attention has been paid to regression models for continuous outcomes in these applications, especially when dealing with limited data. Training deep learning...
Show moreTime-series data in biomedical applications are gaining an increased interest to detect and predict underlying diseases and estimate their severity, such as Parkinson’s disease (PD) and cardiovascular diseases. This interest is driven by advances in wearable sensors and deep learning models to a large extent. In the literature, less attention has been paid to regression models for continuous outcomes in these applications, especially when dealing with limited data. Training deep learning models on raw limited data results in overfitted models, which is the main technical challenge we address in this dissertation. An example of limited and\or imbalanced time-series data is PD’s motion signals that are needed for the continuous severity estimation of Parkinson’s disease (PD). The significance of this continuous estimation is providing a tool for longitudinal monitoring of daily motor and non-motor fluctuations and managing PD medications. The dissertation objective is to train generalizable deep learning models for biomedical regression problems when dealing with limited training time-series data. The goal is designing, developing, and validating an automatic assessment system based on wearable sensors that can measure the severity of PD complications in the home-living environment while patients with PD perform their activities of daily living (ADL). We first propose using a combination of domain-specific feature engineering, transfer learning, and an ensemble of multiple modalities. Second, we utilize generative adversarial networks (GAN) and propose a new formulation of conditional GAN (cGAN) as a generative model for regression to handle an imbalanced training dataset. Next, we propose a dual-channel auxiliary regressor GAN (AR-GAN) trained using Wasserstein-MSE-correlation loss. The proposed AR-GAN is used as a data augmentation method in regression problems.
Show less - Date Issued
- 2022
- PURL
- http://purl.flvc.org/fau/fd/FA00013992
- Subject Headings
- Deep learning (Machine learning), Regression analysis--Mathematical models, Biomedical engineering
- Format
- Document (PDF)
- Title
- DATA-DRIVEN IDENTIFICATION AND CONTROL OF TURBULENT STRUCTURES USING DEEP NEURAL NETWORKS.
- Creator
- Jagodinski, Eric, Verma, Siddhartha, Florida Atlantic University, Department of Ocean and Mechanical Engineering, College of Engineering and Computer Science
- Abstract/Description
-
Wall-bounded turbulent flows are pervasive in numerous physics and engineering applications. Such flows tend to have a strong impact on the design of ships, airplanes and rockets, industrial chemical mixing, wind and hydrokinetic energy, utility infrastructure and innumerable other fields. Understanding and controlling wall bounded turbulence has been a long-pursued endeavor yielding plentiful scientific and engineering discoveries, but there is much that remains unexplained from a...
Show moreWall-bounded turbulent flows are pervasive in numerous physics and engineering applications. Such flows tend to have a strong impact on the design of ships, airplanes and rockets, industrial chemical mixing, wind and hydrokinetic energy, utility infrastructure and innumerable other fields. Understanding and controlling wall bounded turbulence has been a long-pursued endeavor yielding plentiful scientific and engineering discoveries, but there is much that remains unexplained from a fundamental viewpoint. One unexplained phenomenon is the formation and impact of coherent structures like the ejections of slow near-wall fluid into faster moving ow which have been shown to correlate with increases in friction drag. This thesis focuses on recognizing and regulating organized structures within wall-bounded turbulent flows using a variety of machine learning techniques to overcome the nonlinear nature of this phenomenon. Deep Learning has provided new avenues of analyzing large amounts of data by applying techniques modeled after biological neurons. These techniques allow for the discovery of nonlinear relationships in massive, complex systems like the data found frequently in fluid dynamics simulation. Using a neural network architecture called Convolutional Neural Networks that specializes in uncovering spatial relationships, a network was trained to estimate the relative intensity of ejection structures within turbulent flow simulation without any a priori knowledge of the underlying flow dynamics. To explore the underlying physics that the trained network might reveal, an interpretation technique called Gradient-based Class Activation Mapping was modified to identify salient regions in the flow field which most influenced the trained network to make an accurate estimation of these organized structures. Using various statistical techniques, these salient regions were found to have a high correlation to ejection structures, and to high positive kinetic energy production, low negative production, and low energy dissipation regions within the flow. Additionally, these techniques present a general framework for identifying nonlinear causal structures in general three-dimensional data in any scientific domain where the underlying physics may be unknown.
Show less - Date Issued
- 2022
- PURL
- http://purl.flvc.org/fau/fd/FA00014119
- Subject Headings
- Turbulent flow, Turbulence, Neural networks (Computer science), Deep learning (Machine learning)
- Format
- Document (PDF)
- Title
- FROM DNA TO GRAVITATIONAL WAVES: APPLICATIONS OF STATISTICS AND MACHINE LEARNING.
- Creator
- Alemrajabi, Mahsa Firouzabad, Tichy, Wolfgang, Assis, Raquel, Florida Atlantic University, Department of Physics, Charles E. Schmidt College of Science
- Abstract/Description
-
In the current world of fast-paced data production, statistics and machine learning tools are essential for interpreting and utilizing the full potential of this data. This dissertation comprises three studies employing statistical analysis and Convolutional Neural Network models. First, the research investigates the genetic evolution of the SARS-CoV-2 RNA molecule, emphasizing the role of epistasis in the RNA virus’s ability to adapt and survive. Through statistical tests, this study...
Show moreIn the current world of fast-paced data production, statistics and machine learning tools are essential for interpreting and utilizing the full potential of this data. This dissertation comprises three studies employing statistical analysis and Convolutional Neural Network models. First, the research investigates the genetic evolution of the SARS-CoV-2 RNA molecule, emphasizing the role of epistasis in the RNA virus’s ability to adapt and survive. Through statistical tests, this study validates the significant impacts of genetic interactions and mutations on the virus’s structural changes over time, offering insights into its evolutionary dynamics. Secondly, the dissertation explores medical diagnosis by implementing Convolutional Neural Networks to differentiate between lung CT-scans of COVID-19 and non-COVID patients. This portion of the research demonstrates the capability of deep learning to enhance diagnostic processes, thereby reducing time and increasing accuracy in clinical settings. Lastly, we delve into gravitational wave detection, an area of astrophysics requiring precise data analysis to identify signals from cosmic events such as black hole mergers. Our goal is to utilize Convolutional Neural Network models in hopes of improving the sensitivity and accuracy of detecting these difficult to catch signals, pushing the boundaries of what we can observe in the universe. The findings of this dissertation underscore the utility of combining statistical methods and machine learning models to solve problems that are not only varied but also highly impactful in their respective fields.
Show less - Date Issued
- 2024
- PURL
- http://purl.flvc.org/fau/fd/FA00014454
- Subject Headings
- Neural networks (Computer science), Gravitational waves, Deep learning (Machine learning), Diagnosis, Epistasis, Genetic
- Format
- Document (PDF)
- Title
- MULTI-MODEL DEEP LEARNING FOR GROUPER SOUND CLASSIFICATION AND SEIZURE PREDICTION.
- Creator
- Ibrahim, Ali K., Zhuang, Hanqi, Florida Atlantic University, Department of Computer and Electrical Engineering and Computer Science, College of Engineering and Computer Science
- Abstract/Description
-
Deep learning models have been successfully applied to a variety of machine learning tasks, including image identification, image segmentation, object detection, speaker recognition, natural language processing, bioinformatics and drug discovery, among other things. This dissertation introduces Multi-Model Deep Learning (MMDL), a new ensemble deep learning approach for signal classification and event forecasting. The ultimate goal of the MMDL method is to improve classification and...
Show moreDeep learning models have been successfully applied to a variety of machine learning tasks, including image identification, image segmentation, object detection, speaker recognition, natural language processing, bioinformatics and drug discovery, among other things. This dissertation introduces Multi-Model Deep Learning (MMDL), a new ensemble deep learning approach for signal classification and event forecasting. The ultimate goal of the MMDL method is to improve classification and forecasting performances of individual classifiers by fusing results of participating deep learning models. The performance of such an ensemble model, however, depends heavily on the following two design features. Firstly, the diversity of the participating (or base) deep learning models is crucial. If all base deep learning models produce similar classification results, then combining these results will not provide much improvement. Thus, diversity is considered to be a key design feature of any successful MMDL system. Secondly, the selection of a fusion function, namely, a suitable function to integrate the results of all the base models, is important. In short, building an effective MMDL system is a complex and challenging process which requires deep knowledge of the problem context and a well-defined prediction process. The proposed MMDL method utilizes a bank of Convolutional Neural Networks (CNNs) and Stacked AutoEncoders (SAEs). To reduce the design complexity, a randomized generation process is applied to assign values to hyperparameters of base models. To speed up the training process, new feature extraction procedures which captures time-spatial characteristics of input signals are also explored. The effectiveness of the MMDL method is validated in this dissertation study with three real-world case studies. In the first case study, the MMDL model is applied to classify call types of groupers, an important fishery resource in the Caribbean that produces sounds associated with reproductive behaviors during yearly spawning aggregations. In the second case study, the MMDL model is applied to detect upcalls of North Atlantic Right Whales (NARWs), a type of endangered whales. NARWs use upcalls to communicate among themselves. In the third case study, the MMDL model is modified to predict seizure episodes. In all these cases, the proposed MMDL model outperforms existing state-of-the-art methods.
Show less - Date Issued
- 2019
- PURL
- http://purl.flvc.org/fau/fd/FA00013382
- Subject Headings
- Deep Learning, Machine Learning, Neural networks (Computer science), Groupers, Whales, Vocalization, Animal, Seizures
- Format
- Document (PDF)
- Title
- MACHINE LEARNING DEMODULATOR ARCHITECTURES FOR POWER-LIMITED COMMUNICATIONS.
- Creator
- Gorday, Paul E., Nurgun, Erdol, Florida Atlantic University, Department of Computer and Electrical Engineering and Computer Science, College of Engineering and Computer Science
- Abstract/Description
-
The success of deep learning has renewed interest in applying neural networks and other machine learning techniques to most fields of data and signal processing, including communications. Advances in architecture and training lead us to consider new modem architectures that allow flexibility in design, continued learning in the field, and improved waveform coding. This dissertation examines neural network architectures and training methods suitable for demodulation in power-limited...
Show moreThe success of deep learning has renewed interest in applying neural networks and other machine learning techniques to most fields of data and signal processing, including communications. Advances in architecture and training lead us to consider new modem architectures that allow flexibility in design, continued learning in the field, and improved waveform coding. This dissertation examines neural network architectures and training methods suitable for demodulation in power-limited communication systems, such as those found in wireless sensor networks. Such networks will provide greater connection to the world around us and are expected to contain orders of magnitude more devices than cellular networks. A number of standard and proprietary protocols span this space, with modulations such as frequency-shift-keying (FSK), Gaussian FSK (GFSK), minimum shift keying (MSK), on-off-keying (OOK), and M-ary orthogonal modulation (M-orth). These modulations enable low-cost radio hardware with efficient nonlinear amplification in the transmitter and noncoherent demodulation in the receiver.
Show less - Date Issued
- 2020
- PURL
- http://purl.flvc.org/fau/fd/FA00013511
- Subject Headings
- Deep learning, Machine learning--Technique, Demodulators, Wireless sensor networks, Computer network architectures
- Format
- Document (PDF)
- Title
- META-LEARNING AND ENSEMBLE METHODS FOR DEEP NEURAL NETWORKS.
- Creator
- Liu, Feng, Dingding, Wang, Florida Atlantic University, Department of Computer and Electrical Engineering and Computer Science, College of Engineering and Computer Science
- Abstract/Description
-
Deep Neural Networks have been widely applied in many different applications and achieve significant improvement over classical machine learning techniques. However, training a neural network usually requires large amount of data, which is not guaranteed in some applications such as medical image classification. To address this issue, people propose to implement meta learning and ensemble learning techniques to make deep learning trainers more powerful. This thesis focuses on using deep...
Show moreDeep Neural Networks have been widely applied in many different applications and achieve significant improvement over classical machine learning techniques. However, training a neural network usually requires large amount of data, which is not guaranteed in some applications such as medical image classification. To address this issue, people propose to implement meta learning and ensemble learning techniques to make deep learning trainers more powerful. This thesis focuses on using deep learning equipped with meta learning and ensemble learning to study specific problems. We first propose a new deep learning based method for suggestion mining. The major challenges of suggestion mining include cross domain issue and the issues caused by unstructured and highly imbalanced data structure. To overcome these challenges, we propose to apply Random Multi-model Deep Learning (RMDL) which combines three different deep learning architectures (DNNs, RNNs and CNNs) and automatically selects the optimal hyper parameter to improve the robustness and flexibility of the model. Our experimental results on the SemEval-2019 competition Task 9 data sets demonstrate that our proposed RMDL outperforms most of the existing suggestion mining methods.
Show less - Date Issued
- 2020
- PURL
- http://purl.flvc.org/fau/fd/FA00013481
- Subject Headings
- Neural networks (Computer science), Deep learning, Neural Networks in Applications, Machine learning--Technique
- Format
- Document (PDF)
- Title
- STUDY AND ANALYSIS OF MACHINE LEARNING TECHNIQUES FOR DETECTION OF DISTRACTED DRIVERS.
- Creator
- Qu, Fangming, Nojoumian, Mehrdad, Florida Atlantic University, Department of Computer and Electrical Engineering and Computer Science, College of Engineering and Computer Science
- Abstract/Description
-
The rise of Advanced Driver-Assistance Systems (ADAS) and Autonomous Vehicles (AVs) emphasizes the urgent need to combat distracted driving. This study introduces a fresh approach for improved detection of distracted drivers, combining a pre-trained Convolutional Neural Network (CNN) with a Bidirectional Long Short- Term Memory (BiLSTM) network. Our analysis utilizes both spatial and temporal features to examine a broad array of driver distractions. We demonstrate the advantage of this CNN...
Show moreThe rise of Advanced Driver-Assistance Systems (ADAS) and Autonomous Vehicles (AVs) emphasizes the urgent need to combat distracted driving. This study introduces a fresh approach for improved detection of distracted drivers, combining a pre-trained Convolutional Neural Network (CNN) with a Bidirectional Long Short- Term Memory (BiLSTM) network. Our analysis utilizes both spatial and temporal features to examine a broad array of driver distractions. We demonstrate the advantage of this CNN-BiLSTM framework over conventional methods, achieving significant precision (up to 98.97%) on the combined ’Union Dataset,’ merging the Kaggle State Farm Dataset and AUC Distracted Driver Dataset (AUC-DDD). This research enhances safety in autonomous vehicles by providing a solid and flexible solution for everyday use. Our results mark considerable progress in accurately identifying driver distractions, pushing the boundaries of safety technology in AVs.
Show less - Date Issued
- 2024
- PURL
- http://purl.flvc.org/fau/fd/FA00014418
- Subject Headings
- Deep learning (Machine learning), Distracted driving, Transportation--Safety measures, Automated vehicles--Safety measures
- Format
- Document (PDF)