Open Access. Powered by Scholars. Published by Universities.®

Engineering Commons

Articles 1 - 7 of 7

Full-Text Articles in Engineering

Mouldingnet: Deep-Learning For 3d Object Reconstruction, Tobias Burns, Barak Pearlmutter, John B. Mcdonald Jan 2019

Session 2: Deep Learning for Computer Vision

With the rise of deep neural networks, a number of approaches for learning over 3D data have gained popularity. In this paper, we take advantage of one of these approaches, bilateral convolutional layers, to propose a novel end-to-end deep auto-encoder architecture to efficiently encode and reconstruct 3D point clouds. Bilateral convolutional layers project the input point cloud onto an even tessellation of a hyperplane in the (d+1)-dimensional space, known as the permutohedral lattice, and perform convolutions over this representation. In contrast to existing point cloud based learning approaches, this allows us to learn over the underlying geometry of the …


Deep Cnn Frameworks For Comparison For Malaria Diagnosis, Priyadarshini Adyasha Pattanaik, Zelong Wang, Patrick Horain Jan 2019

Session 2: Deep Learning for Computer Vision

We compare deep convolutional neural network (DCNN) frameworks, namely AlexNet and VGGNet, for the classification of healthy and malaria-infected cells in large, grayscale, low-quality, low-resolution microscopic images when only a small training set is available. Experimental results are promising on the path to quick, automatic and precise classification in unstained images.


Place Recognition In Challenging Conditions, Saravanabalagi Ramachandran, John Mcdonald Jan 2019

Session 2: Deep Learning for Computer Vision

Place recognition in a visual SLAM system helps build and maintain a map from multiple traversals of the same environment while closing loops to correct drift accumulated over time. Despite the marked success of visual place recognition research over the past decade, it remains a challenging problem under variations caused by different times of day, weather, lighting and seasons. In this paper, we address this problem by progressively training convolutional neural networks in a siamese fashion to generate embeddings that encode semantic and visual features for sequence-aligned image pairs taken at different timescales and viewpoints. …


Deep Convolutional Neural Networks For Estimating Lens Distortion Parameters, Sebastian Lutz, Mark Davey, Aljosa Smolic Jan 2019

Session 2: Deep Learning for Computer Vision

In this paper we present a convolutional neural network (CNN) to predict multiple lens distortion parameters from a single input image. Unlike other methods, our network is suitable for producing high-resolution output, as it directly estimates the parameters from the image, which can then be used to rectify even very high-resolution input images. As our method is fully automatic, it is suitable for both casual creatives and professional artists. Our results show that our network accurately predicts the lens distortion parameters of high-resolution images and corrects the distortions satisfactorily.


Synthetic Positron Emission Tomography Using Conditional-Generative Adversarial Networks For Healthy Bone Marrow Baseline Image Generation, Patrick Leydon, Martin O'Connell, Derek Greene, Kathleen Curran Jan 2019

Session 6: Applications, Architecture and Systems Integration

A Conditional-Generative Adversarial Network has been used for a supervised image-to-image translation task which outputs a synthetic PET scan based on real patient CT data. The network is trained using only data of patients with healthy bone marrow metabolism. This allows for a patient-specific synthetic healthy baseline scan to be produced. This can be used by a clinician for comparison to real PET data in the absence of a baseline scan or to aid in the diagnosis of conditions such as Multiple Myeloma which manifest as changes in bone marrow metabolism.


Fisheyemultinet: Real-Time Multi-Task Learning Architecture For Surround-View Automated Parking System., Pullaro Maddu, Wayne Doherty, Ganesh Sistu, Isabelle Leang, Michal Uricar, Sumanth Chennupati, Hazem Rashed, Jonathan Horgan, Ciaran Hughes, Senthil Yogamani Jan 2019

Session 6: Applications, Architecture and Systems Integration

Automated parking is a low-speed manoeuvring scenario which is quite unstructured and complex, requiring full 360° near-field sensing around the vehicle. In this paper, we discuss the design and implementation of an automated parking system from the perspective of camera-based deep learning algorithms. We provide a holistic overview of an industrial system covering the embedded system, use cases and the deep learning architecture. We demonstrate a real-time multi-task deep learning network called FisheyeMultiNet, which detects all the necessary objects for parking on a low-power embedded system. FisheyeMultiNet runs at 15 fps for 4 cameras and it has three …


Multi-Sensory Deep Learning Architectures For Slam Dunk Scene Classification, Paul Minogue Jan 2019

Dissertations

Basketball teams at all levels of the game invest a considerable amount of time and effort into collecting, segmenting, and analysing footage from their upcoming opponents' previous games. This analysis helps teams identify and exploit the potential weaknesses of their opponents and is commonly cited as one of the key elements required to achieve success in the modern game. The growing importance of this type of analysis has prompted research into the application of computer vision and audio classification techniques to help teams classify scoring sequences and key events using game footage. However, this research tends to focus on classifying …