Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Graphics and Human Computer Interfaces

2013

Institution
Keyword
Publication
Publication Type
File Type

Articles 1 - 30 of 92

Full-Text Articles in Physical Sciences and Mathematics

A Modular Approach To The Development Of Interactive Augmented Reality Applications., Nelson J. Andre Dec 2013

A Modular Approach To The Development Of Interactive Augmented Reality Applications., Nelson J. Andre

Electronic Thesis and Dissertation Repository

Augmented reality (AR) technologies are becoming increasingly popular as a result of the increase in the power of mobile computing devices. Emerging AR applications have the potential to have an enormous impact on industries such as education, healthcare, research, training and entertainment. There are currently a number of augmented reality toolkits and libraries available for the development of these applications; however, there is currently no standard tool for development. In this thesis we propose a modular approach to the organization and development of AR systems in order to enable the creation novel AR experiences. We also investigate the incorporation of …


Detecting Multilingual Lines Of Text With Fusion Moves, Igor Milevskiy Dec 2013

Detecting Multilingual Lines Of Text With Fusion Moves, Igor Milevskiy

Electronic Thesis and Dissertation Repository

This thesis proposes an optimization-based algorithm for detecting lines of text in images taken by hand-held cameras. The majority of existing methods for this problem assume alphabet-based texts (e.g. in Latin or Greek) and they use heuristics specific to such texts: proximity between letters within one line, larger distance between separate lines, etc. We are interested in a more challenging problem where images combine alphabet and logographic characters from multiple languages where typographic rules vary a lot (e.g. English, Korean, and Chinese). Significantly higher complexity of fitting multiple lines of text in different languages calls for an energy-based formulation combining …


Redesign Of Johar: A Framework For Developing Accessible Applications, Oladapo Oyebode Dec 2013

Redesign Of Johar: A Framework For Developing Accessible Applications, Oladapo Oyebode

Electronic Thesis and Dissertation Repository

As the population of disabled people continues to grow, designing accessible applications is still a challenge, since most applications are incompatible with assistive technologies used by disabled people to interact with the computer. This accessibility issue is usually caused by the reluctance of software engineers or developers to include complete accessibility features in their applications, which in turn is often due to the extra cost and development effort required to dynamically adapt applications to a wide range of disabilities. Our aim to resolve accessibility issues led to the design and implementation of the "Johar" framework, which facilitates the development of …


Astrophobia: A 3d Multiplayer Space Combat Game With Linear Entity Interpolation, Luke Larson Dec 2013

Astrophobia: A 3d Multiplayer Space Combat Game With Linear Entity Interpolation, Luke Larson

Computer Science and Software Engineering

Astrophobia is a Descent-like 3D networked multiplayer space combat game with linear entity interpolation (client-side animation between game-state packets to give the illusion of continuous game updates) similar to entity interpolation implemented in Valve Software’s Source engine. Additionally, Astrophobia procedurally generates unique levels, has zero-gravity physics, ship and projectile wall bouncing, hit detection, OpenGL 3D graphics, Phong lighting, ship model and texture, and a simple HUD that provides visualization of health points, aiming crosshair, and player scoreboard.


Reconstructing Point Clouds Of Mid-Size Objects, Spencer Woodworth Dec 2013

Reconstructing Point Clouds Of Mid-Size Objects, Spencer Woodworth

Computer Science and Software Engineering

This project explores the use of an inexpensive 3D camera for the acquisition and reconstruction of mid-size objects. The disparity of objects between stereo image pairs are used to calculate depth and generate a depth map. The depth map is used to generate a point cloud representation of the object from a single view. Finally, point clouds are generated from several views of an object and then aligned and merged into a seamless 360-degree point cloud.


Dense Image Correspondence Under Large Appearance Variations, Linlin Liu, Kok-Lim Low, Wen-Yan Lin Dec 2013

Dense Image Correspondence Under Large Appearance Variations, Linlin Liu, Kok-Lim Low, Wen-Yan Lin

Research Collection School Of Computing and Information Systems

This paper addresses the difficult problem of finding dense correspondence across images with large appearance variations. Our method uses multiple feature samples at each pixel to deal with the appearance variations based on our observation that pre-defined single feature sample provides poor results in nearest neighbor matching. We apply the idea in a flow-based matching framework and utilize the best feature sample for each pixel to determine the flow field. We propose a novel energy function and use dual-layer loopy belief propagation to minimize it where the correspondence, the feature scale and rotation parameters are solved simultaneously. Our method is …


Partial Least Squares Regression On Grassmannian Manifold For Emotion Recognition, M. Liu, R. Wang, Zhiwu Huang, S. Shan, X. Chen Dec 2013

Partial Least Squares Regression On Grassmannian Manifold For Emotion Recognition, M. Liu, R. Wang, Zhiwu Huang, S. Shan, X. Chen

Research Collection School Of Computing and Information Systems

In this paper, we propose a method for video-based human emotion recognition. For each video clip, all frames are represented as an image set, which can be modeled as a linear subspace to be embedded in Grassmannian manifold. After feature extraction, Class-specific One-to-Rest Partial Least Squares (PLS) is learned on video and audio data respectively to distinguish each class from the other confusing ones. Finally, an optimal fusion of classifiers learned from both modalities (video and audio) is conducted at decision level. Our method is evaluated on the Emotion Recognition In The Wild Challenge (EmotiW 2013). The experimental results on …


A Bandwidth-Conserving Architecture For Crawling Virtual Worlds, Dipesh Gautam Dec 2013

A Bandwidth-Conserving Architecture For Crawling Virtual Worlds, Dipesh Gautam

Graduate Theses and Dissertations

A virtual world is a computer-based simulated environment intended for its users to inhabit via avatars. Content in virtual worlds such as Second Life or OpenSimulator is increasingly presented using three-dimensional (3D) dynamic presentation technologies that challenge traditional search technologies. As 3D environments become both more prevalent and more fragmented, the need for a data crawler and distributed search service will continue to grow. By increasing the visibility of content across virtual world servers in order to better collect and integrate the 3D data we can also improve the crawling and searching efficiency and accuracy by avoiding crawling unchanged regions …


Conceptual, Impact-Based Publications Recommendations, Ann Smittu Joseph Dec 2013

Conceptual, Impact-Based Publications Recommendations, Ann Smittu Joseph

Graduate Theses and Dissertations

CiteSeerx is a digital library for scientific publications by computer science researchers. It also functions as a search engine with several features including autonomous citation indexing, automatic metadata extraction, full-text indexing and reference linking. Users are able to retrieve relevant documents from the CiteSeerx database directly using search queries and will further benefit if the system suggests document recommendations to the user based on their preferences and search history. Therefore, recommender systems were initially developed and continue to evolve to recommend more relevant documents to the CiteSeerx users. In this thesis, we introduce the Conceptual, Impact-Based Recommender (CIBR), …


Representation, Recognition And Collaboration With Digital Ink, Rui Hu Nov 2013

Representation, Recognition And Collaboration With Digital Ink, Rui Hu

Electronic Thesis and Dissertation Repository

Pen input for computing devices is now widespread, providing a promising interaction mechanism for many purposes. Nevertheless, the diverse nature of digital ink and varied application domains still present many challenges. First, the sampling rate and resolution of pen-based devices keep improving, making input data more costly to process and store. At the same time, existing applications typically record digital ink either in proprietary formats, which are restricted to single platforms and consequently lack portability, or simply as images, which lose important information. Moreover, in certain domains such as mathematics, current systems are now achieving good recognition rates on individual …


Caver Quest 3d Virtual Cave Simulation Of Snowy River In Fort Stanton Cave, Ronald J. Lipinski, Pete Lindsley Nov 2013

Caver Quest 3d Virtual Cave Simulation Of Snowy River In Fort Stanton Cave, Ronald J. Lipinski, Pete Lindsley

National Cave and Karst Management Symposium 2013

Virtual worlds, or 3D simulations through which an avatar can travel, is becoming a common means to display products or provide training in new environments. This paper describes the steps in producing the 3D virtual simulation of Snowy River in Fort Stanton Cave, New Mexico. A traditional cave survey and map with cross sections was used to produce a 3D meshed surface of the cave walls using the Blender software package. Photographs were taken of the walls, ceiling, and floor and merged together. The merged montage was applied to the 3D mesh walls as a “texture”. Unity3D was used to …


Search Of Small Objects By Topology Matching, Context Modeling, And Pattern Mining, Wei Zhang, Chong-Wah Ngo Nov 2013

Search Of Small Objects By Topology Matching, Context Modeling, And Pattern Mining, Wei Zhang, Chong-Wah Ngo

Research Collection School Of Computing and Information Systems

No abstract provided.


Vireo/Ecnu @ Trecvid 2013: A Video Dance Of Detection, Recounting And Search With Motion Relativity And Concept Learning From Wild, Chong-Wah Ngo, Feng Wang, Wei Zhang, Chun-Chet Tan, Zhanhu Sun, Shi-Ai Zhu, Ting Yao Nov 2013

Vireo/Ecnu @ Trecvid 2013: A Video Dance Of Detection, Recounting And Search With Motion Relativity And Concept Learning From Wild, Chong-Wah Ngo, Feng Wang, Wei Zhang, Chun-Chet Tan, Zhanhu Sun, Shi-Ai Zhu, Ting Yao

Research Collection School Of Computing and Information Systems

The VIREO group participated in four tasks: instance search, multimedia event recounting, multimedia event detection, and semantic indexing. In this paper, we will present our approaches and discuss the evaluation results


Multimedia Modeling, Chong-Wah Ngo, Klaus Schoeffmann, Yiannis Andreopoulos, Christian Breiteneder Nov 2013

Multimedia Modeling, Chong-Wah Ngo, Klaus Schoeffmann, Yiannis Andreopoulos, Christian Breiteneder

Research Collection School Of Computing and Information Systems

Multimedia modeling aims to study computational models for addressing real-world multimedia problems from various perspectives, including information fusion, perceptual understanding, performance evaluation and social media. The topic becomes increasingly important with the massive amount of data available over the Internet, representing different pieces of information in heterogeneous forms that need to be consolidated before being used for multimedia problems. On the other hand, the advancement in technologies such as mobile and sensing devices drive the needs for revisiting the existing models for not only dealing with audio-visual cues but also incorporating various sensory modalities that have potential in providing cheaper …


Symmetry Robust Descriptor For Non-Rigid Surface Matching, Zhiyuan Zhang, Kangkang Yin, Kelvin W. C. Foong Nov 2013

Symmetry Robust Descriptor For Non-Rigid Surface Matching, Zhiyuan Zhang, Kangkang Yin, Kelvin W. C. Foong

Research Collection School Of Computing and Information Systems

In this paper, we propose a novel shape descriptor that is robust in differentiating intrinsic symmetric points on geometric surfaces. Our motivation is that even the state-of-theart shape descriptors and non-rigid surface matching algorithms suffer from symmetry flips. They cannot differentiate surface points that are symmetric or near symmetric. Hence a left hand of one human model may be matched to a right hand of another. Our Symmetry Robust Descriptor (SRD) is based on a signed angle field, which can be calculated from the gradient fields of the harmonic fields of two point pairs. Experiments show that the proposed shape …


Web-Based Visual Analytics For Social Media Data, Jun Xiang Tee, David S. Ebert Oct 2013

Web-Based Visual Analytics For Social Media Data, Jun Xiang Tee, David S. Ebert

The Summer Undergraduate Research Fellowship (SURF) Symposium

Social media data provides valuable information about different events, trends and happenings around the world. Visual data analysis tasks for social media data have large computational and storage space requirements. Due to these restrictions, subdivision of data analysis tools into several layers such as Data, Business Logic or Algorithms, and Presentation Layer is often necessary to make them accessible for variety of clients. On server side, social media data analysis algorithms can be implemented and published in the form of web services. Visual Interface can then be implemented in the form of thin clients that call these web services for …


Interactive Focus+Context Glyph And Streamline Vector Visualization, Joshua Joseph Anghel Oct 2013

Interactive Focus+Context Glyph And Streamline Vector Visualization, Joshua Joseph Anghel

Boise State University Theses and Dissertations

With data sets growing in size, more efficient methods of visualizing and analyzing data are required. A user can become overwhelmed if too much data is displayed at once and be distracted from identifying potentially important features. This thesis presents methods for focus+context visualization of vector fields. Users can interact with the data in real time to choose which regions should have more emphasis through a mouse or touch interface. Streamlines and hedgehog based visualizations are used to provide focus+context to vector visualizations. The presented visualization methods are shown to be more computationally efficient and are shown to scale well …


Vertical Color Maps: A Data Independent Alternative To Floor Plan Maps, Alexander Salveson Nossum, Nicholas A. Giudice, Hengshan Li Oct 2013

Vertical Color Maps: A Data Independent Alternative To Floor Plan Maps, Alexander Salveson Nossum, Nicholas A. Giudice, Hengshan Li

Spatial Information Science and Engineering Faculty Scholarship

Location sharing in indoor environments is limited by the sparse availability of indoor positioning and lack of geographical building data. Recently, several solutions have begun to implement digital maps for use in indoor space. The map design is often a variant of floor-plan maps. Whereas massive databases and GIS exist for outdoor use, the majority of indoor environments are not yet available in a consistent digital format. This dearth of indoor maps is problematic, as navigating multistorey buildings is known to create greater difficulty in maintaining spatial orientation and developing accurate cognitive maps. The development of standardized, more intuitive indoor …


Annotation For Free: Video Tagging By Mining User Search Behavior, Yao Ting, Tao Mei, Chong-Wah Ngo, Shipeng Li Oct 2013

Annotation For Free: Video Tagging By Mining User Search Behavior, Yao Ting, Tao Mei, Chong-Wah Ngo, Shipeng Li

Research Collection School Of Computing and Information Systems

The problem of tagging is mostly considered from the perspectives of machine learning and data-driven philosophy. A fundamental issue that underlies the success of these approaches is the visual similarity, ranging from the nearest neighbor search to manifold learning, to identify similar instances of an example for tag completion. The need to searching for millions of visual examples in high-dimensional feature space, however, makes the task computationally expensive. Moreover, the results can suffer from robustness problem, when the underlying data, such as online videos, are rich of semantics and the similarity is difficult to be learnt from low-level features. This …


Static Saliency Vs. Dynamic Saliency: A Comparative Study, Tam Nguyen, Mengdi Xu, Guangyu Gao, Mohan Kankanhalli, Qi Tian, Shuicheng Yan Oct 2013

Static Saliency Vs. Dynamic Saliency: A Comparative Study, Tam Nguyen, Mengdi Xu, Guangyu Gao, Mohan Kankanhalli, Qi Tian, Shuicheng Yan

Computer Science Faculty Publications

Recently visual saliency has attracted wide attention of researchers in the computer vision and multimedia field. However, most of the visual saliency-related research was conducted on still images for studying static saliency. In this paper, we give a comprehensive comparative study for the first time of dynamic saliency (video shots) and static saliency (key frames of the corresponding video shots), and two key observations are obtained: 1) video saliency is often different from, yet quite related with, image saliency, and 2) camera motions, such as tilting, panning or zooming, affect dynamic saliency significantly.

Motivated by these observations, we propose a …


Image Search By Graph-Based Label Propagation With Image Representation From Dnn, Yingwei Pan, Yao Ting, Kuiyuan Yang, Houqiang Li, Chong-Wah Ngo, Jingdong Wang, Tao Mei Oct 2013

Image Search By Graph-Based Label Propagation With Image Representation From Dnn, Yingwei Pan, Yao Ting, Kuiyuan Yang, Houqiang Li, Chong-Wah Ngo, Jingdong Wang, Tao Mei

Research Collection School Of Computing and Information Systems

Our objective is to estimate the relevance of an image to a query for image search purposes. We address two limitations of the existing image search engines in this paper. First, there is no straightforward way of bridging the gap between semantic textual queries as well as users’ search intents and image visual content. Image search engines therefore primarily rely on static and textual features. Visual features are mainly used to identify potentially useful recurrent patterns or relevant training examples for complementing search by image reranking. Second, image rankers are trained on query-image pairs labeled by human experts, making the …


Error Recovered Hierarchical Classification, Shiai Zhu, Xiao-Yong Wei, Chong-Wah Ngo Oct 2013

Error Recovered Hierarchical Classification, Shiai Zhu, Xiao-Yong Wei, Chong-Wah Ngo

Research Collection School Of Computing and Information Systems

Hierarchical classification (HC) is a popular and efficient way for detecting the semantic concepts from the images. However, the conventional HC, which always selects the branch with the highest classification response to go on, has the risk of propagating serious errors from higher levels of the hierarchy to the lower levels. We argue that the highestresponse-first strategy is too arbitrary, because the candidate nodes are considered individually which ignores the semantic relationship among them. In this paper, we propose a novel method for HC, which is able to utilize the semantic relationship among candidate nodes and their children to recover …


The Vireo Team At Mediaeval 2013: Violent Scenes Detection By Mid-Level Concepts Learnt From Youtube, Chun Chet Tan, Chong-Wah Ngo Oct 2013

The Vireo Team At Mediaeval 2013: Violent Scenes Detection By Mid-Level Concepts Learnt From Youtube, Chun Chet Tan, Chong-Wah Ngo

Research Collection School Of Computing and Information Systems

The Violent Scenes Detection task continues to pose challenge in detecting violent scenes in Hollywood movies. In this working notes paper, we present the framework of our system and briefly discuss the performance results obtained in both objective and subjective subtasks. Besides using the low-level features for training the SVM classifiers for violent scenes detection, we show the feasibility in using the concept detectors to infer the occurrence of violent scenes. External Youtube data is exploited in our implementation to provide more diverse definition to violent scene concepts. Furthermore, we explore the feasibility of using Conditional Random Fields (CRF) to …


Cognitive Activity Support Tools: Design Of The Visual Interface, Paul Parsons Sep 2013

Cognitive Activity Support Tools: Design Of The Visual Interface, Paul Parsons

Electronic Thesis and Dissertation Repository

This dissertation is broadly concerned with interactive computational tools that support the performance of complex cognitive activities, examples of which are analytical reasoning, decision making, problem solving, sense making, forecasting, and learning. Examples of tools that support such activities are visualization-based tools in the areas of: education, information visualization, personal information management, statistics, and health informatics. Such tools enable access to information and data and, through interaction, enable a human-information discourse. In a more specific sense, this dissertation is concerned with the design of the visual interface of these tools. This dissertation presents a large and comprehensive theoretical framework to …


Vehicular Instrumentation And Data Processing For The Study Of Driver Intent, Taha Kowsari Sep 2013

Vehicular Instrumentation And Data Processing For The Study Of Driver Intent, Taha Kowsari

Electronic Thesis and Dissertation Repository

The primary goal of this thesis is to provide processed experimental data needed to determine whether driver intentionality and driving-related actions can be predicted from quantitative and qualitative analysis of driver behaviour. Towards this end, an instrumented experimental vehicle capable of recording several synchronized streams of data from the surroundings of the vehicle, the driver gaze with head pose and the vehicle state in a naturalistic driving environment was designed and developed. Several driving data sequences in both urban and rural environments were recorded with the instrumented vehicle. These sequences were automatically annotated for relevant artifacts such as lanes, vehicles …


Sitting On The Digital Divide, Singapore Management University Sep 2013

Sitting On The Digital Divide, Singapore Management University

Perspectives@SMU

Persuading the ‘digital resistant’ while balancing the digital:traditional media budget is a challenge for marketers


A Robust Rgbd Slam System For 3d Environment With Planar Surfaces, Po-Chang Su, Ju Shen, Sen-Ching S. Cheung Sep 2013

A Robust Rgbd Slam System For 3d Environment With Planar Surfaces, Po-Chang Su, Ju Shen, Sen-Ching S. Cheung

Computer Science Faculty Publications

With the increasing popularity of RGB-depth (RGB-D) sensors such as the Microsoft Kinect, there have been much research on capturing and reconstructing 3D environments using a movable RGB-D sensor. The key process behind these kinds of simultaneous location and mapping (SLAM) systems is the iterative closest point or ICP algorithm, which is an iterative algorithm that can estimate the rigid movement of the camera based on the captured 3D point clouds. While ICP is a well-studied algorithm, it is problematic when it is used in scanning large planar regions such as wall surfaces in a room. The lack of depth …


Tourguide: Augmented Reality Based On Structure Recognition, Michael Jipping Aug 2013

Tourguide: Augmented Reality Based On Structure Recognition, Michael Jipping

Faculty Presentations

TourGuide is software for Android phones and tablets that allows users to use augmented reality to see the world. TourGuide uses realtime video to search for images that match those in its database. When these images are found, TourGuide overlays the image with a number of options – from URLs/Web to video viewing – that are used to access more information about the image by touching the screen.

The software is designed so that configuring it is easy. In addition to the viewer, there is also a TourGuide Editor that allows configuration on a tablet or phone.


Online Instruction Made Easy: Getting Started With The Guide On The Side, Erica Defrain Aug 2013

Online Instruction Made Easy: Getting Started With The Guide On The Side, Erica Defrain

UVM Libraries Conference Day

Come learn about a great new tool for easily creating effective and engaging online tutorials built around the theory of active learning. The Guide on the Side was created by librarians at the University of Arizona and released as an open source download in 2012. We hope to soon have it installed for all to use at the UVM Libraries!


Towards Decrypting Attractiveness Via Multi-Modality Cue, Tam Nguyen, Si Liu, Bingbing Ni, Jun Tan, Yong Rui, Shuicheng Yan Aug 2013

Towards Decrypting Attractiveness Via Multi-Modality Cue, Tam Nguyen, Si Liu, Bingbing Ni, Jun Tan, Yong Rui, Shuicheng Yan

Computer Science Faculty Publications

Decrypting the secret of beauty or attractiveness has been the pursuit of artists and philosophers for centuries. To date, the computational model for attractiveness estimation has been actively explored in the computer vision and multimedia community, yet with the focus mainly on facial features. In this article, we conduct a comprehensive study on female attractiveness conveyed by single/multiple modalities of cues, that is, face, dressing and/or voice; the aim is to discover how different modalities individually and collectively affect the human sense of beauty.

To extensively investigate the problem, we collect the Multi-Modality Beauty (M2B) dataset, which is …