Inference Framework for Model Update and Development

Monday, February 26, 2018 - 01:30 pm
Meeting room 2265, Innovation Center
DISSERTATION DEFENSE Xiao Lin Advisor : Dr. Gabriel Terejanu Abstract Computational models play an important role in scientific discovery and engineering design. However, developing computational models is challenging, since the process always follows a path contaminated with errors and uncertainties. The uncertainties and errors inherent in computational models are the result of many factors, including experimental uncertainties, model structure inadequacies, uncertainties in model parameters and initial conditions, as well as errors due to numerical discretiza- tions. To realize the full potential in applications it is critical to systematically and economically reduce the uncertainties inherent in all computational models. The update and development of computational models is a recursive process between data assimilation and data selection. In data assimilation, measurements are incorporated into computational simulations to reduce the uncertainties of the model and in reverse, the simulations help determine where to acquire data such that most information can be provided. Currently, data assimilation techniques are overwhelmed by data volume and velocity and increased complexity of computational models. In this work, we develop a novel data assimilation approach EnLLVM which is based on linear latent variable model. There are several advantages of this approach. First, it works well with high dimensional dynamic systems, but only requires a small number of samples. Second, it can absorb model structure error and reflect the error in the uncertainty of data assimilation results. In addition, data assimilation is performed without calculating likelihood of observation, thus it can be applied to data assimilation problems in which likelihood is intractable. Obtaining informative data is also crucial, as data collection is an expensive endeavor for a number of science and engineering fields. Mutual information, which naturally measures information provided about one quantity by knowing the other quantity, has become a major design metric and has fueled a large body of work on experimental design. However, estimating mutual information is challenging and results are not reliable in high dimensions. In this work, we derive a lower bound of mutual information, which is computed in much lower dimensions. This lower bound can be applied to experimental design as well as other problems that require comparison of mutual information. At last, we develop a general framework for building computational models. In this framework, hypotheses about unknown model structure are generated by using EnLLVM for data assimilation and lower bound of mutual information for finding relations between state variables and unknown structure function. Then, different hypotheses can be ranked with model selection technique. This framework not only provides a way to infer model discrepancy, but also could further contribute to scientific discoveries.

Transfer Learning for Performance Analysis of Highly-Configurable Software Systems

Monday, February 26, 2018 - 10:15 am
Innovation Center, Room 2277
COLLOQUIUM Pooyan Jamshidi Abstract A wide range of modern software-intensive systems (e.g., autonomous systems, big data analytics, robotics, deep neural architectures) are built configurable. These systems offer a rich space for adaptation to different domains and tasks. Developers and users often need to reason about the performance of such systems, making tradeoffs to change specific quality attributes or detecting performance anomalies. For instance, developers of image recognition mobile apps are not only interested in learning which deep neural architectures are accurate enough to classify their images correctly, but also which architectures consume the least power on the mobile devices on which they are deployed. Recent research has focused on models built from performance measurements obtained by instrumenting the system. However, the fundamental problem is that the learning techniques for building a reliable performance model do not scale well, simply because the configuration space is exponentially large that is impossible to exhaustively explore. For example, it will take over 60 years to explore the whole configuration space of a system with 25 binary options. In this talk, I will start motivating the configuration space explosion problem based on my previous experience with large-scale big data systems in industry. I will then present my transfer learning solution to tackle the scalability challenge: instead of taking the measurements from the real system, we learn the performance model using samples from cheap sources, such as simulators that approximate the performance of the real system, with a fair fidelity and at a low cost. Results show that despite the high cost of measurement on the real system, learning performance models can become surprisingly cheap as long as certain properties are reused across environments. In the second half of the talk, I will present empirical evidence, which lays a foundation for a theory explaining why and when transfer learning works by showing the similarities of performance behavior across environments. I will present observations of environmental changes‘ impacts (such as changes to hardware, workload, and software versions) for a selected set of configurable systems from different domains to identify the key elements that can be exploited for transfer learning. These observations demonstrate a promising path for building efficient, reliable, and dependable software systems. Finally, I will share my research vision for the next five years and outline my immediate plans to further explore the opportunities of transfer learning. Pooyan Jamshidi is a postdoctoral researcher at Carnegie Mellon University, where he works on transfer learning for building performance models to enable dynamic adaptation of mobile robotics software as a part of BRASS, a DARPA sponsored project. Prior to his current position, he was a research associate at Imperial College London, where he worked on Bayesian optimization for automated performance tuning of big data systems. He holds a Ph.D. from Dublin City University, where he worked on self-learning Fuzzy control for auto-scaling in the cloud. He has spent 7 years in industry as a developer and a software architect. His research interests are at the intersection of software engineering, systems, and machine learning, and his focus lies predominantly in the areas of highly-configurable and self-adaptive systems (more details: https://pooyanjamshidi.github.io/research/). Date: Feb. 26, 2018 Time: 10:15-11:15 am Place: Innovation Center, Room 2277

Internet of Acoustic Things (IoAT): Challenges, Opportunities, and Threats

Monday, February 19, 2018 - 10:15 am
Storey Innovation Center, Room 2277
Abstract: The recent proliferation of acoustic devices, ranging from voice assistants to wearable health monitors, is leading to a sensing ecosystem around us -- referred to as the Internet of Acoustic Things or IoAT. My research focuses on developing hardware-software building blocks that enable new capabilities for this emerging future. In this talk, I will sample some of my projects. For instance, (1) I will demonstrate carefully designed sounds that are completely inaudible to humans but recordable by all microphones. (2) I will discuss our work with physical vibrations from mobile devices, and how they conduct through finger bones to enable new modalities of short range, human-centric communication. (3) Finally, I will draw attention to various acoustic leakages and threats that arrive with sensor-rich environments. I will conclude this talk with a glimpse of my ongoing and future projects targeting a stronger convergence of sensing, computing, and communications in tomorrow’s IoT, cyber-physical systems, and healthcare technologies. Bio: Nirupam Roy is a Ph.D. candidate in Electrical and Computer Engineering at the University of Illinois, Urbana-Champaign (UIUC). His research interests are in mobile sensing, wireless networking, and embedded systems with applications to IoT, cyber-physical-systems, and security. Roy is the recipient of the Valkenburg graduate research award, the Lalit Bahl fellowship, and the outstanding thesis awards from both his Bachelor's and Master's institutes. His recent research on "Making Microphones Hear Inaudible Sounds" received the MobiSys'17 best paper award and was selected for the ACM SIGMOBILE research highlights of the year in 2017.

Deep Learning and its Application in Bioinformatics: Case Study on Protein-peptide Binding Prediction

Friday, December 1, 2017 - 02:20 pm
Swearingen room 2A14
Speaker: Dr. Jianjun Hu Abstract: Deep learning has led to tremendous progress in computer vision, speech recognition, and natural language processing. It has now crossed the boundary and has brought breakthroughs also in the area of bioinformatics. One interesting problem is developing accurate models for predicting peptide binding affinities to protein receptors such as MHC(Major Histocompatibility complex), which can shed understanding to adverse drug reaction and autoimmune diseases and lead to more effective protein therapy and design of vaccines. We proposed a deep convolutional neural network (CNN) based peptide binding prediction algorithm for achieving substantially higher accuracy as tested in MHC-I peptide binding affinity prediction. Our model takes raw binding peptide sequences and affinity scores or binding labels as input without needing any human-designed features. The back-propagation training algorithm allows it to learn nonlinear relationships among the amino acid positions of the peptides. It also can naturally handle the peptide length variation, MHC polymorphasim, and unbalanced training samples of MHC proteins with different alleles via a simple amino acid padding scheme. Our experiments showed that DeepMHC can achieve the state-of-the-art prediction performance on most of the IEDB benchmark datasets with a single model architecture and without using any consensus or composite ensemble classifier models. Bio: I joined CSE department of the University of South Carolina in August 2007. I am now working on integrative functional genomics and especially integrative analysis of microarray data. I am also interested in motif discovery for understanding gene expression mechanisms involved in diseases. I got my Ph.D. in Computer Science in the area of machine learning and particularly evolutionary computation at the Genetic Algorithm Research and Application Group (GARAGe) of Michigan State University. My dissertation focuses on sustainable evolutionary computation algorithms and automated computational synthesis. I have worked on the DNA motif discovery problem as Postdoc at Kihara Bioinformatics Lab, Purdue University and microarray analysis at the Computational Molecular Biology Division at the University of Southern California (another USC).

Instant and Bug-Free Patch Generation for Fixing Heap Vulnerabilities

Wednesday, November 29, 2017 - 10:00 am
Storey 2277
Abstract: Patching is one of the most important measures to continuously uphold security throughout the life of a software system. Patch generation and deployment are probably the most critical tasks in the process of patching. However, patch generation is typically a lengthy procedure (according to Symantec, it takes an average of 28 days to release a patch for fixing a critical security bug); and patch deployment risks system stability due to new bugs contained in patches. From the perspective of speeding up patch generation and avoiding bugs in patches, we examine the notorious heap vulnerabilities, including heap buffer overflows (such as Heartbleed), uninitialized-read, use-after-free, and double-free, and explore the following two important but less-investigated problems: (1) How fast can heap patches be generated? (2) How to ensure zero bugs in the generated patches? While quick patch generation and bug-free patches are two naturally desired goals, they contradict each other in practice. Rushed patch generation tends to introduce bugs, while creating a quality patch requires significant time for debugging, testing, and even system redesign. Thus, how to achieve the two inherently contradictory goals simultaneously has been challenging. Inspired by “targeted therapy”, a cancer treatment that precisely recognizes and kills cancer cells, we propose Targeted Heap Therapy to pinpoint and treat vulnerable buffers, which are buffers that can be exploited to launch attacks, with instantly generated bug-free patches. This talk will also present some of the important problems and future prospects on Internet of Things security as well as our ongoing work on these problems. Bio: Dr. Qiang Zeng is a Tenure-Track Assistant Professor in the Department of Computer & Information Sciences at Temple University. He received his Ph.D. in Computer Science and Engineering from the Pennsylvania State University. He has rich industry experience and has worked in the IBM T.J. Watson Research Center, the NEC Lab America, Symantec and Yahoo. Dr. Zeng’s main research interest is Systems and Software Security. He currently works on Mobile Security, IoT Security, and deep learning for solving security problems. He has published papers in PLDI, NDSS, MobiSys, CGO, DSN and TKDE.

Ensuring the Observability of Structural Test Obligations

Thursday, November 16, 2017 - 11:00 am
Meeting room 2267, Innovation Center
THESIS DEFENSE Department of Computer Science and Engineering University of South Carolina Author : Meng Ying Advisor : Dr. Gregory Gay Abstract Test adequacy criteria are widely used to guide test creation. However, many of these criteria are sensitive to statement structure or the choice of test oracle. This is because such criteria ensure that execution reaches the element of interest, but impose no constraints on the execution path after this point. We are not guaranteed to observe a failure just because a fault is triggered. To address this issue, we have proposed the concept of observability—an extension to coverage criteria based on Boolean expressions that combines the obligations of a host criterion with an additional path condition that increases the likelihood that a fault encountered will propagate to a monitored variable. Our study, conducted over five industrial systems and an additional forty open-source systems, has revealed that adding observability tends to improve efficacy over satisfaction of the traditional criteria, with average improvements of up to 392.44% in mutation detection and per-model improvements of up to 1654.38%. Ultimately, there is merit to our hypothesis—observability reduces sensitivity to the choice of oracle and to the program structure.

Unmanned Systems & Robotics

Friday, November 10, 2017 - 02:20 pm
Swearingen room 2A14
I would like to invite you to attend this week's CSCE 791 seminar. These seminars highlight research being performed in our department and across the world. All CSCE 791 seminars are open to anybody who wishes to attend - not just students registered for the course. Friday, November 10, 2:20 - 3:10 PM Swearingen room 2A14 Speaker: Nikolaos Vitzilaios, University of South Carolina (Department of Mechanical Engineering) Abstract: The area of Unmanned Systems & Robotics has seen a tremendous growth over the last decades with autonomous systems being rapidly developed in many domains resulting in a wide range of applications that we can see in our daily lives (drones, autonomous cars, industrial robots, medical and service robotics, space robotics, etc.). This presentation will show the research of Dr. Nikolaos (Nikos) Vitzilaios in this area over the last 10 years, presenting the developments in specific areas over time and focusing on latest research as long as future research plans. The presentation will include research in the following areas: Aerial Robotics: applications of automatic control in unmanned fixed-wing aircraft and rotorcraft, including theoretical aspects as well as applied hardware and software developments. Several platforms have been developed over the last 10 years while the latest development will be presented based on a patented design of a dual-tilting quadcopter able to perform advanced navigation and control in narrow spaces as well as fault-tolerant control. Mobile Robotics: several ground robotic platforms will be presented, including customized commercial mobile robots as well as in-house built robots (including a patented one). These platforms are built for different applications and projects and the presentation will focus on the collaboration with aerial robots in critical missions UAV Aerodynamics: novel research in the area of circulation control wings for fixed-wing aircraft will be presented. Marine Robotics: a novel propulsion system will be presented for low speed propeller less robots that are required to be used in extreme environments (nuclear reactors). Medical Robotics: the latest research on the modelling of the human thumb will be presented accompanied by a new kinematic model that shows the importance of the thumb in grasping and how this will affect our perception for the development of future robotic hands. Mechatronic Systems: the development of an automatic bike gear shifter (patent pending). Modeling and control of complex and highly nonlinear systems. Future trends and planned research in perception and control. The presentation will focus on the outputs of each research project and will include demos and videos from field experiments and indoor-outdoor testing. Bio: Dr. Nikolaos (Nikos) Vitzilaios is an Assistant Professor at the Department of Mechanical Engineering, University of South Carolina, since August 2017. He holds a PhD in Mechanical Engineering from the Technical University of Crete (2010) and his PhD thesis was on the development of autonomous controllers for helicopter UAVs. Prior to joining USC, he was a Senior Lecturer in Robotics at Kingston University, London, UK. From 2011-2012, he was a Postdoctoral Fellow at the Department of Electrical Engineering, University of Alberta, Canada, working in the Applied Nonlinear Controls Laboratory and developing a helicopter UAV for power line inspection, funded by the Canadian government (NSERC). From 2012-2015, he was a Research Scientist at the University of Denver Unmanned Systems Research Institute (Department of Electrical Engineering), leading the Aerial Robotics Team and working on several projects in the area of unmanned systems funded by various agencies (NSF, ARL, NASA). Dr. Vitzilaios has more than 10 years of research and more than 5 years of teaching experience in the areas of Robotics and Controls, with notable presence in the Robotics & Automation society, more than 30 publications, one US patent and successful grant applications both in US and UK. His background is interdisciplinary from the areas of Mechanical Engineering, Electrical Engineering and Computer Science. His research interests span the broad area of Autonomous Unmanned Systems where he has significant hands-on experience in all kinds of robotic applications (aerial, ground, marine, industrial, biomedical). His research is mainly experimental and his interests include prototype development and commercialization of research outcomes. He is a Fellow of the Higher Education Academy in UK and a member of IEEE, AIAA, AUVSI and IFAC. He is a Chartered Mechanical Engineer in the Technical Chamber of Greece since March 2005.