Projects
Active Projects
PABCC
2024
Sponsor: PA Breast Cancer Coalition
RAPID AND INEXPENSIVE PRECISION BREAST CANCER SCREENING USING MACHINE LEARNING
The purpose of this proposal will be to create software for automatically segmenting breast cancer biopsy and pathology slides. Our goal is to create decision support software that can perform the labor-intensive segmentation task while leaving the analysis and diagnosis to a trained pathologist.
Read MoreNSF FET QUANT
2022
Sponsor: National Science Foundation
QUANTUM COMPUTING BASED APPROACH TO UNDIRECTED GENERATIVE MACHINE LEARNING MODELS
We propose a novel way of applying Quantum Annealing Computers (QACs) to training and sampling from Deep Boltzmann Machine (DBM) model distributions. Our method combines QAC and classical sampling.
Read MoreTU CAT DPATH
2021
Sponsor: Temple University
AUTOMATIC INTERPRETATION OF DIGITAL PATHOLOGY IMAGES USING DEEP LEARNING
This is a collaborative and interdisciplinary project to detect and characterize cancerous cells in digitized images of pathology slides, while also producing the world’s largest catalog of research-grade digital pathology slides.
Read MorePast Projects
Sponsor: Temple University
MODELING MOVEMENT THROUGH HETEROLOGOUS BIOFILM STRUCTURES FOR IMPROVED ANTIBIOTIC DELIVERY
We use antibiotics and nanoparticles moving through biofilms as well as mixed species biofilms given the implications for two clinically relevant topics: (1) the treatment of pathogenic biofilms and (2) determining whether heterologous structures in microbiota help prevent the complete eradication of microbiota during antibiotic treatment.
Read MoreSponsor: National Science Foundation
CCRI: PLANNING: DIGITAL PATHOLOGY RESEARCH CONSORTIUM
This proposal supports a community planning effort for digital pathology focused on planning the data and evaluation resources that will enable high performance, automated interpretation of pathology images using machine learning. We have created a diverse community of digital pathology researchers that contributes data, software tools, and expertise with the communal goal of improving healthcare outcomes through enhanced analysis of biological tissue.
Read MoreSponsor: National Science Foundation
REAL-TIME ANALYSIS OF ELECTROENCEPHALOGRAMS IN AN INTENSIVE CARE ENVIRONMENT
In this project, we have developed a real-time version of our state of the art automated seizure detection software. We have also improved performance of the system. The overall goal is to develop technology that is marketable.
Read MoreSponsor: Pennsylvania Department of Transportation
PENNDOT AGENT TRAINING
The primary goal of this project is to research, evaluate, improve and develop course material and delivery methods related to training required for authorized agents of PennDOT.
Read MoreSponsor: Temple University
PREDICTING ENDOGENOUS BANK HEALTH FROM FDIC STATISTICS ON DEPOSITORY INSTITUTIONS USING DEEP LEARNING
Our goal is to assess baseline performance of deep learning systems on operational banking data provided by the U.S. Federal Government (FDIC). Operational data contain many forms of imperfection that make it extremely difficult for the application of traditional deep leaning systems.
Read MoreSponsor: National Science Foundation
MRI: HIGH PERFORMANCE DIGITAL PATHOLOGY USING BIG DATA AND MACHINE LEARNING
The goal of this project is to build a large open-source database of pathology slides that can be used to train high performance deep learning models. The outcome will be a sustainable facility to rapidly collect large amounts of automatically annotated whole slide images.
Read MoreSponsor: Pennsylvania Department of Health
ENABLING THE APPLICATION OF DEEP LEARNING TO AUTOMATED SEIZURE DETECTION
Funded by the Commonwealth Universal Research Enhancement Program (CURE), the goal of this proposal is to quadruple the size of the TUH EEG Seizure Detection Corpus by manually annotating three years' worth of EEG data collected at Temple Hospital.
Read MoreSponsor: Temple College of Engineering
PEER MENTORING
An app that connects undergraduate students with the most appropriate peer mentor. This project was executed as an independent study course by a team of outstanding undergraduates.
Read MoreSponsor: Pennsylvania Department of Transportation
HIGHWAY INCIDENT TIMELINE DETECTION
The primary goal of this project is to estimate the time it takes for PennDOT to be notified when a highway incident occurs. Emergency response records from counties were manually paired with PennDOT 911 call logs to estimate the time delay.
Read MoreSponsor: National Institutes of Health
COST-EFFECTIVE CLOUD COMPUTING
This project, which was part of many of our funded research efforts, involved the development of a low-cost extensible heterogeneous compute cluster to support machine learning experiments.
Read MoreSponsor: National Institutes for Health
AUTOMATIC DISCOVERY AND PROCESSING OF EEG COHORTS FROM CLINICAL RECORDS
The primary goal of this project is to enable comparative research by automatically uncovering clinical knowledge from a vast BigData archive of clinical EEG signals and EEG reports.
Read MoreSponsor: National Science Foundation
AUTOMATIC INTERPRETATION OF EEGS
Commercialization of our automated interpretation technology. This project led our team through a process to define the market for EEG technology and focus the technology so that it addresses the greatest customer needs.
Read MoreSponsor: National Science Foundation
BRAINWAVE RECOGNITION
This project is a half-day laboratory on brainwave recognition that includes an introduction to Python. It was originally developed as part of our outreach program for Temple University's Women in Engineering Summer Program.
Read MoreSponsor: University City Science Center and Temple University
AUTOMATIC INTERPRETATION OF EEGS
The goal of this project is to apply machine learning techniques successful in speech recognition to electroencephalography (EEG). A system is being developed to generate a transcription of an EEG signal in real time.
Read MoreSponsor: DARPA, Temple University
EEG CORPUS CREATION
In this project, we are developing the largest EEG corpus ever to be publicly released. The corpus consists of over 20,000 EEGs dating back to 2002. In addition to the raw signal data, metadata about the subjects is available, including medical conditions and treatments. Physician interpretations of the data are also included making this an invaluable resource for machine learning experiments.
Read MoreSponsor: Temple University
AMERICAN SIGN LANGUAGE FINGER SPELLING RECOGNITION
This project investigated the application of nonparametric Bayesian statistical modeling to automatic recognition of finger spelling from single images and video. It is an attempt to extend our work in speech recognition to image recognition and understanding.
Read MoreSponsor: Temple University
VARIATIONAL INFERENCE ALGORITHMS FOR PHONE CLASSIFICATION
We investigated the use of three nonparametric Bayesian variational inference algorithms for phoneme classification on TIMIT and CALLHOME English and Mandarin.
Read MoreSponsor: National Institutes of Health
THE NEURAL ENGINEERING DATA CONSORTIUM
This organization develops open source big data resources designed to accelerate progress in machine learning applications in bioengineering.
Read MoreSponsor: Temple University
NONPARAMETRIC BAYESIAN ACOUSTIC MODELS
The goal of this project is to apply nonparametric Bayesian approaches to automatically discover subword acoustic units for high performance speech recognition systems.
Read MoreSponsor: Temple University
ALTERNATIVE ENERGY DEMONSTRATION
A very successful demonstration of the challenges with generating energy from human powered vehicles. This technology was originally developed for Earth Day at Temple University of April 21, 2011, and consists of a set of bicycles equipped with generators so that electricity can be generated by pedaling the bikes.
Read MoreSponsor: Temple University
HTK TUTORIALS
This web site contains detailed tutorials on how to run Cambridge University's Hidden Markov Modeling Toolkit (HTK) on common speech recognition tasks. Complete turnkey systems, along with expected results, are provided, making it very easy for researchers new to HLK to get started.
Read MoreSponsor: Temple University
AUTOMATIC ACOUSTIC UNIT CLASSIFICATION
We investigated automatic derivation of acoustic units for speech recognition using new algorithms such as Particle Swarm Optimization (PSO).
Read MoreSponsor: Temple University
A MATLAB VISUALIZATION TOOL FOR HYDROLOGY
In this project, we developed an interactive MATLAB-based visualization tool to analyze and model water flow in shallow basins such as streams and rivers.
Read MoreSponsor: Temple University
KEYWORD SEARCH TERM STRENGTH
The goal of this project was to develop a tool that predicts the strength of a keyword search term. We convert a proposed search term to a feature representation and then use these features to predict the reliability of a search term. Key factors include the number of syllables in the word and the phonetic content.
Read MoreSponsor: Temple University
SUMMER OF CODE: RESEARCH EXPERIENCE FOR UNDERGRADUATES
Every summer we welcome a group of enthusiastic undergraduates into our lab with a goal of developing their software skills and introducing them to fields such as speech and EEG signal processing, machine learning and big data.
Read MoreSponsor: Mississippi State University
CAMPUS BUS NETWORKING
Networked vehicles are the cornerstone of the next generation intelligent transportation system. In this project, we developed the hardware and software necessary to perform two-way communications with a vehicle track and to collect critical vehicle performance data.
Read MoreSponsor: Mississippi State University
IN-VEHICLE DIALOG SYSTEMS
A voice interface is a superb tool for in-vehicle information access when your hands and eyes are busy. In this project, we are developed a dialog system that provides information about the university and its surrounding area. For example, a user can ask "Where is the nearest restaurant to my hotel?" or "How do I get from the airport to my hotel?".
Read MoreSponsor: Mississippi State University
IP VERSION 6 RESEARCH
IP version 6 (IPv6) is the next generation Internet protocol that has the potential to drastically change the way we use the Internet as part of our everyday lives. We investigated peer to peer IPv6 networks and applications, mobile IPv6, and high performance routing.
Read MoreSponsor: National Science Foundation
NONLINEAR STATISTICAL MODELING OF SPEECH
Hidden Markov models (HMMs) have been the primary approach to speech recognition many years. The goal of this project was the development of a new approach to statistical modeling of speech based on nonlinear statistics. We implemented a speaker recognition system using a variety of nonlinear models including linear dynamic models and probabilistic mixtures of autoregressive models.
Read MoreSponsor: Conversay, Inc.
ROBUST ACOUSTIC MODELING
Field deployment of speech recognition technology results in a number of interesting problems, such as microphone saturation, which severely limit the performance of speech recognition engines. In this project, we studied the effects of microphone saturation and develop algorithms to improve robustness to saturation, clipping, and other forms of signal degradation.
Read MoreSponsor: Mississippi State University
VEHICLE PERFORMANCE MONITORING SYSTEM
This project is a one-year collaboration with the Mississippi Department of Transportation (MDOT) to adapt and apply the Mississippi State University wireless web-based vehicle performance and monitoring system (VPMS).
Read MoreSponsor: Mississippi State University
POWERTRAIN DESIGN AND OPTIMIZATION
State of the art design tools in automotive engineering lacks the power, sophistication, and automation of design tools for the electronics industry. We fundamentally advanced automotive design engineering by introducing optimization and physics-based design principles into standard industry design tools. This allowed designers to globally optimize design criteria such as size, efficiency, cost, weight, volume, and achieve unprecedented reductions in design turnaround time.
Read MoreSponsor: Creare, Inc.
COGNITIVE ASSESSMENT USING VOICE ANALYSIS
The goal of this project was to design an effective fatigue monitoring and assessment system by characterizing changes in a human voice as a speaker becomes fatigued or stressed. A remote, near-real-time assessment system to monitor the fatigue levels of military personnel was developed.
Read MoreSponsor: Mississippi State University
MOBILE COMPUTING USABILITY
Increasingly smaller and more complex computing devices have made the human interface to these systems more critical than ever. The primary goal of this project was to study the design and usability of interfaces to a variety of portable and ubiquitous computing devices.
Read MoreSponsor: ETSI
AURORA EVALUATION OF SPEECH RECOGNITION FRONT ENDS
The goal of this project was to evaluate and compare the robustness of feature extraction algorithms on a large vocabulary task. The target application is cellular telephony. These evaluations are being conducted under the auspices of the Aurora Distributed Speech Recognition working group of The European Telecommunications Standards Institute (ETSI). The Wall Street Journal database (WSJ0) is being used as the basis for experiments.
Read MoreSponsor: ETSI
BULLDOG STOCK EXCHANGE
As part of a unique entrepreneurship thrust in MS State's College of Engineering, EE Senior Design teams form companies. These companies are publicly traded on the Bulldog Stock Exchange. This simulation teaches our students about the intimate relationships between technology and business.
Read MoreSponsor: National Science Foundation
SPOKEN LANGUAGE INFORMATION RETRIEVAL
Integration of prosodic information, speech recognition and parsing can positively impact the problem of information extraction from spoken documents. This research provided the initial steps towards information extraction from telephone conversations and served as the basis for sophisticated web browsing
Read MoreSponsor: Dragon
SOUTHERN-ACCENTED SPEECH
Southern accents are underrepresented in most pubicly available databases. This had led to speculation that performance for such speakers is worse than other better-represented dialects. To test this hypothesis, a small data collection effort was conducted that targeted Southern-accented speakers. Data was collected from February 21 to February 25, 2000. The data collected consisted of a total of 23 speakers (13 males and 10 females) ranging in age from 18 to 56.
Read MoreSponsor: MITRE
ROBUST LOW PERPLEXITY VOICE INTERFACES
Robust speech recognition technology for speech recorded and transmitted over narrowband channels requires advances in several components of a speech recognition system: signal processing techniques that produce invariant feature sets; acoustic modeling and training that produce channel-independent acoustic models; noise cancellation techniques that mitigate the effects of impulsive and application- dependent transient noise. This project was a one-year collaboration with the MITRE Corporation that resulted in a prototype of a near real- time system that provided a robust and flexible command and control voice interface in realistic tactical noisy environments.
Read MoreSponsor: National Science Foundation
INTERNET-ACCESSIBLE SPEECH RECOGNITION TECHNOLOGY
Speech recognition research has always been a core competency in ISIP. Large vocabulary conversational speech recognition LVCSR) is a fascinating technology that draws heavily from the diverse research areas of statistical pattern recognition, digital signal processing, artificial intelligence, linguistics, and information theory. On this web site you will find a powerful and flexible public domain speech recognition system written in C++.
Read MoreSponsor: DoD
SWITCHBOARD RESEGMENTATION
The SWITCHBOARD Corpus (SWB) has become critical to the success of state-of-the-art LVCSR systems. Using this data, however, has not been without its share of drawbacks. Word-level transcription of SWB is difficult, and conventions associated with such transcriptions are highly controversial and often application dependent. By 1998, the quality of the SWB transcriptions for LVCSR was recognized to be less than ideal, and many years of small projects attempting to correct the transcriptions had taken their toll. In February of 1998 ISIP began a project to do a final cleanup of the SWB Corpus, and to organize and integrate all existing resources related to the data into this final release.
Read MoreSponsor: LDC
A JAPANESE COMMAND AND CONTROL WORD DATABASE
The Japan Electronic Industry Development Association's Common Speech Data (JCSD) Corpus is an isolated phrase corpus consisting of 150 speakers (75 males/75 females) and almost 200,000 utterances. It represents an important milestone in Japanese speech recognition technology development. In this project we organized and preprocessed the data so that it was ready for distribution by the Linguistic Data Consortium.
Read MoreSponsor: USFS
SCENIC BEAUTY ESTIMATION OF FORESTRY IMAGES
The United States Department of Agriculture and Forest Services requested we develop an algorithm to automatically determine the scenic beauty of a given forest scene. Their requirement is a consequence of rising public concern to preserve forest beauty. To achieve this, we have developed an extensive database that can support our algorithm development. The database consists of 637 unique images, each image having various subjective ratings for their scenic beauty content. The database extensively samples several dimensions of the problem including year, season, time of day, angle and treatment. In order to automatically relate the beauty of an image to the subjective beauty ratings, we developed algorithms to extract features from the image that determine its scenic beauty.
Read MoreSponsor: Texas Instruments
AUTOMATIC PRONUNCIATION GENERATION
Correct recognition of proper nouns is critical to problems in speech understanding and applications involving voice interfaces. We developed a suite of algorithms involving stochastic neural networks, decision trees and other statistical techniques that are capable of automatically generating multiple pronunciations for proper nouns based on only the text-based spelling of the name.
Read MoreSponsor: LDC
A DIGITAL TELEPHONE INTERFACE FOR SUN WORKSTATIONS
Using the Linkon system, a speech data collection board, we developed a fully- expandable, robust system for platform-independent collection of telephone speech data. Our object-oriented software libraries and intuitive GUI provide powerful tools with which even a novice user can efficiently prototype complex applications. Using the system one can generate programs which range from simple single-user prompt/record demonstrations to robust SWITCHBOARD-type multi-user applications.
Read More