Kepuska, Veton
Veton Kepuska
Emeritus Faculty | College of Engineering and Science: Department of Electrical Engineering and Computer Science
Personal Overview
My goal is to make a significant contribution to advancing Human-Machine Interaction and Communication through my Wake-Up-Word (WUW) Speech Recognition (SR) technology. Conventional speech recognition systems typically operate, at their best, at about 99% accuracy. For a person speaking at a natural conversational rate of 100 words per minute, this implies at least one error per minute. My research has shown that WUW SR makes about one error per 3 hours!
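The arithmetic behind these figures can be sketched as follows. This is only a rough illustration: the function name `errors_per_hour` and the assumption of a constant speaking rate are hypothetical and not part of the research itself, and the comparison simply takes the two quoted figures at face value.

```python
def errors_per_hour(accuracy: float, words_per_minute: float) -> float:
    """Expected recognition errors per hour at a constant speaking rate.

    Hypothetical helper for illustration only.
    """
    error_rate = 1.0 - accuracy              # fraction of words misrecognized
    return error_rate * words_per_minute * 60.0


# Figures quoted in the overview: 99% accuracy at 100 words per minute.
conventional = errors_per_hour(accuracy=0.99, words_per_minute=100)

# Figure quoted in the overview for WUW SR: one error per 3 hours.
wuw = 1.0 / 3.0

ratio = conventional / wuw

print(f"Conventional SR: about {conventional:.0f} errors per hour")
print(f"WUW SR: about {wuw:.2f} errors per hour")
print(f"Ratio: about {ratio:.0f} to 1")
```

Under these assumptions, one error per minute corresponds to roughly 60 errors per hour, against about one third of an error per hour for WUW SR.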
Educational Background
1990 |
Ph.D. |
Computer Engineering |
Dissertation |
Artificial Neural Networks for Speech Recognition Applications |
|
Advisor |
John N. Gowdy |
|
1986 |
M.S. |
Computer Engineering |
Advisor |
John N. Gowdy |
|
1981 |
Dipl. Eng. |
Electrical Engineering, University of Prishtina |
Thesis |
The use of the Analog Computers for Simulation and Automatic Control |
|
Advisor |
Abdurrahman Grapci |
|
1976 |
Diploma |
Mathematical Gymnasium |
Diploma Work |
Experimental Methods for Measurements of the Speed of Light |
|
Advisor |
Skender Skenderi |
Professional Experience
2003 - Present |
Florida Institute of Technology, Electrical and Computer Engineering – Associate Professor:
http://faculty.uml.edu/Mufeed_Mahd/UML_ADI/photo_fit.htm
CS Dept. Curriculum Series Presentation, 2005
|
2001 - 2003 |
Speech Recognition Scientist - ThinkEngine Networks, Inc., 175 Maple Street, Marlborough, MA 01745. USA.
Those scripts use numerous executables, gnuplot (a graph-plotting tool), and other Perl scripts. The end result of this process is the automatic generation of a number of plots, charts, and graphs that depict the performance of the system for easy evaluation and comparison.
|
1999 – 2001 |
Speech Recognition Scientist – SpeechWorks International, Inc., Product Group, 695 Atlantic Ave., Boston, MA 02111. USA.
|
1997 - 1999 |
Scientist - GTE, BBN Technologies, Speech Solutions Group, 70 Fawcett St., Cambridge, MA 02138. USA.
|
1993 - 1997 |
Speech Scientist – Voice Processing Corporation/Voice Control Systems, Advanced Technology Development Group, One Main Street, MA 02142, USA.
- Analyzed the conflicting effect of window size and type (higher frequency resolution causing breakdown of enhancement due to harmonics),
- Analyzed several possible modifications of the enhancement algorithm to accommodate higher frequency resolution, and
- Proposed elimination of pitch harmonics from the spectrum with homomorphic filtering or an LPC-based spectrum.
|
1990 – 1993 |
Post-Doctoral Research Associate - Swiss Federal Institute of Technology, IGP, ETH-Hönggerberg, CH-8093 Zürich, Switzerland.
|
1985 – 1990 |
Teaching Assistant – Electrical and Computer Engineering Department, Clemson University.
|
1987 - 1990 |
Consultant - Engineering Research and Computer Services Department, Clemson University, Electrical and Computer Engineering Department, Clemson, SC 29634-0915, USA.
- Management of the repair and maintenance orders,
- Task allocation and duty assignment,
- Timetable management of the assigned personnel, and
- Generation of relevant statistical data.
|
1985 - 1986 |
Software Engineer - Keiltronix: Textile Control Systems Inc., 2910 Horseshoe Lane, P.O. Box 1923, Charlotte, NC 28219.
|
1981 - 1984 |
Assistant Lecturer - Electrical Engineering Faculty, University of Prishtina, Republic of Kosova.
|
Selected Publications
PATENTS:
- Dynamic Time Warping (DTW) Using Frequency Distributed Distance Measures: 6983246, January 3, 2006.
- Scoring and Rescoring Dynamic Time Warping of Speech: 7085717, April 1, 2006.
- Exploiting Differences in Correlations for Modeled and Un-Modeled Sequences by Transforming Trained Model Topology in Sequence Recognition: Provisional Patent Application, August 2009
BOOK CHAPTER
- Këpuska, V. "Wake-Up-Word Speech Recognition", Speech Technologies / Book 1, Intech, ISBN 978-953-307-152-7, February 2011.
JOURNAL PUBLICATIONS
- Këpuska, V. et al. (2012). Energy Savings from using Mobile Smart Technologies, Journal of Renewable and Sustainable Energy, Submitted 2012
- Këpuska, V., Xerxes, B., & Powers, S. (2011). "Phoning Home: Bridging the Gap between Conservation and Convenience", JSTEM, 2012.
- Këpuska, V., & Rojanasthien, P. (2011). Speech Corpus Generation from DVDs of Movies and TV Series, JITIM, 2011-2012.
- Këpuska, V. (2010). Wake-Up-Word Recognition. SPIE Newsroom, Oct 6 2010. DOI: 10.1117/2.1201009.003154 http://spie.org/x42008.xml?ArticleID=x42008
- Rodriguez, W., Fiore, S., De Welde, K., Carstens, D., Këpuska, V. (2010). Ubiquitous Collaboration (uC) Learning, Ubiquitous Learning: Journal of International Technology and Information Management.
- Këpuska, V., & Klein, T. (2009). On Wake-Up-Word Speech Recognition Task, Technology, and Evaluation. Elsevier Journal of Nonlinear Analysis.
- Këpuska, V., Gurbuz, S., Rodriguez, W., Fiore, S., Carstens, D., Converse, P., Metcalf, D. (2009). uC: Ubiquitous Collaboration Platform for Multimodal Team Interaction Support, Submitted to Journal of International Technology and Information Management (IJTIM), Invited Paper Special Issue on Knowledge Management and Business Intelligence
- Këpuska, V. and Mason, S., (1995). A Neural Network Approach to Signalized Point Recognition in Aerial Photographs, Photogrammetric Engineering & Remote Sensing, Vol. 61, No. 7, pp. 917-925, July 1995.
- Mason, S. and Këpuska, V., (1992). CONSENS: An Expert System for Photogrammetric Network Design, Allgemeine Vermessungs-Nachrichten, pp. 384-393, September 1992.
CONFERENCE PUBLICATIONS:
- Këpuska, V. (2012). Elevator Simulator, IEEE-ESPA, Las Vegas, 2012
- Këpuska, V., & Shih, C. (2010). Prosodic Analysis of Alerting and Referential Contexts of Sentinel Words. International Conference on Artificial Intelligence and Pattern Recognition (AIPR'10), Orlando, Florida, 2010
- Këpuska, V., & Klein, T. (2008). On Wake-Up-Word Speech Recognition Task, Technology, and Evaluation Results against HTK and Microsoft SDK 5.1. Invited Paper: World Congress on Nonlinear Analysts, Orlando 2008, To appear in Journal of Nonlinear Analysis, Theory, Methods & Applications.
- Beharry, X., Këpuska, V., Powers, S., Ramdhan, R., Rojanasthien, P., Weerasooriya, A., (2008). Patriot Robotic System Design, Florida Conference on Recent Advances in Robotics, FCRAR 2008
- Këpuska, V., Carstens, D. S., & Wallace, R. (2006). Leading and Trailing Silence in Wake-Up-Word Speech Recognition, Proceedings of the International Conference: Industry, Engineering & Management Systems 2006, Cocoa Beach, FL., 259-266.
- Këpuska V., (2006). Wake-Up-Word Application for First Responder Communication Enhancement, SPIE, Orlando, 2006.
- Këpuska V., Rogers N., Patel M., (2006). A MATLAB Tool for Speech Analysis, Processing and Recognition: SAR-LAB, ASEE, Chicago, 2006.
- Kasza T., Shahsavari M., Këpuska V., Chen Ch., (2006). Communications Protocol for RF-based Indoor Wireless Localization Systems, SPIE, Orlando, 2006.
- Anagnostopoulos G., Georgiopoulos M., Ports K., Richie S., White M., Këpuska V., Chan P. K., Wu A., Kysilka M., (2006). Engaging Undergraduate Students in Machine Learning Research: Progress, Experiences and Achievements of Project EMD-MLR, Proceedings of the ASEE 2006 Annual Conference and Exposition, June 18-21, Chicago, Illinois.
- Anagnostopoulos G., Georgiopoulos M., Ports K., Richie S., Cardinale N., White M., Këpuska V., Chan P., Wu A., Kysilka M., (2005). Project EMD-MLR: Educational Material Development and Research in Machine Learning for Undergraduate Students, Session 3232, Proceedings of the ASEE 2005 Annual Conference and Exposition, June 12-15, Portland, Oregon.
- Mason, S. and Këpuska, V., (1992). On the Representation of Close-Range Network Design Knowledge, XVII ISPRS Congress, Washington D.C., August 1992.
- Këpuska, V. and Mason, S., (1991). Automatic Signalized Point Recognition with Feed-Forward Neural Network, IEE Second International Conference on Artificial Neural Networks, Bournemouth, U.K., November 1991.
- Mason, S., Beyer, H., and Këpuska, V., (1991). An AI-based Photogrammetric Network Design System, First Australian Photogrammetric Conference, University of Newcastle, Australia, November 1991.
- Këpuska, V. and Mason, S., (1991). Artificial Neural Network Approach to Signalized Point Recognition in Aerial Photographs, First Australian Photogrammetric Conference, University of Newcastle, Australia, November 1991.
- Këpuska, V., Beyer, H. and Mason, S., (1991). Artificial Neural Networks for Calibration of CCD-Cameras, Workshop on Industrial Applications of Neural Networks, Ascona, Switzerland, September 1991.
- Këpuska, V. and Gowdy, J., (1990). On the Effect of Topological Structure of the Kohonen Network on the Performance of the Hierarchical two Layered Isolated Word Recognition System, IEEE Southeastcon Symposium, New Orleans, April 1990.
- Këpuska, V. and Gowdy, J., (1989). Investigation of Phonemic Context in Speech using Self-Organizing Feature Maps, IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP’89, Glasgow, Scotland, May 1989.
- Këpuska, V. and Gowdy, J., (1989). Phonemic Speech Recognition Based on Neural Network, IEEE Southeastcon Symposium, Columbia, April 1989.
- Këpuska, V. and Gowdy, J., (1988). The Kohonen Net for Speaker Dependent Isolated Word Recognition, IEEE Southeastern Symposium on Systems Theory, UNCC Charlotte, March 1988.
- Këpuska, V. and Gowdy, J., (1987). Evaluation of Digital Signal Processing Chips for Speech Processing Applications, IEEE Southeastern Symposium on Systems Theory, Clemson University, Clemson, March 1987.
- Këpuska, V. and Gacaferri, J., (1979). The Determination of the Polynomial Coefficients for Approximation of the EKG with Computer, (in Serbo-Croatian), Symposium JUREMA, Zagreb 1979.
- Këpuska, V. and Mason, S., (1992). NFP23: Design and Analysis of Spatial Image Sequences, Wissenschaftlicher Bericht zum Schweizerischen Nationalfonds zur Förderung der wissenschaftlichen Forschung, 1992.
- Këpuska, V. and Mason, S., (1992). Design and Analysis of Spatial Image Sequences, NFP 23 Third Annual Status Report, Bern, July 6, 1992.
- Këpuska, V. and Mason, S., (1991). NFP23: Design and Analysis of Spatial Image Sequences, Wissenschaftlicher Bericht zum Schweizerischen Nationalfonds zur Förderung der wissenschaftlichen Forschung, 1991.
- Këpuska, V. and Mason, S., (1991). Design and Analysis of Spatial Image Sequences, NFP 23 Second Annual Status Report, Bern, June 5, 1992.
- Mason, S. and Këpuska, V., (1991). NFP 23: Design and Analysis of Spatial Image Sequences (Project Summary), SGAICO Newsletter, Swiss Group for Artificial Intelligence and Cognitive Science, 1991.
Recognition & Awards
2011 |
FaST - Calculate Potential Energy Savings from Using Mobile Smart Technologies. |
2008 - 2009 |
|
2008 |
Greatest Commercial Potential - "Smart Room" Senior Design 2008. |
2007 |
Third Place in IEEE SoutheastCon Student Hardware Competition: Basketball Robot |
2007 |
Best Junior Design 2007 - Visual Audio |
2006 |
Best Paper Nomination "2006-472: A MATLAB TOOL FOR SPEECH PROCESSING, ANALYSIS AND RECOGNITION: SAR-LAB" |
2005 |
UML-ADI Assistive Device Competition, June 2005, University of Massachusetts Lowell, MA, First Place |
1984 – 1985 |
|
1987 – 1988 |
|
1977 – 1979 |
|
Research
Wake-Up-Word Speech Recognition: http://spie.org/x42008.xml
https://www.intechopen.com/chapters/15946