SlideShare a Scribd company logo
IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308
_______________________________________________________________________________________
Volume: 05 Issue: 04 | Apr-2016, Available @ http://ijret.esatjournals.org 342
DEMONSTRATION OF VISUAL-BASED AND AUDIO-BASED HCI
SYSTEM
Priya Rathore1
, Anjali Khatri2
, Aparna Pushpad3
, Swekcha Jain4
1
Assistant Professor, Electronics and Communication Department, Medi-caps Institute of Science and Technology,
Madhya Pradesh, India
2
UG student, Electronics and Communication Department, Medi-caps Institute of Science and Technology, Madhya
Pradesh, India
3
UG student, Electronics and Communication Department, Medi-caps Institute of Science and Technology, Madhya
Pradesh, India
4
UG student, Electronics and Communication Department, Medi-caps Institute of Science and Technology, Madhya
Pradesh, India
Abstract
This paper is an attempt to provide a bird’s eye view to the concept of Human Compute Interaction (HCI). The intention is to
focus on the uni-modal architecture of HCI; especially the HCI system based on visual-based and color-based communication
channels viz-a-viz color recognition and speech recognition. We have developed a Graphical User Interface (GUI) for the same
using MATLAB; one push button assigned for color input (through webcam) and the other push button assigned for speech input
(through microphone). In color recognition, primary colors i.e. RGB are detected in frames captured in real time or images
uploaded offline. Subsequently, desired operation is executed (we have set commands to open D drive). In speech recognition,
audio input through microphone is compared with a pre-stored audio file and then an operation is performed automatically (here,
we have set commands to open Google web browser). The respective algorithms of these two processes have been described with
flow-charts and snapshots of MATLAB results have been displayed.
Keywords: Human Computer Interaction, Uni-Modal Architecture, Color Recognition, Speech Recognition
--------------------------------------------------------------***----------------------------------------------------------------
1. INTRODUCTION
In earlier days, computing was only restricted to research in
scientific and astronomical laboratories. Only highly trained
specialists of these fields could handle computers. And also,
in the name of computers, there were massive expensive
computing machines which required programming and
functioning by skilled veterans. But today, we can make out
that from gigantic unhandy electronic machines to slim
laptops and i-pads in our bags, computers have come a long
way! Unlike the 1980s, majority of computer users today
have not received any specialized training in computer
programming, though they can effectively interact with
computers for a broad range of utilities.
Human Computer Interaction (HCI) is the study of the
„interaction‟ between humans and computers. The definition
doesn‟t end there. The word „interaction‟ used above must
be clarified with the phrase „easy interaction to obtain
effective and desired results‟. It is a study of the ways in
which man handles or utilizes computer technology in order
to influence his life with a target of simplifying things. It is
about creating a software or any other technology that
people will want to use, will be able to use and will find
effective when used. The Curriculum Development Group
of the ACM Special Interest Group defines HCI as a
„discipline concerned with the design, evaluation and
implementation of interactive computing systems for human
use and with the study of major phenomena surrounding
them‟.
As evident from above, HCI can be defined in numerous
ways. But all the definitions hover around the concept of
„interfacing‟. Utilizing computers has always required easy
interfacing. Over a few decades, the methods of interfacing
with computers have changed drastically. Thus, HCI has
become a ubiquitous research area attracting many
professionals from various fields. The main focus though
has always been on functionality and usability. Both of these
terms are interconnected. Functionality of a certain system is
defined as the range of operations that can be run or
performed, i.e. the services that it provides to its users.
However, functionality can only be considered when the
system can be efficiently and adequately utilized by the
users. Here comes the issue of usability. Usability is a
quality attribute that inspects how easy user interfaces are to
use. In short, it can be defined solely as ease-of-use. These
two attributes jointly determine the importance and need of
a certain HCI system.
1.1 Architecture of HCI Systems
An HCI system is usually defined by the behavior and
diversity of its inputs and outputs. These inputs and outputs
are the key attributes which form the basic outline of the
way in which interaction is occurring between human and
the computer. This is the very aspect defined by the
architecture of an HCI system. It can be classified into two
types:
IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308
_______________________________________________________________________________________
Volume: 05 Issue: 04 | Apr-2016, Available @ http://ijret.esatjournals.org 343
1.1.1 Uni-modal HCI systems
An HCI system which depends on only one genre of inputs
is known as a uni-modal HCI system. The independent
channels are also known as modality. Thus, a uni-modal
system comprises on one modality. A lot of new ways to
provide input to HCI systems have been evolved; for
example, visual-based, audio-based, sensor-based, sixth
sense-based etc. In this paper, we intend to cover only uni-
modal HCI systems; especially visual-based and audio-
based. These are expounded later on in other sections of this
paper.
1.1.2 Multi-modal HCI systems
In these types of HCI systems, different modalities are
engaged, i.e. there are a variety of communication channels.
A combination of two or more modalities can be used to
design a multi-modal HCI system
1.2 Literature Survey
Learned basics digital image processing concept from the
book of digital image processing by gonzalez and woods
[1]. The overview of HCI(Human Computer Interaction) is
studied from International Journal On Smart Sensing And
Intelligent Systems, VOL. 1, NO. 1, March 2008 [2]. Color
detection methodology has being taken from Real-Time
Object Tracking Using Colour Feature, Basic Geometric
Shape And Primary Colour Detection Using Image
Processing On MATLAB [3] [4] [5]. Speech recognition
has been included from An HCI Speech-Based Architecture
for Man-To-Machine and Machine-To-Man Communication
in Yorùbá Language[ 6]
1.3 Color Detection and Speech Recognition
We are performing two techniques for color detection that
are offline and online. Offline color detection can be done
by selecting predefined colors in the computer, for this we
have taken snapshots of blue green and red color in
rectangle shape and for online color detection webcam is
used to capture real time image of the object and basic color
are being recognize and message is being displayed in both
the cases of offline and online.
Speech is a means of communication. When we talk about
interaction with computers, intially it started with devices
like mouse, keyboard etc via electromechanical means,
eventually it upgraded to a more natural way that is speech;
and for this we need a speech recognition system. In this
paper we have described about the application of the speech
recognition system. With the use of speech recognition we
have opened web page by browsing certain sound which we
have already stored; detection of the sound is done in real
time and if it is matched with the stored sound, desired work
is done according to the back end programming. Speech
recognition is a hierarchy of level in which the process is
compiled.
2. ALGORITHMS
The algorithm for color detection is:
Step1: Graphic user interface (GUI) is made with two push
buttons, namely, offline and online as shown in fig.2.
Step2: When we press offline: three colors already stored in
our computer‟s memory is displayed and by clicking on any
of the color message is further of red, blue, or green as
shown in fig.3.
Step 3: By pressing online named push button we will
observe that web camera will start and the process will start
to detect the basic colors as shown in fig.5.
Step 4:In case of offline color, the colored image is being
converted into gray scale and then the gray image into
binary .
Step 5: In case of online color, web camera is configured
and frames are set to manual and returned type color space
to RGB.
Step 6: By switching for different colors we apply same
algorithm for three basic colors.
Step 7: Initially it is converted into gray color and then
filtering is done to avoid noise after this the image is being
converted into black and white.
Step 8: Then the region properties like Bounding Box and
Centroid are used.
Step 9: Plot the image and message box showing the color
detected.
Algorithm for speech detection:
Step 1: Record the sound using a microphone for the desired
time as already stated in the programme.
Step 2: Save the sound recorded as shown in fig.7.
Step 3: During speech recognition, real time speech is
recorded; and is checked with the already recorded sound.
Step 4: As soon as sound is matched with the pre-recorded
sound, the work alotted to that particular sound is done (
opening of D drive and web browser page) as shown in fig.6
and fig.8.
IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308
_______________________________________________________________________________________
Volume: 05 Issue: 04 | Apr-2016, Available @ http://ijret.esatjournals.org 344
MATLAB results:
Fig 1. GUI of human computer interaction system
Fig 2. Color acquisition GUI
Fig 3. Offline color acquisition from database
IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308
_______________________________________________________________________________________
Volume: 05 Issue: 04 | Apr-2016, Available @ http://ijret.esatjournals.org 345
Fig 4. Outputs of Color recognition in offline mode
Fig 5. Real time color detection
Fig 6. Output as local disk D
Fig 7. Speech acquisition GUI
Fig 8. Output as google web browser
3. CONCLUSION
Human computer is basically description of ways in which a
man can interface with machine in the most efficient and
easiest way. This technology has reached to an extent where
an individual just have to make an gesture to perform a
particular task, like in sixth sense technology just an gesture
of taking a picture results in an image, also words from hard
soft can be transferred to soft copy for further processing, in
addition to this just a plane sheet of paper can help user to
play a racing game making it very interesting and easy. Our
day to day life activities can be automated. This made us
work on the project similar to this technology. We have
researched regarding speech recognition and color detection.
It holds immense importance in today's era; boon for people
with disabilities making their work easier and saves time as
we would not have to go through a large set of steps. Instead
we can do it by just speaking a word. One of the significant
feature of speech recognition is rhythm, which may generate
a conspicuous change, thus we need to look after it. It can be
removed effectively with the use of concatenative synthesis.
Storing words and their attributes for efficient detection
which may slow down the system due to large database. In
color recognition, lightning effect play a major role. In order
to identify color correctly. We need to have proper light and
less or no noise background. We have achieved 98%
efficiency in color detection with considerable noise,
thereby getting correct results. The major drawback is that
the programming required is very complex.
REFERENCES
[1]. Digital image processing using Matlab -Gonzalez
woods & Eddins.
[2]. Fakhreddine Karray, Milad Alemzadeh, Jamil Abou
Saleh and Mo Nours Arab,”Human-Computer
Interaction: Overview on State of the Art”,
International Journal On Smart Sensing And Intelligent
Systems, Vol. 1, No. 1, March 2008.
[3]. Shubham Srivastava, Pratibha Singh, “Real-Time
Object Tracking Using Colour Feature”, International
Journal of Innovative Technology and Exploring
Engineering (IJITEE), ISSN: 2278-3075, Volume-3,
Issue-8, January 2014.
[4]. Shambhavi Vijay Chhaya, Sachin Khera, Pradeep
Kumar S, “Basic Geometric Shape And Primary
Colour Detection Using Image Processing On Matlab”,
IJRET: International Journal of Research in
Engineering and Technology eISSN: 2319-1163 |
pISSN: 2321-7308.
IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308
_______________________________________________________________________________________
Volume: 05 Issue: 04 | Apr-2016, Available @ http://ijret.esatjournals.org 346
[5]. D.Senthamaraikannan, S.Shriram, Dr.J.William,” Real
Time Color Recognition”, International Journal Of
Innovative Research In Electrical, Electronics,
Instrumentation And Control Engineering, Vol. 2, Issue
3, March 2014.
[6]. Akintola A. G., Ibiyemi T. S. Adewole K. S., “An HCI
Speech-Based Architecture for Man-To-Machine and
Machine-To-Man Communication in Yorùbá
Language”, Computer Engineering and Intelligent
Systems, ISSN 2222-1719 (Paper) ISSN 2222-2863
(Online), Vol.6, No.7, 2015.
BIOGRAPHIES
Priya Rathore, Assistant Professor,
Electronics and Communication
Department, Medi-caps Institute of Science
and Technology, Madhya Pradesh, India
Anjali Khatri, UG Student, 8th semester,
Electronics and Communication
Department, Medi-caps Institute of Science
and Technology, Madhya Pradesh, India
Aparna Pushpad, UG student, 8th
semester, Electronics and Communication
Department, Medi-caps Institute of Science
and Technology, Madhya Pradesh, India
Swekcha Jain, UG student, 8th
semester,
Electronics and Communication
Department, Medi-caps Institute of Science
and Technology, Madhya Pradesh, India

More Related Content

PDF
5.smart multilingual sign boards
PDF
IRJET- Sign Language and Gesture Recognition for Deaf and Dumb People
PDF
IRJET - Mutecom using Tensorflow-Keras Model
PDF
PDF
IRJET - Paint using Hand Gesture
PDF
Near Field Communication : Tap and Know
PDF
IRJET- Gesture Recognition for Indian Sign Language using HOG and SVM
PDF
Character recognition for bi lingual mixed-type characters using artificial n...
5.smart multilingual sign boards
IRJET- Sign Language and Gesture Recognition for Deaf and Dumb People
IRJET - Mutecom using Tensorflow-Keras Model
IRJET - Paint using Hand Gesture
Near Field Communication : Tap and Know
IRJET- Gesture Recognition for Indian Sign Language using HOG and SVM
Character recognition for bi lingual mixed-type characters using artificial n...

What's hot (20)

PDF
Real time Myanmar Sign Language Recognition System using PCA and SVM
PDF
GRS '“ Gesture based Recognition System for Indian Sign Language Recognition ...
PDF
Bt35408413
PDF
Review on Hand Gesture Recognition
PDF
IRJET - E-Assistant: An Interactive Bot for Banking Sector using NLP Process
PDF
Sign Language Recognition with Gesture Analysis
DOCX
Opticalcharacter recognition
PDF
Paper id 24201453
PDF
IRJET- Talking Receptionist Robot
PDF
Appearance based static hand gesture alphabet recognition
PDF
Real Time Facial Emotion Recognition using Kinect V2 Sensor
PPTX
A real time facial emotion recognition using 3D sensor and interfacing the re...
PDF
Voice controlled wheel chair
PDF
Cc4301455457
PDF
F018113743
PDF
A Real-Time Letter Recognition Model for Arabic Sign Language Using Kinect an...
PDF
IRJET- ASL Language Translation using ML
PDF
2 4-10
PDF
Device for text to speech production and to braille script
Real time Myanmar Sign Language Recognition System using PCA and SVM
GRS '“ Gesture based Recognition System for Indian Sign Language Recognition ...
Bt35408413
Review on Hand Gesture Recognition
IRJET - E-Assistant: An Interactive Bot for Banking Sector using NLP Process
Sign Language Recognition with Gesture Analysis
Opticalcharacter recognition
Paper id 24201453
IRJET- Talking Receptionist Robot
Appearance based static hand gesture alphabet recognition
Real Time Facial Emotion Recognition using Kinect V2 Sensor
A real time facial emotion recognition using 3D sensor and interfacing the re...
Voice controlled wheel chair
Cc4301455457
F018113743
A Real-Time Letter Recognition Model for Arabic Sign Language Using Kinect an...
IRJET- ASL Language Translation using ML
2 4-10
Device for text to speech production and to braille script
Ad

Similar to Demonstration of visual based and audio-based hci system (20)

PPTX
Ubitous computing ppt
PPTX
human computer interface
PPTX
Introduction to HCI
PPTX
Being Human
PPT
chap-01 HCI.ppt
PPTX
Being human (Human Computer Interaction)
PPTX
Human computer interaction -Input output channel
PDF
20140318 cisec-critical-hmi
PPTX
Human Computer Interface of an Information System
PPT
PDF
HCI.pdf
PDF
C0353018026
PPTX
HUMAN COMPUTER INTERACTION
DOCX
A paper on HCI by Nalaemton and Mervin
PPTX
HCI(Human Computer Interaction).pptx
PPTX
human computer interaction Lecture 2.pptx
PPTX
1.Usability Engineering.pptx
PDF
Human Computer Interaction
Ubitous computing ppt
human computer interface
Introduction to HCI
Being Human
chap-01 HCI.ppt
Being human (Human Computer Interaction)
Human computer interaction -Input output channel
20140318 cisec-critical-hmi
Human Computer Interface of an Information System
HCI.pdf
C0353018026
HUMAN COMPUTER INTERACTION
A paper on HCI by Nalaemton and Mervin
HCI(Human Computer Interaction).pptx
human computer interaction Lecture 2.pptx
1.Usability Engineering.pptx
Human Computer Interaction
Ad

More from eSAT Journals (20)

PDF
Mechanical properties of hybrid fiber reinforced concrete for pavements
PDF
Material management in construction – a case study
PDF
Managing drought short term strategies in semi arid regions a case study
PDF
Life cycle cost analysis of overlay for an urban road in bangalore
PDF
Laboratory studies of dense bituminous mixes ii with reclaimed asphalt materials
PDF
Laboratory investigation of expansive soil stabilized with natural inorganic ...
PDF
Influence of reinforcement on the behavior of hollow concrete block masonry p...
PDF
Influence of compaction energy on soil stabilized with chemical stabilizer
PDF
Geographical information system (gis) for water resources management
PDF
Forest type mapping of bidar forest division, karnataka using geoinformatics ...
PDF
Factors influencing compressive strength of geopolymer concrete
PDF
Experimental investigation on circular hollow steel columns in filled with li...
PDF
Experimental behavior of circular hsscfrc filled steel tubular columns under ...
PDF
Evaluation of punching shear in flat slabs
PDF
Evaluation of performance of intake tower dam for recent earthquake in india
PDF
Evaluation of operational efficiency of urban road network using travel time ...
PDF
Estimation of surface runoff in nallur amanikere watershed using scs cn method
PDF
Estimation of morphometric parameters and runoff using rs & gis techniques
PDF
Effect of variation of plastic hinge length on the results of non linear anal...
PDF
Effect of use of recycled materials on indirect tensile strength of asphalt c...
Mechanical properties of hybrid fiber reinforced concrete for pavements
Material management in construction – a case study
Managing drought short term strategies in semi arid regions a case study
Life cycle cost analysis of overlay for an urban road in bangalore
Laboratory studies of dense bituminous mixes ii with reclaimed asphalt materials
Laboratory investigation of expansive soil stabilized with natural inorganic ...
Influence of reinforcement on the behavior of hollow concrete block masonry p...
Influence of compaction energy on soil stabilized with chemical stabilizer
Geographical information system (gis) for water resources management
Forest type mapping of bidar forest division, karnataka using geoinformatics ...
Factors influencing compressive strength of geopolymer concrete
Experimental investigation on circular hollow steel columns in filled with li...
Experimental behavior of circular hsscfrc filled steel tubular columns under ...
Evaluation of punching shear in flat slabs
Evaluation of performance of intake tower dam for recent earthquake in india
Evaluation of operational efficiency of urban road network using travel time ...
Estimation of surface runoff in nallur amanikere watershed using scs cn method
Estimation of morphometric parameters and runoff using rs & gis techniques
Effect of variation of plastic hinge length on the results of non linear anal...
Effect of use of recycled materials on indirect tensile strength of asphalt c...

Recently uploaded (20)

PPT
Project quality management in manufacturing
PPTX
IOT PPTs Week 10 Lecture Material.pptx of NPTEL Smart Cities contd
PPTX
Lesson 3_Tessellation.pptx finite Mathematics
PPTX
additive manufacturing of ss316l using mig welding
PPTX
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
PDF
Well-logging-methods_new................
PPTX
Welding lecture in detail for understanding
PDF
Model Code of Practice - Construction Work - 21102022 .pdf
PDF
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
PDF
Digital Logic Computer Design lecture notes
PPTX
KTU 2019 -S7-MCN 401 MODULE 2-VINAY.pptx
PPTX
Recipes for Real Time Voice AI WebRTC, SLMs and Open Source Software.pptx
PPTX
CYBER-CRIMES AND SECURITY A guide to understanding
PPTX
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
PPTX
Infosys Presentation by1.Riyan Bagwan 2.Samadhan Naiknavare 3.Gaurav Shinde 4...
DOCX
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
PPTX
MCN 401 KTU-2019-PPE KITS-MODULE 2.pptx
PPTX
UNIT 4 Total Quality Management .pptx
PPTX
Geodesy 1.pptx...............................................
PDF
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
Project quality management in manufacturing
IOT PPTs Week 10 Lecture Material.pptx of NPTEL Smart Cities contd
Lesson 3_Tessellation.pptx finite Mathematics
additive manufacturing of ss316l using mig welding
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
Well-logging-methods_new................
Welding lecture in detail for understanding
Model Code of Practice - Construction Work - 21102022 .pdf
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
Digital Logic Computer Design lecture notes
KTU 2019 -S7-MCN 401 MODULE 2-VINAY.pptx
Recipes for Real Time Voice AI WebRTC, SLMs and Open Source Software.pptx
CYBER-CRIMES AND SECURITY A guide to understanding
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
Infosys Presentation by1.Riyan Bagwan 2.Samadhan Naiknavare 3.Gaurav Shinde 4...
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
MCN 401 KTU-2019-PPE KITS-MODULE 2.pptx
UNIT 4 Total Quality Management .pptx
Geodesy 1.pptx...............................................
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...

Demonstration of visual based and audio-based hci system

  • 1. IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308 _______________________________________________________________________________________ Volume: 05 Issue: 04 | Apr-2016, Available @ http://ijret.esatjournals.org 342 DEMONSTRATION OF VISUAL-BASED AND AUDIO-BASED HCI SYSTEM Priya Rathore1 , Anjali Khatri2 , Aparna Pushpad3 , Swekcha Jain4 1 Assistant Professor, Electronics and Communication Department, Medi-caps Institute of Science and Technology, Madhya Pradesh, India 2 UG student, Electronics and Communication Department, Medi-caps Institute of Science and Technology, Madhya Pradesh, India 3 UG student, Electronics and Communication Department, Medi-caps Institute of Science and Technology, Madhya Pradesh, India 4 UG student, Electronics and Communication Department, Medi-caps Institute of Science and Technology, Madhya Pradesh, India Abstract This paper is an attempt to provide a bird’s eye view to the concept of Human Compute Interaction (HCI). The intention is to focus on the uni-modal architecture of HCI; especially the HCI system based on visual-based and color-based communication channels viz-a-viz color recognition and speech recognition. We have developed a Graphical User Interface (GUI) for the same using MATLAB; one push button assigned for color input (through webcam) and the other push button assigned for speech input (through microphone). In color recognition, primary colors i.e. RGB are detected in frames captured in real time or images uploaded offline. Subsequently, desired operation is executed (we have set commands to open D drive). In speech recognition, audio input through microphone is compared with a pre-stored audio file and then an operation is performed automatically (here, we have set commands to open Google web browser). The respective algorithms of these two processes have been described with flow-charts and snapshots of MATLAB results have been displayed. Keywords: Human Computer Interaction, Uni-Modal Architecture, Color Recognition, Speech Recognition --------------------------------------------------------------***---------------------------------------------------------------- 1. INTRODUCTION In earlier days, computing was only restricted to research in scientific and astronomical laboratories. Only highly trained specialists of these fields could handle computers. And also, in the name of computers, there were massive expensive computing machines which required programming and functioning by skilled veterans. But today, we can make out that from gigantic unhandy electronic machines to slim laptops and i-pads in our bags, computers have come a long way! Unlike the 1980s, majority of computer users today have not received any specialized training in computer programming, though they can effectively interact with computers for a broad range of utilities. Human Computer Interaction (HCI) is the study of the „interaction‟ between humans and computers. The definition doesn‟t end there. The word „interaction‟ used above must be clarified with the phrase „easy interaction to obtain effective and desired results‟. It is a study of the ways in which man handles or utilizes computer technology in order to influence his life with a target of simplifying things. It is about creating a software or any other technology that people will want to use, will be able to use and will find effective when used. The Curriculum Development Group of the ACM Special Interest Group defines HCI as a „discipline concerned with the design, evaluation and implementation of interactive computing systems for human use and with the study of major phenomena surrounding them‟. As evident from above, HCI can be defined in numerous ways. But all the definitions hover around the concept of „interfacing‟. Utilizing computers has always required easy interfacing. Over a few decades, the methods of interfacing with computers have changed drastically. Thus, HCI has become a ubiquitous research area attracting many professionals from various fields. The main focus though has always been on functionality and usability. Both of these terms are interconnected. Functionality of a certain system is defined as the range of operations that can be run or performed, i.e. the services that it provides to its users. However, functionality can only be considered when the system can be efficiently and adequately utilized by the users. Here comes the issue of usability. Usability is a quality attribute that inspects how easy user interfaces are to use. In short, it can be defined solely as ease-of-use. These two attributes jointly determine the importance and need of a certain HCI system. 1.1 Architecture of HCI Systems An HCI system is usually defined by the behavior and diversity of its inputs and outputs. These inputs and outputs are the key attributes which form the basic outline of the way in which interaction is occurring between human and the computer. This is the very aspect defined by the architecture of an HCI system. It can be classified into two types:
  • 2. IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308 _______________________________________________________________________________________ Volume: 05 Issue: 04 | Apr-2016, Available @ http://ijret.esatjournals.org 343 1.1.1 Uni-modal HCI systems An HCI system which depends on only one genre of inputs is known as a uni-modal HCI system. The independent channels are also known as modality. Thus, a uni-modal system comprises on one modality. A lot of new ways to provide input to HCI systems have been evolved; for example, visual-based, audio-based, sensor-based, sixth sense-based etc. In this paper, we intend to cover only uni- modal HCI systems; especially visual-based and audio- based. These are expounded later on in other sections of this paper. 1.1.2 Multi-modal HCI systems In these types of HCI systems, different modalities are engaged, i.e. there are a variety of communication channels. A combination of two or more modalities can be used to design a multi-modal HCI system 1.2 Literature Survey Learned basics digital image processing concept from the book of digital image processing by gonzalez and woods [1]. The overview of HCI(Human Computer Interaction) is studied from International Journal On Smart Sensing And Intelligent Systems, VOL. 1, NO. 1, March 2008 [2]. Color detection methodology has being taken from Real-Time Object Tracking Using Colour Feature, Basic Geometric Shape And Primary Colour Detection Using Image Processing On MATLAB [3] [4] [5]. Speech recognition has been included from An HCI Speech-Based Architecture for Man-To-Machine and Machine-To-Man Communication in Yorùbá Language[ 6] 1.3 Color Detection and Speech Recognition We are performing two techniques for color detection that are offline and online. Offline color detection can be done by selecting predefined colors in the computer, for this we have taken snapshots of blue green and red color in rectangle shape and for online color detection webcam is used to capture real time image of the object and basic color are being recognize and message is being displayed in both the cases of offline and online. Speech is a means of communication. When we talk about interaction with computers, intially it started with devices like mouse, keyboard etc via electromechanical means, eventually it upgraded to a more natural way that is speech; and for this we need a speech recognition system. In this paper we have described about the application of the speech recognition system. With the use of speech recognition we have opened web page by browsing certain sound which we have already stored; detection of the sound is done in real time and if it is matched with the stored sound, desired work is done according to the back end programming. Speech recognition is a hierarchy of level in which the process is compiled. 2. ALGORITHMS The algorithm for color detection is: Step1: Graphic user interface (GUI) is made with two push buttons, namely, offline and online as shown in fig.2. Step2: When we press offline: three colors already stored in our computer‟s memory is displayed and by clicking on any of the color message is further of red, blue, or green as shown in fig.3. Step 3: By pressing online named push button we will observe that web camera will start and the process will start to detect the basic colors as shown in fig.5. Step 4:In case of offline color, the colored image is being converted into gray scale and then the gray image into binary . Step 5: In case of online color, web camera is configured and frames are set to manual and returned type color space to RGB. Step 6: By switching for different colors we apply same algorithm for three basic colors. Step 7: Initially it is converted into gray color and then filtering is done to avoid noise after this the image is being converted into black and white. Step 8: Then the region properties like Bounding Box and Centroid are used. Step 9: Plot the image and message box showing the color detected. Algorithm for speech detection: Step 1: Record the sound using a microphone for the desired time as already stated in the programme. Step 2: Save the sound recorded as shown in fig.7. Step 3: During speech recognition, real time speech is recorded; and is checked with the already recorded sound. Step 4: As soon as sound is matched with the pre-recorded sound, the work alotted to that particular sound is done ( opening of D drive and web browser page) as shown in fig.6 and fig.8.
  • 3. IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308 _______________________________________________________________________________________ Volume: 05 Issue: 04 | Apr-2016, Available @ http://ijret.esatjournals.org 344 MATLAB results: Fig 1. GUI of human computer interaction system Fig 2. Color acquisition GUI Fig 3. Offline color acquisition from database
  • 4. IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308 _______________________________________________________________________________________ Volume: 05 Issue: 04 | Apr-2016, Available @ http://ijret.esatjournals.org 345 Fig 4. Outputs of Color recognition in offline mode Fig 5. Real time color detection Fig 6. Output as local disk D Fig 7. Speech acquisition GUI Fig 8. Output as google web browser 3. CONCLUSION Human computer is basically description of ways in which a man can interface with machine in the most efficient and easiest way. This technology has reached to an extent where an individual just have to make an gesture to perform a particular task, like in sixth sense technology just an gesture of taking a picture results in an image, also words from hard soft can be transferred to soft copy for further processing, in addition to this just a plane sheet of paper can help user to play a racing game making it very interesting and easy. Our day to day life activities can be automated. This made us work on the project similar to this technology. We have researched regarding speech recognition and color detection. It holds immense importance in today's era; boon for people with disabilities making their work easier and saves time as we would not have to go through a large set of steps. Instead we can do it by just speaking a word. One of the significant feature of speech recognition is rhythm, which may generate a conspicuous change, thus we need to look after it. It can be removed effectively with the use of concatenative synthesis. Storing words and their attributes for efficient detection which may slow down the system due to large database. In color recognition, lightning effect play a major role. In order to identify color correctly. We need to have proper light and less or no noise background. We have achieved 98% efficiency in color detection with considerable noise, thereby getting correct results. The major drawback is that the programming required is very complex. REFERENCES [1]. Digital image processing using Matlab -Gonzalez woods & Eddins. [2]. Fakhreddine Karray, Milad Alemzadeh, Jamil Abou Saleh and Mo Nours Arab,”Human-Computer Interaction: Overview on State of the Art”, International Journal On Smart Sensing And Intelligent Systems, Vol. 1, No. 1, March 2008. [3]. Shubham Srivastava, Pratibha Singh, “Real-Time Object Tracking Using Colour Feature”, International Journal of Innovative Technology and Exploring Engineering (IJITEE), ISSN: 2278-3075, Volume-3, Issue-8, January 2014. [4]. Shambhavi Vijay Chhaya, Sachin Khera, Pradeep Kumar S, “Basic Geometric Shape And Primary Colour Detection Using Image Processing On Matlab”, IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308.
  • 5. IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308 _______________________________________________________________________________________ Volume: 05 Issue: 04 | Apr-2016, Available @ http://ijret.esatjournals.org 346 [5]. D.Senthamaraikannan, S.Shriram, Dr.J.William,” Real Time Color Recognition”, International Journal Of Innovative Research In Electrical, Electronics, Instrumentation And Control Engineering, Vol. 2, Issue 3, March 2014. [6]. Akintola A. G., Ibiyemi T. S. Adewole K. S., “An HCI Speech-Based Architecture for Man-To-Machine and Machine-To-Man Communication in Yorùbá Language”, Computer Engineering and Intelligent Systems, ISSN 2222-1719 (Paper) ISSN 2222-2863 (Online), Vol.6, No.7, 2015. BIOGRAPHIES Priya Rathore, Assistant Professor, Electronics and Communication Department, Medi-caps Institute of Science and Technology, Madhya Pradesh, India Anjali Khatri, UG Student, 8th semester, Electronics and Communication Department, Medi-caps Institute of Science and Technology, Madhya Pradesh, India Aparna Pushpad, UG student, 8th semester, Electronics and Communication Department, Medi-caps Institute of Science and Technology, Madhya Pradesh, India Swekcha Jain, UG student, 8th semester, Electronics and Communication Department, Medi-caps Institute of Science and Technology, Madhya Pradesh, India