Introduction
Man’s continuous advancement through the years has been characterized by major innovations that have led him from the Agrarian Age through the Industrial Age to the Information Age in which we now live. The invention of the digital computer is believed to have been the turning point of the 20th century and the facilitator of the move from the Industrial Age to the Information Age. We have been in the Information Age for quite a while, and given the pace of technological development, many scientists are of the opinion that man will soon usher in a new era that Smart refers to as the Symbiotic Age.
As in all the other transitions between Ages, technological innovation will play a huge role here as well. Smart and others believe that the emergence of the Conversational User Interface (CUI) on the global web marks the beginning of a New Age, especially when one considers man’s increasingly inseparable connection to technological infrastructure. Smart goes further to state that there is good early evidence that CUIs are a revolutionary technology.
He says CUIs will help man discover better collective solutions in multiple domains such as globalization, education, governance, environment, and health among others. This will happen because conversational user interfaces will enhance our capacity to extract knowledge from our already large and yet rapidly increasing databases of information.
Conversational User Interfaces
Self-service systems, online help systems, web services, mobile communication devices, remote control systems, and dashboard computers are providing ever more functionality, which implies greater complexity and a steeper learning curve for users. Conversational user interfaces allow various natural communication modes, such as gestures and speech, for input and output while exploiting the context in which an input is used to compute its meaning. The increasing interest in CUIs is driven by our desire to support natural, flexible, efficient and powerfully expressive means of human-computer communication that are easy for all to learn and use.
Wahlster adds that the development of conversational user interfaces will enable the average person to interact with computers anytime and anywhere without special skills or training, using such common devices as a mobile phone.
It is vital to note that CUIs differ in the degree to which the system or the user controls the conversation. Currently, there are three broad categories: directed dialogue, ‘free-form’ dialogue and mixed (or mixed-initiative) dialogue systems. According to Glass et al.:
The categorization of these spoken dialogue systems is differentiated by the degree with which the system maintains control of the conversation, and the inherent amount of flexibility provided to the user to ask for
- what they want,
- in the way they want to ask for it, and
- when they want to ask it.
In directed dialogue systems the user is restricted in terms of options, as in the interactive voice response systems that we often encounter in phone-in customer care centers. Users of such systems have a set of prescribed functions at their disposal. It is easy to develop systems within such a restricted framework, and that is why directed dialogue systems were the first conversational user interfaces to be successfully deployed on a wide scale. Free-form dialogue systems are the exact opposite of directed dialogue systems because complete control is maintained by the human while the system remains passive.
In mixed dialogue CUIs, the approach employed offers a more flexible dialogue strategy that empowers both the user and the machine, so that the dialogue resembles human face-to-face conversation, where both parties have control. It is therefore no surprise that systems built to operate in this paradigm are typically more complex than their directed dialogue counterparts, and hence more difficult to design and deploy. For this reason, most mixed-initiative systems remain under development in research laboratories.
Although mixed-initiative CUIs are the most natural and efficient means of communication, their widespread use has been hampered by multiple fundamental technical barriers. The greatest of these is the difficulty of configuring the Human Language Technologies (HLTs) necessary for the creation of mixed-initiative prototype systems. At the moment, most of the development taking place is language-specific and domain-dependent.
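The difference in who controls the conversation can be illustrated with a minimal sketch. It assumes a toy TV-guide request with three hypothetical slots (genre, channel, day); the slot names, prompts and function names are invented for illustration and are not taken from Glass et al. or from any deployed system.

```python
from typing import Optional

# Hypothetical slots for a toy TV-guide request (an assumption for illustration).
SLOTS = ("genre", "channel", "day")

PROMPTS = {
    "genre": "Which genre?",
    "channel": "Which channel?",
    "day": "Which day?",
}


def next_prompt(filled: dict) -> Optional[str]:
    """Return the prompt for the first empty slot, or None when all are filled."""
    for slot in SLOTS:
        if slot not in filled:
            return PROMPTS[slot]
    return None


def directed_turn(filled: dict, expected_slot: str, answer: str) -> dict:
    """Directed dialogue: the system decides which slot the answer belongs to."""
    updated = dict(filled)
    updated[expected_slot] = answer
    return updated


def mixed_turn(filled: dict, parsed: dict) -> dict:
    """Mixed initiative: the user may volunteer any known slots in a single turn."""
    updated = dict(filled)
    updated.update({k: v for k, v in parsed.items() if k in SLOTS})
    return updated
```

In the directed case the system needs three turns to fill three slots, one per prompt. In the mixed case a single utterance that an (assumed) language-understanding component parses into `{"genre": "history", "channel": "BBC1"}` fills two slots at once, and the system only takes the initiative to ask for what is still missing. The extra complexity of mixed-initiative systems lies largely in producing that parsed input reliably, which this sketch takes as given.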
This has meant that dialogue management must be fine-tuned differently for each application. This application-specificity eliminates the portability of these Human Language Technologies across different acoustic environments, databases, and knowledge domains. For conversational interfaces to become as ubiquitous as the telephone, researchers have no option but to seek ways to make it easier for developers to create systems that learn and improve their performance automatically.
Despite the challenges facing the production of CUI systems, researchers around the world continue to make inroads towards making the technology a common reality. For example, researchers at Philips Research developed a system called Speech Interfaces for Consumer Electronics (SPICE). SPICE is a CUI to an Electronic Program Guide, developed to support navigation in a large TV program database, control of the TV set, and programming of a VCR. With SPICE, users can interact with the TV by issuing commands through touch-screen input on a hand-held GUI display in combination with spoken dialogue.
The main features that make the SPICE conversational interface a powerful communication partner are natural language input, direct access to content, cooperative dialogue, and a choice and combination of modalities. The devices we use to interact with our TV sets today have pre-defined menus and prescribed keywords that we must remember in order to use them. With SPICE, natural language input enables users to simply state their request, e.g. “What history shows are on BBC1?” Direct access to content means that when, say, one is searching for a movie in the TV database, one is no longer restricted by classifications such as genre or title.
SPICE’s natural language capacity and large-vocabulary speech recognition make it possible to retrieve the movie using unstructured input such as “Is there a James Bond movie today?” Cooperative dialogue in SPICE is made possible by two-way communication between the user and the TV system. The user’s input is complemented by the system, which offers suggestions based on the user’s preferences. Finally, SPICE allows users to choose the most suitable mode of input at any particular moment. The user could opt to use touch-screen input, speech, or a combination of both.
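How two input modes might be combined into one command can be sketched as follows. This is an illustrative assumption only: the source does not describe SPICE's internal design, and the field names (`action`, `program`) and the rule that speech takes precedence on conflicts are hypothetical choices made for this example.

```python
from typing import Optional


def fuse_inputs(speech: Optional[dict], touch: Optional[dict]) -> dict:
    """Merge two partial commands into one.

    Touch fills in whatever speech left unspecified (None);
    where both supply a value, speech wins (an arbitrary choice for this sketch).
    """
    command = {}
    if touch:
        command.update(touch)
    if speech:
        command.update({k: v for k, v in speech.items() if v is not None})
    return command


# The user says "record this one" while tapping a title on the hand-held display.
spoken = {"action": "record", "program": None}
tapped = {"program": "Goldfinger"}
command = fuse_inputs(spoken, tapped)
```

Here `command` contains both the spoken action and the tapped program, so neither modality alone has to carry the whole request. Either argument may be `None`, which covers the cases where the user relies on speech only or touch only.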
Education and Training
As stated earlier, one of the greater impacts of the emergence of a Conversational User Interface (CUI) on the global web will be its benefit to education and training. A study by Guodong and Zhongjiao in China, the world’s fastest-growing economy, describes the rapid development of information and communication technology and its role in education. Given China’s huge population, a drive towards e-learning there would greatly enlarge the audience for the adoption and use of conversational interface systems.
By 2008, the number of universities, primary and secondary schools and scientific research institutes connecting to the China Education and Research Network (CERNET) had reached over 2000, with almost 30 million users. CERNET now has over 30 international and regional communication channels, which ensures safe and high-speed information exchange among educational institutions both at home and abroad.
The Chinese government, much like most western governments, is pushing for the steady development and reform of universities through the establishment of e-campuses as a brand new model for teaching, collaborative scientific research and management in colleges and universities. Our seemingly obsessive focus on developments in China stems from the fact that its population represents one-sixth of the total world population; if its government can successfully encourage a shift towards online education and e-learning, the demand for conversational interface systems will increase. High demand implies more research funding and a greater likelihood that CUIs will become ubiquitous.
According to Guodong and Zhongjiao, the four characteristics of an e-campus are the digital storage of resources, information transfer via networks, automation of management and personalization of communication. It is within the objective of personalizing communication that we find the increasing interest in conversational user interfaces (CUIs). The development of hybrid and web-centric courses encourages a variety of interactions among the participants, with the tutor acting more as a facilitator than as an instructor.
In UK higher education establishments the focus has traditionally been very much on transferring a body of knowledge to a set of learners using a variety of teaching methods where the conversation is only used as a means of clarifying the learning by the instructor. With the advent of online teaching and learning, the UK’s Department for Education and Skills (DfES) sought to develop another approach. DfES funded the use of new technology to help raise academic standards through projects such as Talk 2 Learn. To help us comprehend the relevance of CUIs in education and training we shall give a brief description of Talk 2 Learn as an interactive online learning tool.
Talk 2 Learn began as a pilot project in 2000 with 1200 newly appointed UK headteachers. It organized people into groupings known as communities so that the members of each community could negotiate meaning over a variety of professional issues and initiatives they were facing. Communities could be made private, and membership ranged from one to thousands. The Talk 2 Learn communities often did not follow a set of learning outcomes, so as to allow for different viewpoints and perspectives and for members to come to a better understanding of issues through discussion and debate. The ultimate goal of the project was to encourage social interaction and individual participation in social behaviours such as learning.
Vygotsky – a renowned scientist and expert in human cognitive development and interpersonal communication – considered that learning takes place more often outside a traditional setting, such as a school, than in it. One of the strengths of conversational interfaces is that they can be used to enhance learning outside the classroom through modern infrastructure such as mobile phones. According to Allen:
One of the distinguishing features of the Talk 2 Learn software from email listings and online discussion groups is its ability to organise communities in a wide variety of ways. It is very easy to set up new groupings and sub-groupings.
Currently, there are three types of online courses: those that present material with little or no interaction; web-enhanced courses, which are a hybrid of face-to-face and online teaching; and web-centric courses, which are interactive and conducted exclusively through a course site. It is in this last, interactive type that the use of conversational user interfaces will have the greatest impact. According to Vygotsky’s concepts, the makeup of learning groups and an emphasis on the interaction of learners with one another are key to successful individual learning. He goes further to state that in the past human beings formed communities that accumulated collective learning into social practices.
In those communities knowledge was not an object; it was a living part of their practice. Knowing was an act of participation. Now that the Internet has become an essential tool assisting college students in learning, it is necessary to develop tools that will transform learning from being an object into part of our modern culture. The value of conversational interfaces in promoting this shift today cannot be gainsaid.
Computer Supported Collaborative Learning (CSCL) was developed to provide opportunities to design and implement methods of advanced learning, such as deep learning, sustained and critical discourse, and effective discussion. Current insights into the CSCL environment suggest that social interaction is likewise important for communities that support learning. Mixed-initiative CUIs enable web-centric courses to support more flexible dialogue strategies within virtual classrooms by making the e-environment amenable to conversation.
Group cohesion and social interaction are a necessary first step for collaborative learning or where effective discussion is expected to take place. Chen and Wang say that simply making a computer-supported discussion forum available does not guarantee that it will be used effectively to enable learning. We must find tools and methods that shall encourage conversation among all the parties involved (including the machines).
The current state of e-learning, however, does not present us with a rosy outlook. A study conducted by Rossett and Marshall in mid-2009 found that the direction of e-learning has not shifted much over the past several years. Most e-learning is about measuring and delivering through familiar instructional strategies such as tutorials and scenarios. Personalized learning, problem solving and knowledge construction on the web remain less common than expected. Online discussions to support knowledge transfer from the classroom and the use of mobile devices, both of which had been anticipated to increase, were unexpectedly rare.
Use of Web 2.0 activities to promote user-generated content and collaboration remains scarce except in academia. In fact, Rossett and Marshall show that the most frequently occurring e-learning practice is still the testing of skills and knowledge. According to their research, the major barriers to interactive, personalized e-learning were (in decreasing order): lack of financial resources, resistance to change and technology shortcomings.
Conclusion
It is almost certain that similar to the way we moved from the Industrial Age to the Information Age, we shall be moving into a newer Age. Our predictions at the moment inform us that in that coming Age the average human interaction with the average computer will not be through a mouse and a keyboard but through voice. This will be facilitated by the development and use of conversational user interfaces (CUIs). Smart predicts that CUIs will help us discover better collective solutions in several domains and that there is good early evidence to prove this. Among the greater solutions that we anticipate is the fall of political/power/equity divides due to the elimination of the current global inequity of access to high quality, lifelong education.
Conversational user interfaces allow various natural communication modes such as gestures and speech for input and output while exploiting the context in which an input is used to compute its meaning. This would lessen the need for users to continuously undergo steep learning curves whenever new technology is introduced. Wahlster believes that CUIs will enable the average person to interact with computers anytime and anywhere without special skills or training, using such common devices as a mobile phone.
Glass et al. say that mixed-initiative CUIs are the most natural and efficient means of communication because they employ a more flexible dialogue strategy that allows both the user and the machine to control the dialogue, much as we do in human face-to-face conversation. Within the field of education and training, online courses and e-learning present the most obvious platform for the implementation of conversational user interfaces (CUIs).
Moreover, more governments are willing to spend money on the use of technology to raise academic performance. It is therefore imperative that we use this opportunity, when the Internet has become an essential tool assisting students in learning, to transform learning from being an object into part of our modern culture. Even though Rossett and Marshall demonstrate that we still need to make substantial inroads in the arena of e-learning, as Smart states, a new Age is coming when the use of conversational interfaces shall be the norm.
References
- Allen, K.: Online Learning: Constructivism and Conversation as an Approach to Learning. In: Innovations in Education and Teaching International. Vol. 42. No. 3. (2005) 247 – 256.
- Boettcher, J. & Conrad, R. M.: Faculty Guide for Moving Teaching and Learning to the Web. League for Innovation in the Community College, Mission Viejo, CA (1999).
- Chen, F. C. & Wang, T. C.: Social Conversation and Effective Discussion in Online Group Learning. In: Educational Technology Research and Development. Vol. 57 (2009) 587 – 612.
- Glass, J. et al.: A Framework for Developing Conversational User Interfaces. In: Computer-Aided Design of User Interfaces IV. Springer, Netherlands (2005) 349 – 360.
- Guodong, Z. & Zhongjiao, J.: From E-Campus to E-Learning: An Overview of ICT Applications in Chinese Higher Education. In: British Journal of Educational Technology. Vol. 41. No. 4. (2010) 574 – 581.
- Kellner, A. & Portele, T.: SPICE: A Multimodal Conversational User Interface to an Electronic Program Guide. Philips Research Laboratories, Aachen (n.d.).
- Rossett, A. & Marshall, J.: E-Learning: What’s Old is New Again. In: T+D (2010) 34 – 38.
- Smart, J. M.: The Conversational Interface: Our Next Great Leap Forward. In: Acceleration Watch. (n.d.). Web.
- Wahlster, W.: Conversational User Interfaces. In: Information Technology. Vol. 46. No. 6. (2004) 289 – 290.
- Zue, V. & Glass, J.: Conversational Interfaces: Advances and Challenges. In: Proceedings of the IEEE. Vol. 88. No. 8. (2000) 1166 – 1180.