SlideShare a Scribd company logo
Vernacular SMS and MVASA tool for mass communication in near futureExploring Opportunities and Challenges in India
Mobile: Socio-economic impact in IndiaImpact of Increase in Mobile Penetration on Growth in GD1st order of impact of increase in mobile penetration on growth in GDP (impact of voice primarily by allowing people to connect / talk)VAS and off-deck usage will bring in 2nd order of impact on GDP growth
VAS: Taking delivery thru mobile further Universal Reach / Inclusive GrowthEnterprise / SME / Rural / Poor urban areasLeapfrogging the PCLow cost access medium for entertainment, informationUtility of Content and ServicesCommerce, Rural applications, Local ContentDriving InnovationLocalization, Voice based VAS
The future of MVASA INR 121,000 crore industry over the next 5 yearsVAS as % of total telecom revenue in India remains lower as compared to other markets today; However it is expected to contribute INR 35,400 crores in 2013
Off deck mobile VASKey Growth drivers for Off deck Mobile VASEvolving ecosystem, continued push by D2C players and emergence of alternate billing channels will be the key growth drivers for off-deck mobile VAS in India
The importance of SMSA2P SMS remains the most effective and timely way to reach out to people (for business, social organizations & govt.) & is therefore critical from a social and economic perspective
SMS: Evolution of an effective mass communication mediaOften overlooked, this A2P SMS is a very powerful mass market medium. With 400 mn expected subs and 5 A2P SMS/day this corresponds to 2 Billion pieces of information delivered per day
Reach of Indian Languages10 Million Indians study Hindi every year. Similar numbers for others.Top 10 Indian newspapers are regionalTop TV channels are vernacular. Star TV converted 100% to HindiIn urban India, only 37% of the literate population knows English; the number drops to 17% in rural areas10 out of top 30 most-widely used languages worldwide are Indian 8 out of top 50 best-selling newspapers in the world are published in an	Indian languageCompared to only 3 English language papers from India in the list Despite these impressive statistics, not a single Indian language makes to the list to most-widely used languages in the mobile/Internet world
SMS Usage in IndiaSMS continues to be most popular non-voice service but usage levels are still low compared to other countries
Explaining the GapOne of the contributing factors towards high SMS usage in Philippines and China is pricing In Philippines, SMS started as a free service! Voice calls are cheaper in India but SMS pricing levels are relatively higher Another important factor is the use of standards-based solutions for local language SMSIn Philippines, local languages use Roman script, so 7-bit default GSM alphabet can be used The inherent efficiency of Chinese language mitigates the inefficiency of 2- byte UCS-2 encoding Both countries have lower English-literacy levels compared to IndiaLocal language SMS is used heavily
The ChallengesEstablish a standard in the eco-systemTelecom OperatorsDevice ManufacturersApplication Providers
The ChallengesTo evolve a core system of Indic MVAS/SMS to be implemented uniformly across vendors and service providers Font rendering to make the fonts standard across all handsets Key pad design challenge How do legacy handsets which donot have this Indic feature be included in them Regulatory Aspects
The ChallengesTo evolve a core system of Indic MVAS/SMS to be implemented uniformly across vendors and service providers Font rendering to make the fonts standard across all handsets Key pad design challenge How do legacy handsets which donot have this Indic feature be included in them Regulatory Aspects
Existing Coding schemes for messagingGSM Alphabet – Supports languages that use Latin character set UCS-2 – Supports all major languages of the world User-defined – Any arbitrary character set can be defined
Encoding Indian Languages in UCS-2Users who want to send an SMS in an Indian language are faced with two disadvantages from the encoding point of viewComplexity of Indic scripts results in relatively high number of characters per word on the averageWith UCS-2 encoding, only 70 characters are allowed per messageThe net result is that only a few words can go in a single SMS
Current ScenarioSMS in several Indic languages is currently supported by operatorsBased on proprietary picture-messaging based solutions since UCS-2 is inefficient and/or not supportedPicture-enabled handsets needed to display SMS
Indic SMS ChallengesIndic SMS challengesStandards-based solution at the same cost as English SMSInter-operability across operators and devices
RequirementsBasic requirementStandards-based solution with the same level of encoding efficiency as the 7-bit GSM alphabet Additional requirementsAll 22 official languages must be supportedSupport for bi-lingual messaging (Indic + English) Optional requirementEnable easy transliteration between Indian languages
Support for National LanguagesTwo methods have been specified for including National Language characters in a short messageSingle shift mechanism‘Escape’ character is used to signal that the next character is encoded using a National Language Shift TableLocking shift mechanismAll the characters are encoded using a National Language Locking Shift Table In both cases, the language being used is indicated by an identifier added in the message payload
Work till dateIndian language tables developedBased on the 3GPP templates for 7-bit locking and single shift tablesUsing the character sets for Indic languages defined by Unicode ConsortiumTables for 10 languages/scripts have been definedBengali, Gujarati, Hindi, Kannada, Malayalam, Oriya, Punjabi, Tamil, Telugu, and UrduAs the same script is sometimes used by more than one language, the 10 tables can support all the 22 official languages of India
Indic Locking Shift TablesLocking Shift Table of an Indic language consists of Control characters pre-defined in the template (4) The lower case English alphabet (26) The numbers 0-9 (10) A small subset of special characters commonly used in SMS (8) The commonly used characters of that language (80, at most) Non-Indic characters appear in the same positions as in the GSM tableThis ensures that even if a handset does not have the locking table, at least the English part of a bi-lingual SMS can be decoded Similar sounding characters of different languages are put in the same positions (not always possible because of differences in character sets)Enables easy transliteration
Indic Single Shift TablesSingle Shift Table of an Indic language consists of Control characters pre-defined in the template (4)The upper case English alphabet (26)Remaining special characters from default GSM alphabet (31)Characters of the Indic language which do not appear in its locking shift table (67, at most)
Key Benefits in New Encoding SchemeAn SMS written in any of the 22 official Indian languages can have up to 154 charactersNearly as many as in an English SMS!English words can be included in the message without extra overheadTransliteration from one language to another is very simpleCompatibility with Unicode character sets for Indian languagesPossibility to augment the character sets using free spaces available in the single shift tables
The ChallengesTo evolve a core system of Indic MVAS/SMS to be implemented uniformly across vendors and service providers Font rendering to make the fonts standard across all handsets Key pad design challenge How do legacy handsets which donot have this Indic feature be included in them Regulatory Aspects
English is “NOT” an Indic Script
Understanding Unicode for “Indian languages”
Language over script
Language over scriptUnicode script code pages are almost exhaustive for a “script”Support for a language is significantly simpler3GPP Indic Language Tables also adopt the simplicity“LANGUAGE” is different from a “SCRIPT”
The ChallengesTo evolve a core system of Indic MVAS/SMS to be implemented uniformly across vendors and service providers Font rendering to make the fonts standard across all handsets Key pad design challenge How do legacy handsets which donot have this Indic feature be included in them Regulatory Aspects
The text entry Challenge: The English Bias
Indianization of the key pad
Incomplete solutions!
Closing the LoopIntroducingStatistical Techniques to Language SMSFaster than ENGLISH keypadBest predictive efficiencyLeast effort in key movementsLeast number of key taps ( > 3 taps per character is unacceptable)Most IntuitiveWins 97% of the times
The ChallengesTo evolve a core system of Indic MVAS/SMS to be implemented uniformly across vendors and service providers Making SMS services cost effective. Presently India is high on the SMS versus Call costings. Penetration will be a factor of pricing. Font rendering to make the fonts standard across all handsets Key pad design challenge How do legacy handsets which donot have this Indic feature be included in them Regulatory Aspects
Issue : Backward Compatibility with Legacy DevicesA handset that supports the new 7bit Indic encoding sends an Indic language SMS to a legacy device
Solution CConversion from 7bit Indic to UCS2 and visa versaA Silent Conversion application is requiredA Mechanism by which received messages are converted to UCS2Multiple concatenated messaged can be handled.
Solution P: Conversion before deliveryRecipient phone does not support UCS2Received Message can be converted to picture messageSolution1: Before delivering to the RecipientSolution2: SIM can send it for Conversion
Solution P: Conversion after delivery
The ChallengesTo evolve a core system of Indic MVAS/SMS to be implemented uniformly across vendors and service providers Making SMS services cost effective. Presently India is high on the SMS versus Call costings. Penetration will be a factor of pricing. Font rendering to make the fonts standard across all handsets Key pad design challenge How do legacy handsets which donot have this Indic feature be included in them Regulatory Aspects
Regulatory AspectsThe standard only defines a method for representing Indian language characters in short messages using 7-bit codes Handset vendors may or may not choose to implement it A vendor may decide to support only a subset of the Indian languages for which 7-bit tables have been defined Furthermore, the 3GPP specification (TS 23.038) states that: Encoding of a message using the national locking shift mechanism is not intended to be implemented until a formal request is issued by the relevant national regulatory body Regulatory intervention is required to mandate the support for 7-bit Encoding and other necessary features required for Indic SMS
ThanksCreditsDr. NadeemAkhtar (CeWIT)Biju R Balagopal (ComViva Technologies)Sharad Sharma (NASSCOM)SwaranLata (TDIL)VikasPhogat (SagemOrga)VivekananadaPani (Reverie Technologies)

More Related Content

PPTX
Machine translation
PPTX
ivr system
PDF
Imrb Iamai Mobile Vas Report August 2008
PPT
Mobile Value Added Services in India
PDF
Value added service (VAS) provider failures: insights from VAS product companies
PDF
Developing Value Added Services (VAS) and Product Roadmap for Telecoms Operators
PDF
Ce WiT Represented at The Mobile VAS SUMMIT 2009 by Vitue Insight
PPT
English on the move:learning English via mobile phones in Africa
Machine translation
ivr system
Imrb Iamai Mobile Vas Report August 2008
Mobile Value Added Services in India
Value added service (VAS) provider failures: insights from VAS product companies
Developing Value Added Services (VAS) and Product Roadmap for Telecoms Operators
Ce WiT Represented at The Mobile VAS SUMMIT 2009 by Vitue Insight
English on the move:learning English via mobile phones in Africa

Similar to Language Sms (20)

PDF
MULTILINGUAL CONVERSATION ASCII TO UNICODE IN INDIC SCRIPT
PPSX
Voice Sms Corp 2010
DOCX
sms lang analysis
DOC
[EN] "Multilingual Information and Retrieval Systems Technology and Applicati...
DOC
[EN] Multilingual Information and Retrieval Systems, Technology and Applicati...
PPTX
Sms (A strange way of communication)
PPT
Mobile Message Delivery: SMS and MMS_Michael Hanley
PPT
Panini keypad for handset manufacturers
PPTX
Silent Talks
PPT
Vas Presentation
PDF
Real time text communication - making it real ITAG 2011
PPTX
"Saral" Devanagari text input system for Mobile Phones
PPT
Welcome To The World Of Phones
PPT
What's been cooking in IAMAI india
PPT
What's been cooking in india: Presentation at Indian Digital Summit, 2012
PDF
International Marketing- Plan for Global Rollout of Google Android Mobile OS
KEY
Experience Design 2
PPTX
Make a shorter list by entering letters via DTMF
PDF
Mobile Marketing Platform v.4
PPT
For students of engineering in India
MULTILINGUAL CONVERSATION ASCII TO UNICODE IN INDIC SCRIPT
Voice Sms Corp 2010
sms lang analysis
[EN] "Multilingual Information and Retrieval Systems Technology and Applicati...
[EN] Multilingual Information and Retrieval Systems, Technology and Applicati...
Sms (A strange way of communication)
Mobile Message Delivery: SMS and MMS_Michael Hanley
Panini keypad for handset manufacturers
Silent Talks
Vas Presentation
Real time text communication - making it real ITAG 2011
"Saral" Devanagari text input system for Mobile Phones
Welcome To The World Of Phones
What's been cooking in IAMAI india
What's been cooking in india: Presentation at Indian Digital Summit, 2012
International Marketing- Plan for Global Rollout of Google Android Mobile OS
Experience Design 2
Make a shorter list by entering letters via DTMF
Mobile Marketing Platform v.4
For students of engineering in India
Ad

More from Manas Ganguly (20)

PDF
Brightsandz_Profile (1)
PPTX
OTT versus Operator
PDF
GSMA Mobile Economy India Report 2013
PPTX
Data business - The Indian Context
PDF
India Data Traffic – The Broadband Story
PPTX
LTE-Operational Challenges & Deployment conundrum
PDF
Data in indian context – networks & traffic
PPTX
Carrier IQ
PPTX
Profiling siri
PPTX
India @ 64
PPTX
Smart TV in Indian Context: Great product, limited functionality
PDF
Mobile Media: Future of Mobile media powered by Applications
PPTX
The Real Brand Owners
PPTX
Tablets
PPTX
Coping with Google
PPTX
How android 2
PPTX
State Of Internet In India
PPTX
The Indian Mobile Consumer
PPTX
Google’s Mobile Strategy
PPT
Digital Dividend
Brightsandz_Profile (1)
OTT versus Operator
GSMA Mobile Economy India Report 2013
Data business - The Indian Context
India Data Traffic – The Broadband Story
LTE-Operational Challenges & Deployment conundrum
Data in indian context – networks & traffic
Carrier IQ
Profiling siri
India @ 64
Smart TV in Indian Context: Great product, limited functionality
Mobile Media: Future of Mobile media powered by Applications
The Real Brand Owners
Tablets
Coping with Google
How android 2
State Of Internet In India
The Indian Mobile Consumer
Google’s Mobile Strategy
Digital Dividend
Ad

Language Sms

  • 1. Vernacular SMS and MVASA tool for mass communication in near futureExploring Opportunities and Challenges in India
  • 2. Mobile: Socio-economic impact in IndiaImpact of Increase in Mobile Penetration on Growth in GD1st order of impact of increase in mobile penetration on growth in GDP (impact of voice primarily by allowing people to connect / talk)VAS and off-deck usage will bring in 2nd order of impact on GDP growth
  • 3. VAS: Taking delivery thru mobile further Universal Reach / Inclusive GrowthEnterprise / SME / Rural / Poor urban areasLeapfrogging the PCLow cost access medium for entertainment, informationUtility of Content and ServicesCommerce, Rural applications, Local ContentDriving InnovationLocalization, Voice based VAS
  • 4. The future of MVASA INR 121,000 crore industry over the next 5 yearsVAS as % of total telecom revenue in India remains lower as compared to other markets today; However it is expected to contribute INR 35,400 crores in 2013
  • 5. Off deck mobile VASKey Growth drivers for Off deck Mobile VASEvolving ecosystem, continued push by D2C players and emergence of alternate billing channels will be the key growth drivers for off-deck mobile VAS in India
  • 6. The importance of SMSA2P SMS remains the most effective and timely way to reach out to people (for business, social organizations & govt.) & is therefore critical from a social and economic perspective
  • 7. SMS: Evolution of an effective mass communication mediaOften overlooked, this A2P SMS is a very powerful mass market medium. With 400 mn expected subs and 5 A2P SMS/day this corresponds to 2 Billion pieces of information delivered per day
  • 8. Reach of Indian Languages10 Million Indians study Hindi every year. Similar numbers for others.Top 10 Indian newspapers are regionalTop TV channels are vernacular. Star TV converted 100% to HindiIn urban India, only 37% of the literate population knows English; the number drops to 17% in rural areas10 out of top 30 most-widely used languages worldwide are Indian 8 out of top 50 best-selling newspapers in the world are published in an Indian languageCompared to only 3 English language papers from India in the list Despite these impressive statistics, not a single Indian language makes to the list to most-widely used languages in the mobile/Internet world
  • 9. SMS Usage in IndiaSMS continues to be most popular non-voice service but usage levels are still low compared to other countries
  • 10. Explaining the GapOne of the contributing factors towards high SMS usage in Philippines and China is pricing In Philippines, SMS started as a free service! Voice calls are cheaper in India but SMS pricing levels are relatively higher Another important factor is the use of standards-based solutions for local language SMSIn Philippines, local languages use Roman script, so 7-bit default GSM alphabet can be used The inherent efficiency of Chinese language mitigates the inefficiency of 2- byte UCS-2 encoding Both countries have lower English-literacy levels compared to IndiaLocal language SMS is used heavily
  • 11. The ChallengesEstablish a standard in the eco-systemTelecom OperatorsDevice ManufacturersApplication Providers
  • 12. The ChallengesTo evolve a core system of Indic MVAS/SMS to be implemented uniformly across vendors and service providers Font rendering to make the fonts standard across all handsets Key pad design challenge How do legacy handsets which donot have this Indic feature be included in them Regulatory Aspects
  • 13. The ChallengesTo evolve a core system of Indic MVAS/SMS to be implemented uniformly across vendors and service providers Font rendering to make the fonts standard across all handsets Key pad design challenge How do legacy handsets which donot have this Indic feature be included in them Regulatory Aspects
  • 14. Existing Coding schemes for messagingGSM Alphabet – Supports languages that use Latin character set UCS-2 – Supports all major languages of the world User-defined – Any arbitrary character set can be defined
  • 15. Encoding Indian Languages in UCS-2Users who want to send an SMS in an Indian language are faced with two disadvantages from the encoding point of viewComplexity of Indic scripts results in relatively high number of characters per word on the averageWith UCS-2 encoding, only 70 characters are allowed per messageThe net result is that only a few words can go in a single SMS
  • 16. Current ScenarioSMS in several Indic languages is currently supported by operatorsBased on proprietary picture-messaging based solutions since UCS-2 is inefficient and/or not supportedPicture-enabled handsets needed to display SMS
  • 17. Indic SMS ChallengesIndic SMS challengesStandards-based solution at the same cost as English SMSInter-operability across operators and devices
  • 18. RequirementsBasic requirementStandards-based solution with the same level of encoding efficiency as the 7-bit GSM alphabet Additional requirementsAll 22 official languages must be supportedSupport for bi-lingual messaging (Indic + English) Optional requirementEnable easy transliteration between Indian languages
  • 19. Support for National LanguagesTwo methods have been specified for including National Language characters in a short messageSingle shift mechanism‘Escape’ character is used to signal that the next character is encoded using a National Language Shift TableLocking shift mechanismAll the characters are encoded using a National Language Locking Shift Table In both cases, the language being used is indicated by an identifier added in the message payload
  • 20. Work till dateIndian language tables developedBased on the 3GPP templates for 7-bit locking and single shift tablesUsing the character sets for Indic languages defined by Unicode ConsortiumTables for 10 languages/scripts have been definedBengali, Gujarati, Hindi, Kannada, Malayalam, Oriya, Punjabi, Tamil, Telugu, and UrduAs the same script is sometimes used by more than one language, the 10 tables can support all the 22 official languages of India
  • 21. Indic Locking Shift TablesLocking Shift Table of an Indic language consists of Control characters pre-defined in the template (4) The lower case English alphabet (26) The numbers 0-9 (10) A small subset of special characters commonly used in SMS (8) The commonly used characters of that language (80, at most) Non-Indic characters appear in the same positions as in the GSM tableThis ensures that even if a handset does not have the locking table, at least the English part of a bi-lingual SMS can be decoded Similar sounding characters of different languages are put in the same positions (not always possible because of differences in character sets)Enables easy transliteration
  • 22. Indic Single Shift TablesSingle Shift Table of an Indic language consists of Control characters pre-defined in the template (4)The upper case English alphabet (26)Remaining special characters from default GSM alphabet (31)Characters of the Indic language which do not appear in its locking shift table (67, at most)
  • 23. Key Benefits in New Encoding SchemeAn SMS written in any of the 22 official Indian languages can have up to 154 charactersNearly as many as in an English SMS!English words can be included in the message without extra overheadTransliteration from one language to another is very simpleCompatibility with Unicode character sets for Indian languagesPossibility to augment the character sets using free spaces available in the single shift tables
  • 24. The ChallengesTo evolve a core system of Indic MVAS/SMS to be implemented uniformly across vendors and service providers Font rendering to make the fonts standard across all handsets Key pad design challenge How do legacy handsets which donot have this Indic feature be included in them Regulatory Aspects
  • 25. English is “NOT” an Indic Script
  • 26. Understanding Unicode for “Indian languages”
  • 28. Language over scriptUnicode script code pages are almost exhaustive for a “script”Support for a language is significantly simpler3GPP Indic Language Tables also adopt the simplicity“LANGUAGE” is different from a “SCRIPT”
  • 29. The ChallengesTo evolve a core system of Indic MVAS/SMS to be implemented uniformly across vendors and service providers Font rendering to make the fonts standard across all handsets Key pad design challenge How do legacy handsets which donot have this Indic feature be included in them Regulatory Aspects
  • 30. The text entry Challenge: The English Bias
  • 33. Closing the LoopIntroducingStatistical Techniques to Language SMSFaster than ENGLISH keypadBest predictive efficiencyLeast effort in key movementsLeast number of key taps ( > 3 taps per character is unacceptable)Most IntuitiveWins 97% of the times
  • 34. The ChallengesTo evolve a core system of Indic MVAS/SMS to be implemented uniformly across vendors and service providers Making SMS services cost effective. Presently India is high on the SMS versus Call costings. Penetration will be a factor of pricing. Font rendering to make the fonts standard across all handsets Key pad design challenge How do legacy handsets which donot have this Indic feature be included in them Regulatory Aspects
  • 35. Issue : Backward Compatibility with Legacy DevicesA handset that supports the new 7bit Indic encoding sends an Indic language SMS to a legacy device
  • 36. Solution CConversion from 7bit Indic to UCS2 and visa versaA Silent Conversion application is requiredA Mechanism by which received messages are converted to UCS2Multiple concatenated messaged can be handled.
  • 37. Solution P: Conversion before deliveryRecipient phone does not support UCS2Received Message can be converted to picture messageSolution1: Before delivering to the RecipientSolution2: SIM can send it for Conversion
  • 38. Solution P: Conversion after delivery
  • 39. The ChallengesTo evolve a core system of Indic MVAS/SMS to be implemented uniformly across vendors and service providers Making SMS services cost effective. Presently India is high on the SMS versus Call costings. Penetration will be a factor of pricing. Font rendering to make the fonts standard across all handsets Key pad design challenge How do legacy handsets which donot have this Indic feature be included in them Regulatory Aspects
  • 40. Regulatory AspectsThe standard only defines a method for representing Indian language characters in short messages using 7-bit codes Handset vendors may or may not choose to implement it A vendor may decide to support only a subset of the Indian languages for which 7-bit tables have been defined Furthermore, the 3GPP specification (TS 23.038) states that: Encoding of a message using the national locking shift mechanism is not intended to be implemented until a formal request is issued by the relevant national regulatory body Regulatory intervention is required to mandate the support for 7-bit Encoding and other necessary features required for Indic SMS
  • 41. ThanksCreditsDr. NadeemAkhtar (CeWIT)Biju R Balagopal (ComViva Technologies)Sharad Sharma (NASSCOM)SwaranLata (TDIL)VikasPhogat (SagemOrga)VivekananadaPani (Reverie Technologies)