SlideShare a Scribd company logo
What´s New? SAP HANA SPS 07
Text Analysis
(Delta from SPS 06 to SPS 07)
SAP HANA Product Management

November, 2013
Agenda
New or Improved Text Analysis Features
Custom dictionaries
Custom configurations
Indexing throughput

Improved Language Coverage
Social Media extraction for Japanese & Simplified Chinese
Numerical extraction for Simplified Chinese
Core extraction for Russian
Voice of Customer for Simplified Chinese

Related Topics
Fulltext search

Fuzzy search

© 2013 SAP AG. All rights reserved.

Public

2
New or Improved Text Analysis
Features
New Custom Dictionary Support
You can now specify your own entity types and names to be used with text analysis, which may be
critical for particular industries or data domains
 Single custom dictionary may support all languages or a single language
 Custom dictionaries reside in the HANA repository and benefit from its life cycle management

Steps
1.
2.
3.
4.
5.

Choose the project to contain the new dictionary in the Development perspective of SAP HANA Studio.
Enter or select a parent folder and enter the dictionary file name in the Wizard. Your text analysis dictionary file is created locally and opens as
an empty file in the text editor.
Enter your text analysis dictionary specification into the new file and save it locally.
Commit your new dictionary. The dictionary is now synchronized to the repository as a design time object and the icon shows the dictionary is
committed.
Activate once you have finished editing your dictionary. The dictionary is created in the repository as a runtime object and the icon shows the
dictionary is activated. This allows you and others to use the dictionary. If you haven’t done so previously, you will need to create a custom text
analysis configuration as well…

© 2013 SAP AG. All rights reserved.

Public

4
New Custom Configuration Support
You can now customize the features and options used for text analysis rather than using the
predefined configurations:






LINGANALYSIS_BASIC
LINGANALYSIS_STEMS
LINGANALYSIS_FULL
EXTRACTION_CORE
EXTRACTION_CORE_VOICEOFCUSTOMER

Custom configurations allow you to suppress the default output and incorporate custom dictionaries.
You can either:
 Create a new XML configuration file within SAP HANA Studio
 Copy one of the predefined configurations and modify it

© 2013 SAP AG. All rights reserved.

Public

5
Greater Indexing Throughput
Improved scalability of the highlighted preprocessing steps:
 File filtering
– converting binary document formats to text/HTML

 Tokenization
– decompose word sequence, e.g. “the quick brown fox” -> “the” “quick” “brown” “fox”

 Stemming
– reduction of tokens to linguistic base form, e.g. houses -> house; ran -> run

 Linguistic analysis

30% less time
Depending upon hardware configuration

– part-of-speech identification, e.g. quick: Adjective; houses: Plural Noun

Utilizes more threads and efficient data transfers
 Applies to all text analysis configurations

50% greater throughput
Depending upon hardware configuration

© 2013 SAP AG. All rights reserved.

Public

6
Improved Language Coverage
Available Text Analysis Configuration Options
Language

LINGANALYSIS_FULL

EXTRACTION_CORE

EXTRACTION_CORE_VOICEOFCUSTOMER

Arabic

LINGANALYSIS_BASIC
LINGANALYSIS_STEMS






X

Catalan





X

X

Chinese (Simplified)





IMPROVED

IMPROVED

Chinese (Traditional)





X

X

Croatian





X

X

Czech





X

X

Danish





X

X

Dutch







X

English









Farsi







X

French









German









Greek



X

X

X

Hebrew



X

X

X

Hungarian



X

X

X

Italian







X

Japanese





IMPROVED

X

Korean







X

Norwegian (Bokmal)





X

X

Norwegian (Nynorsk)





X

X

Polish



X

X

X

Portuguese







X

Romanian



X

X

X

Russian





IMPROVED

X

Serbian





X

X

Slovak





X

X

Slovenian





X

X

Spanish









Swedish





X

X

Thai



X

X

X

Turkish



X

X

X

© 2013 SAP AG. All rights reserved.

Public

8
Improved Social Media Extraction for Japanese & Simplified Chinese
Identifies with high recall and precision SOCIAL_MEDIA entities with corresponding offsets





Tags SOCIAL_MEDIA entities such as IDs (@MyTwitterName) or topics (#MyWeiboKeyword)
Distinguishes between SOCIAL_MEDIA entities and emoticons like @__@
Distinguishes between SOCIAL_MEDIA entities and emails like myname@domain.com
Respects important Weibo and Twitter differences, Ex: #W-TOPIC# vs. #T-TOPIC1 #T-TOPIC2

© 2013 SAP AG. All rights reserved.

Public

9
Improved Numerical Extraction for Simplified Chinese
Better identifies numerical entities with special characters
 CURRENCY – expressions denoting amounts of money
– 33.8万元
– 港币五千万
– 一百四十四亿七千万美元

 DATE – minimally composed of a number and month name
– 7月2日
– 十月十七日

 MEASURE – expressions
– 二百五十六公斤
– 5.5米

 TIME – clock times and time expressions
– 8时
– 3点零5分

© 2013 SAP AG. All rights reserved.

Public

10
Additional Predefined Core Extractions for Russian
TITLE
PERSON
PEOPLE
LANGUAGE

President
Barak Obama
Greeks
Greek

ADDRESS1
ADDRESS2
LOCALITY
REGION@MINOR
REGION@MAJOR
COUNTRY
CONTINENT
GEO_FEATURE
GEO_AREA

245 First Street Floor 16
Cambridge, MA 02142
Cambridge
Napa Country
Connecticut
Brazil
South America
Mount Fuji
Scandinavia

ORGANIZATION@COMMERCIAL
ORGANIZATION@EDUCATIONAL
ORGANIZATION@OTHER
PRODUCT
TICKER

AT&T
University of Washington
FBI
iPhone
NYSE:SAP

SOCIAL_MEDIA@TWITTER_ID
SOCIAL_MEDIA@TWITTER_TOPIC

DATE
DAY
MONTH
YEAR
TIME
TIME_PERIOD
HOLIDAY

2/14/2011
Monday
June
2011
3:47pm
3 days, from 9 to 5pm
Memorial Day

CURRENCY

17 euros

MEASURE
PERCENT

217 meters
4%

PHONE
URI@EMAIL
URI@IP
URI@URL

617-677-2030
john.smith@sap.com
165.14.2.0
http://sap.com

Syntactic Entities:
NOUN_GROUP
PROP_MISC

big umbrella
Cup o’ Soup

@SAP
#HANA

© 2013 SAP AG. All rights reserved.

Public

11
Improved Voice of Customer Extraction for Simplified Chinese
The following major fact types are classified:






Sentiments: expression of a customer’s feelings about something
Problems: a statement about something which impedes a customer’s work
Requests: expression of a customer’s desire for an enhancement/change
Profanity: defines a set of pejorative vocabulary
Emoticons: expression of someone's feelings about the whole sentence or situation

Focuses on finer extraction of online reviews and implementing customer feedback
 Dramatic overall improvement in stances and topics
 Recall and precision testing results jumped significantly higher

© 2013 SAP AG. All rights reserved.

Public

12
Disclaimer
This presentation outlines our general product direction and should not be relied on in making
a purchase decision. This presentation is not subject to your license agreement or any other
agreement with SAP.
SAP has no obligation to pursue any course of business outlined in this presentation or to
develop or release any functionality mentioned in this presentation. This presentation and
SAP’s strategy and possible future developments are subject to change and may be changed
by SAP at any time for any reason without notice.
This document is provided without a warranty of any kind, either express or implied, including
but not limited to, the implied warranties of merchantability, fitness for a particular purpose, or
non-infringement. SAP assumes no responsibility for errors or omissions in this
document, except if such damages were caused by SAP intentionally or grossly negligent.

© 2013 SAP AG. All rights reserved.

Public

13
Thank you
Contact information
Anthony Waite
SAP HANA Product Management
AskSAPHANA@sap.com
To get the best overview of what’s new in SAP HANA SPS 07, read this blog.
© 2013 SAP AG. All rights reserved.
No part of this publication may be reproduced or transmitted in any form or for any purpose without the express permission of SAP AG.
The information contained herein may be changed without prior notice.
Some software products marketed by SAP AG and its distributors contain proprietary software components of other software vendors.
National product specifications may vary.
These materials are provided by SAP AG and its affiliated companies ("SAP Group") for informational purposes only, without representation or warranty of any kind, and
SAP Group shall not be liable for errors or omissions with respect to the materials. The only warranties for SAP Group products and services are those that are set forth in
the express warranty statements accompanying such products and services, if any. Nothing herein should be construed as constituting an additional warranty.
SAP and other SAP products and services mentioned herein as well as their respective logos are trademarks or registered trademarks of SAP AG in Germany and other
countries.
Please see http://www.sap.com/corporate-en/legal/copyright/index.epx#trademark for additional trademark information and notices.

© 2013 SAP AG. All rights reserved.

Public

15
© 2013 SAP AG. Alle Rechte vorbehalten.
Weitergabe und Vervielfältigung dieser Publikation oder von Teilen daraus sind, zu welchem Zweck und in welcher Form auch immer, ohne die ausdrückliche schriftliche
Genehmigung durch SAP AG nicht gestattet. In dieser Publikation enthaltene Informationen können ohne vorherige Ankündigung geändert werden.
Einige der von der SAP AG und ihren Distributoren vermarkteten Softwareprodukte enthalten proprietäre Softwarekomponenten anderer Softwareanbieter.
Produkte können länderspezifische Unterschiede aufweisen.
Die vorliegenden Unterlagen werden von der SAP AG und ihren Konzernunternehmen („SAP-Konzern“) bereitgestellt und dienen ausschließlich zu Informationszwecken.
Der SAP-Konzern übernimmt keinerlei Haftung oder Gewährleistung für Fehler oder Unvollständigkeiten in dieser Publikation. Der SAP-Konzern steht lediglich für Produkte
und Dienstleistungen nach der Maßgabe ein, die in der Vereinbarung über die jeweiligen Produkte und Dienstleistungen ausdrücklich geregelt ist. Keine der hierin
enthaltenen Informationen ist als zusätzliche Garantie zu interpretieren.
SAP und andere in diesem Dokument erwähnte Produkte und Dienstleistungen von SAP sowie die dazugehörigen Logos sind Marken oder eingetragene Marken der SAP
AG in Deutschland und verschiedenen anderen Ländern weltweit. Weitere Hinweise und Informationen zum Markenrecht finden Sie unter http://www.sap.com/corporateen/legal/copyright/index.epx#trademark.

© 2013 SAP AG. All rights reserved.

Public

16

More Related Content

PDF
SAP HANA SPS09 - Text Analysis
PDF
Text Analysis with SAP HANA
PDF
SAP HANA SPS09 - Full-text Search
PPTX
Text Analysis with SAP HANA
PPTX
HANA SPS07 Fulltext Search
PDF
SAP HANA SPS10- Text Analysis & Text Mining
PPTX
What's new for Text in SAP HANA SPS 11
PDF
SAP HANA SPS10- SHINE
SAP HANA SPS09 - Text Analysis
Text Analysis with SAP HANA
SAP HANA SPS09 - Full-text Search
Text Analysis with SAP HANA
HANA SPS07 Fulltext Search
SAP HANA SPS10- Text Analysis & Text Mining
What's new for Text in SAP HANA SPS 11
SAP HANA SPS10- SHINE

What's hot (20)

PDF
What's New in SAP HANA SPS 11 Operations
PDF
SAP HANA SPS10- SAP HANA Development Tools
PDF
SAP HANA SPS09 - HANA Modeling
PDF
SAP HANA SPS10- Extended Application Services (XS) Programming Model
PPTX
HANA SPS07 Fuzzy Search
PDF
SAP HANA SPS10- SQLScript
PDF
SAP HANA SPS09 - HANA IM Services
PPTX
What's New for SAP HANA Smart Data Integration & Smart Data Quality
PDF
SAP HANA SPS09 - SAP HANA Core & SQL
PDF
SAP HANA SPS10- Predictive Analysis Library and Application Function Modeler
PPTX
HANA SPS07 Smart Data Access
PDF
SAP HANA SPS09 - XS Programming Model
PDF
SAP HANA SPS09 - Development Tools
PPTX
SAP Helps Reduce Silos Between Business and Spatial Data
PDF
Spark Usage in Enterprise Business Operations
PDF
Dmm203 – new approaches for data modelingwith sap hana
PDF
Building Custom Advanced Analytics Applications with SAP HANA
PDF
Hana sql
PPTX
HANA SPS07 Shine
PDF
Technical Overview of CDS View – SAP HANA Part I
What's New in SAP HANA SPS 11 Operations
SAP HANA SPS10- SAP HANA Development Tools
SAP HANA SPS09 - HANA Modeling
SAP HANA SPS10- Extended Application Services (XS) Programming Model
HANA SPS07 Fuzzy Search
SAP HANA SPS10- SQLScript
SAP HANA SPS09 - HANA IM Services
What's New for SAP HANA Smart Data Integration & Smart Data Quality
SAP HANA SPS09 - SAP HANA Core & SQL
SAP HANA SPS10- Predictive Analysis Library and Application Function Modeler
HANA SPS07 Smart Data Access
SAP HANA SPS09 - XS Programming Model
SAP HANA SPS09 - Development Tools
SAP Helps Reduce Silos Between Business and Spatial Data
Spark Usage in Enterprise Business Operations
Dmm203 – new approaches for data modelingwith sap hana
Building Custom Advanced Analytics Applications with SAP HANA
Hana sql
HANA SPS07 Shine
Technical Overview of CDS View – SAP HANA Part I
Ad

Similar to HANA SPS07 Text Analysis (20)

PPTX
HANA SPS07 Studio Development Perspective
PDF
Testing SAP HANA applications with SAP LoadRunner by HP
PDF
Master guide cdmc
PDF
Master guide cdmc
PPTX
HANA SPS07 SQL Script
PDF
SAP HANA SPS09 - SQLScript
PDF
Dmm117 – SAP HANA Processing Services Text Spatial Graph Series and Predictive
PDF
Enable End-to-End Digital Government Transformation with SAP Solutions
PPTX
HANA SPS07 River
PDF
How to build an agentry based mobile app from scratch connecting to an sap ba...
PDF
How to use abap cds for data provisioning in bw
PPTX
SAP HANA Adoption Press Briefing Japan (Paul Marriott @pmmarriott, Paul Young)
PPTX
SAP HANA and SAP Controlling – New Opportunities and New Challenges
PPTX
SAP HANA and SAP Controlling – New Opportunities and New Challenges
PDF
Interactive SAP Big Data Overview
PPTX
SAP HANA SPS08 SQLScript
PPTX
Getting Started with BI Analytics on HANA
PDF
SAP Analytics Overview and Strategy
PDF
End user experience monitoring
PDF
26764 Waldemar Adams 151116 BCN SAP Select
HANA SPS07 Studio Development Perspective
Testing SAP HANA applications with SAP LoadRunner by HP
Master guide cdmc
Master guide cdmc
HANA SPS07 SQL Script
SAP HANA SPS09 - SQLScript
Dmm117 – SAP HANA Processing Services Text Spatial Graph Series and Predictive
Enable End-to-End Digital Government Transformation with SAP Solutions
HANA SPS07 River
How to build an agentry based mobile app from scratch connecting to an sap ba...
How to use abap cds for data provisioning in bw
SAP HANA Adoption Press Briefing Japan (Paul Marriott @pmmarriott, Paul Young)
SAP HANA and SAP Controlling – New Opportunities and New Challenges
SAP HANA and SAP Controlling – New Opportunities and New Challenges
Interactive SAP Big Data Overview
SAP HANA SPS08 SQLScript
Getting Started with BI Analytics on HANA
SAP Analytics Overview and Strategy
End user experience monitoring
26764 Waldemar Adams 151116 BCN SAP Select
Ad

More from SAP Technology (20)

PPTX
SAP Integration Suite L1
PDF
Future-Proof Your Business Processes by Automating SAP S/4HANA processes with...
PDF
7 Top Reasons to Automate Processes with SAP Intelligent Robotic Processes Au...
PDF
Extend SAP S/4HANA to deliver real-time intelligent processes
PDF
Process optimization and automation for SAP S/4HANA with SAP’s Business Techn...
PDF
Accelerate your journey to SAP S/4HANA with SAP’s Business Technology Platform
PDF
Accelerate Your Move to an Intelligent Enterprise with SAP Cloud Platform and...
PDF
Transform your business with intelligent insights and SAP S/4HANA
PDF
SAP Cloud Platform for SAP S/4HANA: Accelerate your move to an Intelligent En...
PPTX
Innovate collaborative applications with SAP Jam Collaboration & SAP Cloud Pl...
PDF
The IoT Imperative for Consumer Products
PDF
The IoT Imperative for Discrete Manufacturers - Automotive, Aerospace & Defen...
PDF
IoT is Enabling a New Era of Shareholder Value in Energy and Natural Resource...
PDF
The IoT Imperative in Government and Healthcare
PDF
SAP S/4HANA Finance and the Digital Core
PDF
Five Reasons To Skip SAP Suite on HANA and Go Directly to SAP S/4HANA
PDF
Why SAP HANA?
PPTX
Spotlight on Financial Services with Calypso and SAP ASE
PPTX
SAP ASE 16 SP02 Performance Features
PPTX
What's New in SAP HANA SPS 11 Application Lifecycle Management
SAP Integration Suite L1
Future-Proof Your Business Processes by Automating SAP S/4HANA processes with...
7 Top Reasons to Automate Processes with SAP Intelligent Robotic Processes Au...
Extend SAP S/4HANA to deliver real-time intelligent processes
Process optimization and automation for SAP S/4HANA with SAP’s Business Techn...
Accelerate your journey to SAP S/4HANA with SAP’s Business Technology Platform
Accelerate Your Move to an Intelligent Enterprise with SAP Cloud Platform and...
Transform your business with intelligent insights and SAP S/4HANA
SAP Cloud Platform for SAP S/4HANA: Accelerate your move to an Intelligent En...
Innovate collaborative applications with SAP Jam Collaboration & SAP Cloud Pl...
The IoT Imperative for Consumer Products
The IoT Imperative for Discrete Manufacturers - Automotive, Aerospace & Defen...
IoT is Enabling a New Era of Shareholder Value in Energy and Natural Resource...
The IoT Imperative in Government and Healthcare
SAP S/4HANA Finance and the Digital Core
Five Reasons To Skip SAP Suite on HANA and Go Directly to SAP S/4HANA
Why SAP HANA?
Spotlight on Financial Services with Calypso and SAP ASE
SAP ASE 16 SP02 Performance Features
What's New in SAP HANA SPS 11 Application Lifecycle Management

Recently uploaded (20)

PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
PDF
WOOl fibre morphology and structure.pdf for textiles
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PDF
A comparative analysis of optical character recognition models for extracting...
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Hindi spoken digit analysis for native and non-native speakers
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
PPTX
TLE Review Electricity (Electricity).pptx
PDF
DP Operators-handbook-extract for the Mautical Institute
PDF
Accuracy of neural networks in brain wave diagnosis of schizophrenia
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Transform Your ITIL® 4 & ITSM Strategy with AI in 2025.pdf
PDF
Heart disease approach using modified random forest and particle swarm optimi...
PDF
August Patch Tuesday
PDF
Encapsulation theory and applications.pdf
PPTX
TechTalks-8-2019-Service-Management-ITIL-Refresh-ITIL-4-Framework-Supports-Ou...
PDF
NewMind AI Weekly Chronicles - August'25-Week II
PDF
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
WOOl fibre morphology and structure.pdf for textiles
MIND Revenue Release Quarter 2 2025 Press Release
A comparative analysis of optical character recognition models for extracting...
Encapsulation_ Review paper, used for researhc scholars
Hindi spoken digit analysis for native and non-native speakers
Assigned Numbers - 2025 - Bluetooth® Document
gpt5_lecture_notes_comprehensive_20250812015547.pdf
TLE Review Electricity (Electricity).pptx
DP Operators-handbook-extract for the Mautical Institute
Accuracy of neural networks in brain wave diagnosis of schizophrenia
Unlocking AI with Model Context Protocol (MCP)
Transform Your ITIL® 4 & ITSM Strategy with AI in 2025.pdf
Heart disease approach using modified random forest and particle swarm optimi...
August Patch Tuesday
Encapsulation theory and applications.pdf
TechTalks-8-2019-Service-Management-ITIL-Refresh-ITIL-4-Framework-Supports-Ou...
NewMind AI Weekly Chronicles - August'25-Week II
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf

HANA SPS07 Text Analysis

  • 1. What´s New? SAP HANA SPS 07 Text Analysis (Delta from SPS 06 to SPS 07) SAP HANA Product Management November, 2013
  • 2. Agenda New or Improved Text Analysis Features Custom dictionaries Custom configurations Indexing throughput Improved Language Coverage Social Media extraction for Japanese & Simplified Chinese Numerical extraction for Simplified Chinese Core extraction for Russian Voice of Customer for Simplified Chinese Related Topics Fulltext search Fuzzy search © 2013 SAP AG. All rights reserved. Public 2
  • 3. New or Improved Text Analysis Features
  • 4. New Custom Dictionary Support You can now specify your own entity types and names to be used with text analysis, which may be critical for particular industries or data domains  Single custom dictionary may support all languages or a single language  Custom dictionaries reside in the HANA repository and benefit from its life cycle management Steps 1. 2. 3. 4. 5. Choose the project to contain the new dictionary in the Development perspective of SAP HANA Studio. Enter or select a parent folder and enter the dictionary file name in the Wizard. Your text analysis dictionary file is created locally and opens as an empty file in the text editor. Enter your text analysis dictionary specification into the new file and save it locally. Commit your new dictionary. The dictionary is now synchronized to the repository as a design time object and the icon shows the dictionary is committed. Activate once you have finished editing your dictionary. The dictionary is created in the repository as a runtime object and the icon shows the dictionary is activated. This allows you and others to use the dictionary. If you haven’t done so previously, you will need to create a custom text analysis configuration as well… © 2013 SAP AG. All rights reserved. Public 4
  • 5. New Custom Configuration Support You can now customize the features and options used for text analysis rather than using the predefined configurations:      LINGANALYSIS_BASIC LINGANALYSIS_STEMS LINGANALYSIS_FULL EXTRACTION_CORE EXTRACTION_CORE_VOICEOFCUSTOMER Custom configurations allow you to suppress the default output and incorporate custom dictionaries. You can either:  Create a new XML configuration file within SAP HANA Studio  Copy one of the predefined configurations and modify it © 2013 SAP AG. All rights reserved. Public 5
  • 6. Greater Indexing Throughput Improved scalability of the highlighted preprocessing steps:  File filtering – converting binary document formats to text/HTML  Tokenization – decompose word sequence, e.g. “the quick brown fox” -> “the” “quick” “brown” “fox”  Stemming – reduction of tokens to linguistic base form, e.g. houses -> house; ran -> run  Linguistic analysis 30% less time Depending upon hardware configuration – part-of-speech identification, e.g. quick: Adjective; houses: Plural Noun Utilizes more threads and efficient data transfers  Applies to all text analysis configurations 50% greater throughput Depending upon hardware configuration © 2013 SAP AG. All rights reserved. Public 6
  • 8. Available Text Analysis Configuration Options Language LINGANALYSIS_FULL EXTRACTION_CORE EXTRACTION_CORE_VOICEOFCUSTOMER Arabic LINGANALYSIS_BASIC LINGANALYSIS_STEMS    X Catalan   X X Chinese (Simplified)   IMPROVED IMPROVED Chinese (Traditional)   X X Croatian   X X Czech   X X Danish   X X Dutch    X English     Farsi    X French     German     Greek  X X X Hebrew  X X X Hungarian  X X X Italian    X Japanese   IMPROVED X Korean    X Norwegian (Bokmal)   X X Norwegian (Nynorsk)   X X Polish  X X X Portuguese    X Romanian  X X X Russian   IMPROVED X Serbian   X X Slovak   X X Slovenian   X X Spanish     Swedish   X X Thai  X X X Turkish  X X X © 2013 SAP AG. All rights reserved. Public 8
  • 9. Improved Social Media Extraction for Japanese & Simplified Chinese Identifies with high recall and precision SOCIAL_MEDIA entities with corresponding offsets     Tags SOCIAL_MEDIA entities such as IDs (@MyTwitterName) or topics (#MyWeiboKeyword) Distinguishes between SOCIAL_MEDIA entities and emoticons like @__@ Distinguishes between SOCIAL_MEDIA entities and emails like myname@domain.com Respects important Weibo and Twitter differences, Ex: #W-TOPIC# vs. #T-TOPIC1 #T-TOPIC2 © 2013 SAP AG. All rights reserved. Public 9
  • 10. Improved Numerical Extraction for Simplified Chinese Better identifies numerical entities with special characters  CURRENCY – expressions denoting amounts of money – 33.8万元 – 港币五千万 – 一百四十四亿七千万美元  DATE – minimally composed of a number and month name – 7月2日 – 十月十七日  MEASURE – expressions – 二百五十六公斤 – 5.5米  TIME – clock times and time expressions – 8时 – 3点零5分 © 2013 SAP AG. All rights reserved. Public 10
  • 11. Additional Predefined Core Extractions for Russian TITLE PERSON PEOPLE LANGUAGE President Barak Obama Greeks Greek ADDRESS1 ADDRESS2 LOCALITY REGION@MINOR REGION@MAJOR COUNTRY CONTINENT GEO_FEATURE GEO_AREA 245 First Street Floor 16 Cambridge, MA 02142 Cambridge Napa Country Connecticut Brazil South America Mount Fuji Scandinavia ORGANIZATION@COMMERCIAL ORGANIZATION@EDUCATIONAL ORGANIZATION@OTHER PRODUCT TICKER AT&T University of Washington FBI iPhone NYSE:SAP SOCIAL_MEDIA@TWITTER_ID SOCIAL_MEDIA@TWITTER_TOPIC DATE DAY MONTH YEAR TIME TIME_PERIOD HOLIDAY 2/14/2011 Monday June 2011 3:47pm 3 days, from 9 to 5pm Memorial Day CURRENCY 17 euros MEASURE PERCENT 217 meters 4% PHONE URI@EMAIL URI@IP URI@URL 617-677-2030 john.smith@sap.com 165.14.2.0 http://sap.com Syntactic Entities: NOUN_GROUP PROP_MISC big umbrella Cup o’ Soup @SAP #HANA © 2013 SAP AG. All rights reserved. Public 11
  • 12. Improved Voice of Customer Extraction for Simplified Chinese The following major fact types are classified:      Sentiments: expression of a customer’s feelings about something Problems: a statement about something which impedes a customer’s work Requests: expression of a customer’s desire for an enhancement/change Profanity: defines a set of pejorative vocabulary Emoticons: expression of someone's feelings about the whole sentence or situation Focuses on finer extraction of online reviews and implementing customer feedback  Dramatic overall improvement in stances and topics  Recall and precision testing results jumped significantly higher © 2013 SAP AG. All rights reserved. Public 12
  • 13. Disclaimer This presentation outlines our general product direction and should not be relied on in making a purchase decision. This presentation is not subject to your license agreement or any other agreement with SAP. SAP has no obligation to pursue any course of business outlined in this presentation or to develop or release any functionality mentioned in this presentation. This presentation and SAP’s strategy and possible future developments are subject to change and may be changed by SAP at any time for any reason without notice. This document is provided without a warranty of any kind, either express or implied, including but not limited to, the implied warranties of merchantability, fitness for a particular purpose, or non-infringement. SAP assumes no responsibility for errors or omissions in this document, except if such damages were caused by SAP intentionally or grossly negligent. © 2013 SAP AG. All rights reserved. Public 13
  • 14. Thank you Contact information Anthony Waite SAP HANA Product Management AskSAPHANA@sap.com To get the best overview of what’s new in SAP HANA SPS 07, read this blog.
  • 15. © 2013 SAP AG. All rights reserved. No part of this publication may be reproduced or transmitted in any form or for any purpose without the express permission of SAP AG. The information contained herein may be changed without prior notice. Some software products marketed by SAP AG and its distributors contain proprietary software components of other software vendors. National product specifications may vary. These materials are provided by SAP AG and its affiliated companies ("SAP Group") for informational purposes only, without representation or warranty of any kind, and SAP Group shall not be liable for errors or omissions with respect to the materials. The only warranties for SAP Group products and services are those that are set forth in the express warranty statements accompanying such products and services, if any. Nothing herein should be construed as constituting an additional warranty. SAP and other SAP products and services mentioned herein as well as their respective logos are trademarks or registered trademarks of SAP AG in Germany and other countries. Please see http://www.sap.com/corporate-en/legal/copyright/index.epx#trademark for additional trademark information and notices. © 2013 SAP AG. All rights reserved. Public 15
  • 16. © 2013 SAP AG. Alle Rechte vorbehalten. Weitergabe und Vervielfältigung dieser Publikation oder von Teilen daraus sind, zu welchem Zweck und in welcher Form auch immer, ohne die ausdrückliche schriftliche Genehmigung durch SAP AG nicht gestattet. In dieser Publikation enthaltene Informationen können ohne vorherige Ankündigung geändert werden. Einige der von der SAP AG und ihren Distributoren vermarkteten Softwareprodukte enthalten proprietäre Softwarekomponenten anderer Softwareanbieter. Produkte können länderspezifische Unterschiede aufweisen. Die vorliegenden Unterlagen werden von der SAP AG und ihren Konzernunternehmen („SAP-Konzern“) bereitgestellt und dienen ausschließlich zu Informationszwecken. Der SAP-Konzern übernimmt keinerlei Haftung oder Gewährleistung für Fehler oder Unvollständigkeiten in dieser Publikation. Der SAP-Konzern steht lediglich für Produkte und Dienstleistungen nach der Maßgabe ein, die in der Vereinbarung über die jeweiligen Produkte und Dienstleistungen ausdrücklich geregelt ist. Keine der hierin enthaltenen Informationen ist als zusätzliche Garantie zu interpretieren. SAP und andere in diesem Dokument erwähnte Produkte und Dienstleistungen von SAP sowie die dazugehörigen Logos sind Marken oder eingetragene Marken der SAP AG in Deutschland und verschiedenen anderen Ländern weltweit. Weitere Hinweise und Informationen zum Markenrecht finden Sie unter http://www.sap.com/corporateen/legal/copyright/index.epx#trademark. © 2013 SAP AG. All rights reserved. Public 16