SlideShare a Scribd company logo
© 2016 IBM Corporation
Highlights of the Telecommunications
Event Data Analytics toolkit
IBM Streams Version 4.2
Paul Zollna
Senior Software Developer and TEDA Architect
For questions about this presentation contact Paul Zollna
paul.zollna@de.ibm.com
2 © 2016 IBM Corporation
Important Disclaimer
THE INFORMATION CONTAINED IN THIS PRESENTATION IS PROVIDED FOR INFORMATIONAL
PURPOSES ONLY.
WHILE EFFORTS WERE MADE TO VERIFY THE COMPLETENESS AND ACCURACY OF THE
INFORMATION CONTAINED IN THIS PRESENTATION, IT IS PROVIDED “AS IS”, WITHOUT WARRANTY
OF ANY KIND, EXPRESS OR IMPLIED.
IN ADDITION, THIS INFORMATION IS BASED ON IBM’S CURRENT PRODUCT PLANS AND STRATEGY,
WHICH ARE SUBJECT TO CHANGE BY IBM WITHOUT NOTICE.
IBM SHALL NOT BE RESPONSIBLE FOR ANY DAMAGES ARISING OUT OF THE USE OF, OR
OTHERWISE RELATED TO, THIS PRESENTATION OR ANY OTHER DOCUMENTATION.
NOTHING CONTAINED IN THIS PRESENTATION IS INTENDED TO, OR SHALL HAVE THE EFFECT OF:
• CREATING ANY WARRANTY OR REPRESENTATION FROM IBM (OR ITS AFFILIATES OR ITS OR
THEIR SUPPLIERS AND/OR LICENSORS); OR
• ALTERING THE TERMS AND CONDITIONS OF THE APPLICABLE LICENSE AGREEMENT
GOVERNING THE USE OF IBM SOFTWARE.
IBM’s statements regarding its plans, directions, and intent are subject to change or
withdrawal without notice at IBM’s sole discretion. Information regarding potential
future products is intended to outline our general product direction and it should not
be relied on in making a purchasing decision. The information mentioned regarding
potential future products is not a commitment, promise, or legal obligation to deliver
any material, code or functionality. Information about potential future products may
not be incorporated into any contract. The development, release, and timing of any
future features or functionality described for our products remains at our sole
discretion.
THIS INFORMATION IS BASED ON IBM’S CURRENT PRODUCT PLANS AND STRATEGY, WHICH ARE SUBJECT TO CHANGE BY IBM WITHOUT NOTICE.
IBM SHALL NOT BE RESPONSIBLE FOR ANY DAMAGES ARISING OUT OF THE USE OF, OR OTHERWISE RELATED TO, THIS PRESENTATION OR ANY OTHER DOCUMENTATION.
3 © 2016 IBM Corporation
Agenda
 Highlights of the com.ibm.streams.teda application framework (TEDA)
 What’s new about the operators and functions
 Tutorial & references
 DEMO: TEDA & the Secure Application Configuration
 DEMO: TEDA & Plug-In for External Applications
4 © 2016 IBM Corporation
Highlights of the application framework
 New Context Container composite operator
– The new <namespace>.context.custom::ContextContainer operator is introduced. It
allows to implement a multi-level context logic or contexts with different algorithms.
 Improved configuration of the Lookup Manager application
– Simplified XML description format
– More flexible referencing of database configuration
– The database source can delete lookup data now
– The database settings are configurable with Secure Application Configuration
 Integration of the partitioned BloomFilter feature
– The configuration, functions and output statements of the partitioned BloomFilter are fully
integrated in the ITE application framework
 Enhanced handling of CSV file with enrichment data
– Configurable handling of header lines, empty lines
– Configurable handling of quoted values of attributes
– Configurable separator, delimiter and end-of-line marker
 New shared memory segment naming
– The unique segment naming simplifies the host resource sharing
 New tuple data export plug-in interface
– An external applications can import the tuple data from the ITE application without side
effects on the performance of the ITE application
5 © 2016 IBM Corporation
What’s new about the operators and functions
 New DirectoryWatch operator
– The new DirectoryWatch operator adds watches to the system's inotify functionality to monitor
directories and report file changes using less CPU than the standard spl.adapter::DirectoryScan
operator
 Enhancement in the error reporting of the CSVParse operator
– The CSVParse operator provides new custom output functions to get error descriptions when parsing
fails
– The detailed fault information about position in the record is provided in case of a failure
 New functions in the com.ibm.streams.teda.file and com.ibm.streams.teda.file.path
namespaces
– symlink - creates symbolic links in the file system.
– space - determines the total, free, and available disk space capacity for a mounted file system.
– dirname - extracts the string from the provided path, which specifies the parent directory.
– filename - extracts the string from the provided path, which specifies the file name.
– stem - extracts the string from the provided path, which specifies the file name without the extension.
– extension - extracts the string from the provided path, which specifies the extension of the file name.
6 © 2016 IBM Corporation
Additional Resources
 Reference in IBM Knowledge Center - IBM Streams 4.2 
com.ibm.streams.teda:
– http://www.ibm.com/support/knowledgecenter/SSCRJU_4.2.0/com.ibm.streams.toolkits.d
oc/spldoc/dita/tk$com.ibm.streams.teda/tk$com.ibm.streams.teda.html
 TEDA Tutorial for versions 1.0.2 & 2.0.0:
– http://ibmstreams.github.io/streamsx.tutorial.teda
 A TEDA demoapp sample on https://demo.ibmcloud.com
– Available for demos on request
 An Introduction to Streaming Telecommunications Event Data Analytics:
– https://developer.ibm.com/streamsdev/docs/introduction-streaming-telecommunications-
event-data-analytics-teda
 Getting Started with Streaming Telecommunications Event Data Analytics:
– https://developer.ibm.com/streamsdev/docs/getting-started-streaming-
telecommunications-event-data-analytics-teda
7 © 2016 IBM Corporation
TEDA & Application Configuration
 The Secure Application Configuration feature
 The database as source that provides the enrichment data
 Demo: Lookup Manager application using Streams Console to configure
database credentials
8 © 2016 IBM Corporation
Secure Application Configuration
 Application specific set of properties in secure storage
 API implemented in SPL, C++ and Java
 Based on JMX communication
 Implemented in Streams Console
 JMX API to manage the Secure Application Configuration store on the
instance or domain level
9 © 2016 IBM Corporation
TEDA framework with database configuration
 Sensitive database credentials stored in secure storage
– The configuration files do not include credentials in plain text
• Lookup Manager configuration file for default settings: config.cfg
• database configuration file is not involved: connections.xml
 Changable configuration at runtime
– Update the password without cancellation and submission of the application job
10 © 2016 IBM Corporation
Demo
 Specify the database properties in the Secure Application Configuration
 Specify the Application Configuration in TEDA Framework
 Load the enrichment data form database with the Lookup Manager
application and process files in the ITE application
11 © 2016 IBM Corporation
Details
 Properties for Secure Application Configuration
– lm.db.name: DEMOAPP
– lm.db.user: db2inst1
– lm.db.password: <password>
 Configuration of the Lookup Manager application
– lm.applicationConfiguration=MyApplConfig
– lm.db.connectionName=DEMOAPP
– lm.db=on
– lm.file=off (optional)
12 © 2016 IBM Corporation
Additional Resources
 IBM Knowledge Center reference:
– http://www.ibm.com/support/knowledgecenter/SSCRJU_4.2.0/com.ibm.streams
.toolkits.doc/spldoc/dita/tk$com.ibm.streams.teda/tk$com.ibm.streams.teda$10
1.html
 TEDA Tutorial in Module 11:
– http://ibmstreams.github.io/streamsx.tutorial.teda/docs/2.0.0/Module-11
13 © 2016 IBM Corporation
Plug-In for TEDA & External Applications
 Plug-in interfaces in the ITE application
 Demo: Export of deduplication data to external application using plug-in
interface in ITE application
14 © 2016 IBM Corporation
TEDA toolkit Plug-in
 The TEDA application framework provides plug-in interfaces to export tuple
data to an external application
 One or more external applications can connect to 4 different plug-in points:
– Reader
– Transfomer
– Writer
– Dedup
 A weak performance of the external application does not affect the
performance of the ITE file processing
 None back pressure from external applications
 The connections are monitored by metrics
15 © 2016 IBM Corporation
Plug-in interfaces in the ITE application
Statistics
Control
IngestFiles
Context
ChainDirScan
FileType
Validator
ApplCtrl Scheduler
LogWriter
Dedup
Filename
Dedup
ChainProcessorReader
ChainSink
Chain
Control
ChainProcessorTransformer
PreFile
Reader
RejectFileWriter
File Writer
Validator
Business Logic / Transform / Enrich
Tuple
Group
Split
Taps
Post
Transformer
Tap
PostContext
Processor
Tap
Chain
Finalizer
(Files Mover)
Chain
Split
File Group
Split
Context Custom
FileReaderFileReader
Converter
ContextRestore Writer
PostContext Processor
Checkpoint
Control
Legend Custom optionalCustomCommon Common or Custom Variant CVariant B
writer
reader
transformerdedup
16 © 2016 IBM Corporation
Demo
 Specify the ITE configuration to export deduplication data
 Specify the parameter of the Export operator to import the data from ITE
application
 Connect a ‚fast‘ and ‚slow‘ importer to ITE application and compare the
performance of both jobs
17 © 2016 IBM Corporation
Plug-in interface in ITE application framework
 The ITE application framework provides 4 plug-in configurations
– The ITE application provides 4 unique export properties
– The <namespace>.streams::TypesCommon composite provides exported
stream schema specification for each plug-in configuration
 New congestionPolicy parameter in spl.adapter::Export operator
– Specifies the congestion policy of the stream that is exported
– Applicable values:
• dropConnection
 The connection is dropped when a downstream importer is not keeping up.
 A nBrokenConnections metric indicates the connection drop count at the output port
• wait
 The output port causes back pressure when congested
Value Export property Exported SPL Schema
reader ite="<namespace>.chainprocessor.reader_output_RecordValidator" TypesCommon.ReaderOutStreamType
transformer ite="<namespace>.chainprocessor.transformer_output_DataProcessor" TypesCommon.TransformerOutType
writer ite="<namespace>.chainsink_input_Writer" TypesCommon.ChainSinkStreamType
dedup ite="<namespace>.context_output_Dedup" TypesCommon.TransformerOutType
18 © 2016 IBM Corporation
Details
 Importer settings
– Import of ITE stream schema types
• use demoapp.streams::*;
– Output stream type of the Importer
• stream<TypesCommon.TransformerOutType> In = Import()
– Set subscription
• param subscription : ite=="demoapp.context_output_Dedup";
 Configuration of the ITE application
– Specify the list of exporters, here Dedup only
• ite.export.streams=dedup
19 © 2016 IBM Corporation
Additional Resources
 IBM Knowledge Center reference
– Description of the ite.export.streams configuration parameter:
• http://www.ibm.com/support/knowledgecenter/SSCRJU_4.2.0/com.ibm.streams.toolki
ts.doc/spldoc/dita/tk$com.ibm.streams.teda/tk$com.ibm.streams.teda$184.html
– Description of the congestionPolicy parameter in the spl.adapter::Export operator:
• http://www.ibm.com/support/knowledgecenter/en/SSCRJU_4.2.0/com.ibm.streams.toolkits.doc/spl
doc/dita/tk$spl/op$spl.adapter$Export.html#spldoc_operator__parameter__congestionPolicy
 TEDA Tutorial in Module 12:
– http://ibmstreams.github.io/streamsx.tutorial.teda/docs/2.0.0/Module-12
20 © 2016 IBM Corporation
Thank YOU!!!

More Related Content

PDF
What's New in IBM Streams V4.2
PDF
IBM Streams IoT Integration
PDF
IBM Streams V4.2 Submission Time Fusion and Configuration
PDF
Non-Blocking Checkpointing for Consistent Regions in IBM Streams V4.2.
PDF
Streaming Analytics for Bluemix Enhancements
PDF
Toolkits Overview for IBM Streams V4.2
PDF
IBM ODM Rules Compiler support in IBM Streams V4.2.
PDF
IBM Streams Getting Started Resources
What's New in IBM Streams V4.2
IBM Streams IoT Integration
IBM Streams V4.2 Submission Time Fusion and Configuration
Non-Blocking Checkpointing for Consistent Regions in IBM Streams V4.2.
Streaming Analytics for Bluemix Enhancements
Toolkits Overview for IBM Streams V4.2
IBM ODM Rules Compiler support in IBM Streams V4.2.
IBM Streams Getting Started Resources

What's hot (20)

PDF
Installation and Setup for IBM InfoSphere Streams V4.0
PDF
Security Authentication and Authorization Service (AAS) for IBM InfoSphere St...
PDF
Consistent Regions in Specialized Toolkits for IBM InfoSphere Streams V4.0
PDF
IBM Streams V4.1 and Incremental Checkpointing
PPTX
3158 - Cloud Infrastructure & It Optimization - Application Performance Manag...
PDF
IBM Streams V4.1 Integration with IBM Platform Symphony
PPTX
2397 The MQ Appliance as a messaging in a box and MQ MFT hub solution
PDF
IBM Streams V4.1 and JAAS Login Module Support
PDF
IBM Streams V4.1 REST API Support for Cross-Origin Resource Sharing (CORS)
PDF
IBM Streams V4.1 and User Authentication with Client Certificates
PPT
IBM PureSystems
PDF
DevOps & Continuous Test for IIB and IBM MQ
PPTX
S107 5 compelling reasons for using cics in the cloud
PPTX
Ibm PureApplication system
PPTX
Siebel Upgrade Best Practices &amp; Processes V2
PDF
Github Projects Overview and IBM Streams V4.1
PDF
IBM Notes Domino & Verse Update (english version)
PDF
Microsoft Sharepoint07 Arng
PPTX
IBM OMEGAMON Performance Management Suite - Long Presentation
PPTX
F5 Networks BIG-IP LTM Virtual Edition
Installation and Setup for IBM InfoSphere Streams V4.0
Security Authentication and Authorization Service (AAS) for IBM InfoSphere St...
Consistent Regions in Specialized Toolkits for IBM InfoSphere Streams V4.0
IBM Streams V4.1 and Incremental Checkpointing
3158 - Cloud Infrastructure & It Optimization - Application Performance Manag...
IBM Streams V4.1 Integration with IBM Platform Symphony
2397 The MQ Appliance as a messaging in a box and MQ MFT hub solution
IBM Streams V4.1 and JAAS Login Module Support
IBM Streams V4.1 REST API Support for Cross-Origin Resource Sharing (CORS)
IBM Streams V4.1 and User Authentication with Client Certificates
IBM PureSystems
DevOps & Continuous Test for IIB and IBM MQ
S107 5 compelling reasons for using cics in the cloud
Ibm PureApplication system
Siebel Upgrade Best Practices &amp; Processes V2
Github Projects Overview and IBM Streams V4.1
IBM Notes Domino & Verse Update (english version)
Microsoft Sharepoint07 Arng
IBM OMEGAMON Performance Management Suite - Long Presentation
F5 Networks BIG-IP LTM Virtual Edition
Ad

Similar to Highlights of the Telecommunications Event Data Analytics toolkit (20)

PDF
What's New in Toolkits for IBM Streams V4.1
PPTX
Skillwise Big Data part 2
PPTX
Skilwise Big data
PDF
Streams GitHub Products Overview for IBM InfoSphere Streams V4.0
PDF
Webinar Data Mesh - Part 3
PPTX
Big data architectures and the data lake
PDF
Flink Forward San Francisco 2019: Building Financial Identity Platform using ...
PDF
IBM Streams - Redbook
PDF
Streaming analytics
PDF
Fundamentals Big Data and AI Architecture
PPT
Real-Time Analytics for Industries
PPTX
Real time data integration best practices and architecture
PDF
Webinar future dataintegration-datamesh-and-goldengatekafka
PPTX
Hadoop Summit 2017 Enterprise Graph Analytics
PPTX
IBM Industry Models and Data Lake
PDF
Relocatable Application Bundles for IBM InfoSphere Streams V4.0
PDF
TIAD 2016 : Continuous Integration mesured and controlled
PPTX
Hadoop summit 2017 enterprise graph analytics
PPTX
Big data analyti data analytical life cycle
PPTX
Event Driven Architecture
What's New in Toolkits for IBM Streams V4.1
Skillwise Big Data part 2
Skilwise Big data
Streams GitHub Products Overview for IBM InfoSphere Streams V4.0
Webinar Data Mesh - Part 3
Big data architectures and the data lake
Flink Forward San Francisco 2019: Building Financial Identity Platform using ...
IBM Streams - Redbook
Streaming analytics
Fundamentals Big Data and AI Architecture
Real-Time Analytics for Industries
Real time data integration best practices and architecture
Webinar future dataintegration-datamesh-and-goldengatekafka
Hadoop Summit 2017 Enterprise Graph Analytics
IBM Industry Models and Data Lake
Relocatable Application Bundles for IBM InfoSphere Streams V4.0
TIAD 2016 : Continuous Integration mesured and controlled
Hadoop summit 2017 enterprise graph analytics
Big data analyti data analytical life cycle
Event Driven Architecture
Ad

More from lisanl (14)

PDF
What's New Overview for IBM Streams V4.3
PDF
SPL Event-Time Processing in IBM Streams V4.3
PDF
Option Data Types in IBM Streams V4.3
PDF
Dynamic and Elastic Scaling in IBM Streams V4.3
PDF
Data Governance with IBM Streams V4.1
PDF
Introduction to IBM Platform Symphony Integration with IBM Streams V4.1
PDF
What's New in IBM Streams V4.1
PDF
What's New in the Streams Console in IBM Streams V4.1
PDF
Expanding beyond SPL -- More language support in IBM Streams V4.1
PDF
Introduction to the Spark MLLib Toolkit in IBM Streams V4.1
PDF
An Overview of IBM Streaming Analytics for Bluemix
PDF
ZooKeeper and Embedded ZooKeeper Support for IBM InfoSphere Streams V4.0
PDF
What's New in the Timeseries Toolkit for IBM InfoSphere Streams V4.0
PDF
Streams Studio Support for IBM InfoSphere Streams V4.0
What's New Overview for IBM Streams V4.3
SPL Event-Time Processing in IBM Streams V4.3
Option Data Types in IBM Streams V4.3
Dynamic and Elastic Scaling in IBM Streams V4.3
Data Governance with IBM Streams V4.1
Introduction to IBM Platform Symphony Integration with IBM Streams V4.1
What's New in IBM Streams V4.1
What's New in the Streams Console in IBM Streams V4.1
Expanding beyond SPL -- More language support in IBM Streams V4.1
Introduction to the Spark MLLib Toolkit in IBM Streams V4.1
An Overview of IBM Streaming Analytics for Bluemix
ZooKeeper and Embedded ZooKeeper Support for IBM InfoSphere Streams V4.0
What's New in the Timeseries Toolkit for IBM InfoSphere Streams V4.0
Streams Studio Support for IBM InfoSphere Streams V4.0

Recently uploaded (20)

PPTX
Supervised vs unsupervised machine learning algorithms
PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
PDF
Mega Projects Data Mega Projects Data
PPT
Reliability_Chapter_ presentation 1221.5784
PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PPTX
Introduction to machine learning and Linear Models
PPTX
1_Introduction to advance data techniques.pptx
PPTX
STUDY DESIGN details- Lt Col Maksud (21).pptx
PDF
Foundation of Data Science unit number two notes
PPTX
Business Ppt On Nestle.pptx huunnnhhgfvu
PPTX
climate analysis of Dhaka ,Banglades.pptx
PDF
.pdf is not working space design for the following data for the following dat...
PDF
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
PPTX
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
PDF
Lecture1 pattern recognition............
PDF
annual-report-2024-2025 original latest.
PDF
Business Analytics and business intelligence.pdf
PDF
Clinical guidelines as a resource for EBP(1).pdf
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PDF
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
Supervised vs unsupervised machine learning algorithms
IBA_Chapter_11_Slides_Final_Accessible.pptx
Mega Projects Data Mega Projects Data
Reliability_Chapter_ presentation 1221.5784
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
Introduction to machine learning and Linear Models
1_Introduction to advance data techniques.pptx
STUDY DESIGN details- Lt Col Maksud (21).pptx
Foundation of Data Science unit number two notes
Business Ppt On Nestle.pptx huunnnhhgfvu
climate analysis of Dhaka ,Banglades.pptx
.pdf is not working space design for the following data for the following dat...
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
Lecture1 pattern recognition............
annual-report-2024-2025 original latest.
Business Analytics and business intelligence.pdf
Clinical guidelines as a resource for EBP(1).pdf
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...

Highlights of the Telecommunications Event Data Analytics toolkit

  • 1. © 2016 IBM Corporation Highlights of the Telecommunications Event Data Analytics toolkit IBM Streams Version 4.2 Paul Zollna Senior Software Developer and TEDA Architect For questions about this presentation contact Paul Zollna paul.zollna@de.ibm.com
  • 2. 2 © 2016 IBM Corporation Important Disclaimer THE INFORMATION CONTAINED IN THIS PRESENTATION IS PROVIDED FOR INFORMATIONAL PURPOSES ONLY. WHILE EFFORTS WERE MADE TO VERIFY THE COMPLETENESS AND ACCURACY OF THE INFORMATION CONTAINED IN THIS PRESENTATION, IT IS PROVIDED “AS IS”, WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED. IN ADDITION, THIS INFORMATION IS BASED ON IBM’S CURRENT PRODUCT PLANS AND STRATEGY, WHICH ARE SUBJECT TO CHANGE BY IBM WITHOUT NOTICE. IBM SHALL NOT BE RESPONSIBLE FOR ANY DAMAGES ARISING OUT OF THE USE OF, OR OTHERWISE RELATED TO, THIS PRESENTATION OR ANY OTHER DOCUMENTATION. NOTHING CONTAINED IN THIS PRESENTATION IS INTENDED TO, OR SHALL HAVE THE EFFECT OF: • CREATING ANY WARRANTY OR REPRESENTATION FROM IBM (OR ITS AFFILIATES OR ITS OR THEIR SUPPLIERS AND/OR LICENSORS); OR • ALTERING THE TERMS AND CONDITIONS OF THE APPLICABLE LICENSE AGREEMENT GOVERNING THE USE OF IBM SOFTWARE. IBM’s statements regarding its plans, directions, and intent are subject to change or withdrawal without notice at IBM’s sole discretion. Information regarding potential future products is intended to outline our general product direction and it should not be relied on in making a purchasing decision. The information mentioned regarding potential future products is not a commitment, promise, or legal obligation to deliver any material, code or functionality. Information about potential future products may not be incorporated into any contract. The development, release, and timing of any future features or functionality described for our products remains at our sole discretion. THIS INFORMATION IS BASED ON IBM’S CURRENT PRODUCT PLANS AND STRATEGY, WHICH ARE SUBJECT TO CHANGE BY IBM WITHOUT NOTICE. IBM SHALL NOT BE RESPONSIBLE FOR ANY DAMAGES ARISING OUT OF THE USE OF, OR OTHERWISE RELATED TO, THIS PRESENTATION OR ANY OTHER DOCUMENTATION.
  • 3. 3 © 2016 IBM Corporation Agenda  Highlights of the com.ibm.streams.teda application framework (TEDA)  What’s new about the operators and functions  Tutorial & references  DEMO: TEDA & the Secure Application Configuration  DEMO: TEDA & Plug-In for External Applications
  • 4. 4 © 2016 IBM Corporation Highlights of the application framework  New Context Container composite operator – The new <namespace>.context.custom::ContextContainer operator is introduced. It allows to implement a multi-level context logic or contexts with different algorithms.  Improved configuration of the Lookup Manager application – Simplified XML description format – More flexible referencing of database configuration – The database source can delete lookup data now – The database settings are configurable with Secure Application Configuration  Integration of the partitioned BloomFilter feature – The configuration, functions and output statements of the partitioned BloomFilter are fully integrated in the ITE application framework  Enhanced handling of CSV file with enrichment data – Configurable handling of header lines, empty lines – Configurable handling of quoted values of attributes – Configurable separator, delimiter and end-of-line marker  New shared memory segment naming – The unique segment naming simplifies the host resource sharing  New tuple data export plug-in interface – An external applications can import the tuple data from the ITE application without side effects on the performance of the ITE application
  • 5. 5 © 2016 IBM Corporation What’s new about the operators and functions  New DirectoryWatch operator – The new DirectoryWatch operator adds watches to the system's inotify functionality to monitor directories and report file changes using less CPU than the standard spl.adapter::DirectoryScan operator  Enhancement in the error reporting of the CSVParse operator – The CSVParse operator provides new custom output functions to get error descriptions when parsing fails – The detailed fault information about position in the record is provided in case of a failure  New functions in the com.ibm.streams.teda.file and com.ibm.streams.teda.file.path namespaces – symlink - creates symbolic links in the file system. – space - determines the total, free, and available disk space capacity for a mounted file system. – dirname - extracts the string from the provided path, which specifies the parent directory. – filename - extracts the string from the provided path, which specifies the file name. – stem - extracts the string from the provided path, which specifies the file name without the extension. – extension - extracts the string from the provided path, which specifies the extension of the file name.
  • 6. 6 © 2016 IBM Corporation Additional Resources  Reference in IBM Knowledge Center - IBM Streams 4.2  com.ibm.streams.teda: – http://www.ibm.com/support/knowledgecenter/SSCRJU_4.2.0/com.ibm.streams.toolkits.d oc/spldoc/dita/tk$com.ibm.streams.teda/tk$com.ibm.streams.teda.html  TEDA Tutorial for versions 1.0.2 & 2.0.0: – http://ibmstreams.github.io/streamsx.tutorial.teda  A TEDA demoapp sample on https://demo.ibmcloud.com – Available for demos on request  An Introduction to Streaming Telecommunications Event Data Analytics: – https://developer.ibm.com/streamsdev/docs/introduction-streaming-telecommunications- event-data-analytics-teda  Getting Started with Streaming Telecommunications Event Data Analytics: – https://developer.ibm.com/streamsdev/docs/getting-started-streaming- telecommunications-event-data-analytics-teda
  • 7. 7 © 2016 IBM Corporation TEDA & Application Configuration  The Secure Application Configuration feature  The database as source that provides the enrichment data  Demo: Lookup Manager application using Streams Console to configure database credentials
  • 8. 8 © 2016 IBM Corporation Secure Application Configuration  Application specific set of properties in secure storage  API implemented in SPL, C++ and Java  Based on JMX communication  Implemented in Streams Console  JMX API to manage the Secure Application Configuration store on the instance or domain level
  • 9. 9 © 2016 IBM Corporation TEDA framework with database configuration  Sensitive database credentials stored in secure storage – The configuration files do not include credentials in plain text • Lookup Manager configuration file for default settings: config.cfg • database configuration file is not involved: connections.xml  Changable configuration at runtime – Update the password without cancellation and submission of the application job
  • 10. 10 © 2016 IBM Corporation Demo  Specify the database properties in the Secure Application Configuration  Specify the Application Configuration in TEDA Framework  Load the enrichment data form database with the Lookup Manager application and process files in the ITE application
  • 11. 11 © 2016 IBM Corporation Details  Properties for Secure Application Configuration – lm.db.name: DEMOAPP – lm.db.user: db2inst1 – lm.db.password: <password>  Configuration of the Lookup Manager application – lm.applicationConfiguration=MyApplConfig – lm.db.connectionName=DEMOAPP – lm.db=on – lm.file=off (optional)
  • 12. 12 © 2016 IBM Corporation Additional Resources  IBM Knowledge Center reference: – http://www.ibm.com/support/knowledgecenter/SSCRJU_4.2.0/com.ibm.streams .toolkits.doc/spldoc/dita/tk$com.ibm.streams.teda/tk$com.ibm.streams.teda$10 1.html  TEDA Tutorial in Module 11: – http://ibmstreams.github.io/streamsx.tutorial.teda/docs/2.0.0/Module-11
  • 13. 13 © 2016 IBM Corporation Plug-In for TEDA & External Applications  Plug-in interfaces in the ITE application  Demo: Export of deduplication data to external application using plug-in interface in ITE application
  • 14. 14 © 2016 IBM Corporation TEDA toolkit Plug-in  The TEDA application framework provides plug-in interfaces to export tuple data to an external application  One or more external applications can connect to 4 different plug-in points: – Reader – Transfomer – Writer – Dedup  A weak performance of the external application does not affect the performance of the ITE file processing  None back pressure from external applications  The connections are monitored by metrics
  • 15. 15 © 2016 IBM Corporation Plug-in interfaces in the ITE application Statistics Control IngestFiles Context ChainDirScan FileType Validator ApplCtrl Scheduler LogWriter Dedup Filename Dedup ChainProcessorReader ChainSink Chain Control ChainProcessorTransformer PreFile Reader RejectFileWriter File Writer Validator Business Logic / Transform / Enrich Tuple Group Split Taps Post Transformer Tap PostContext Processor Tap Chain Finalizer (Files Mover) Chain Split File Group Split Context Custom FileReaderFileReader Converter ContextRestore Writer PostContext Processor Checkpoint Control Legend Custom optionalCustomCommon Common or Custom Variant CVariant B writer reader transformerdedup
  • 16. 16 © 2016 IBM Corporation Demo  Specify the ITE configuration to export deduplication data  Specify the parameter of the Export operator to import the data from ITE application  Connect a ‚fast‘ and ‚slow‘ importer to ITE application and compare the performance of both jobs
  • 17. 17 © 2016 IBM Corporation Plug-in interface in ITE application framework  The ITE application framework provides 4 plug-in configurations – The ITE application provides 4 unique export properties – The <namespace>.streams::TypesCommon composite provides exported stream schema specification for each plug-in configuration  New congestionPolicy parameter in spl.adapter::Export operator – Specifies the congestion policy of the stream that is exported – Applicable values: • dropConnection  The connection is dropped when a downstream importer is not keeping up.  A nBrokenConnections metric indicates the connection drop count at the output port • wait  The output port causes back pressure when congested Value Export property Exported SPL Schema reader ite="<namespace>.chainprocessor.reader_output_RecordValidator" TypesCommon.ReaderOutStreamType transformer ite="<namespace>.chainprocessor.transformer_output_DataProcessor" TypesCommon.TransformerOutType writer ite="<namespace>.chainsink_input_Writer" TypesCommon.ChainSinkStreamType dedup ite="<namespace>.context_output_Dedup" TypesCommon.TransformerOutType
  • 18. 18 © 2016 IBM Corporation Details  Importer settings – Import of ITE stream schema types • use demoapp.streams::*; – Output stream type of the Importer • stream<TypesCommon.TransformerOutType> In = Import() – Set subscription • param subscription : ite=="demoapp.context_output_Dedup";  Configuration of the ITE application – Specify the list of exporters, here Dedup only • ite.export.streams=dedup
  • 19. 19 © 2016 IBM Corporation Additional Resources  IBM Knowledge Center reference – Description of the ite.export.streams configuration parameter: • http://www.ibm.com/support/knowledgecenter/SSCRJU_4.2.0/com.ibm.streams.toolki ts.doc/spldoc/dita/tk$com.ibm.streams.teda/tk$com.ibm.streams.teda$184.html – Description of the congestionPolicy parameter in the spl.adapter::Export operator: • http://www.ibm.com/support/knowledgecenter/en/SSCRJU_4.2.0/com.ibm.streams.toolkits.doc/spl doc/dita/tk$spl/op$spl.adapter$Export.html#spldoc_operator__parameter__congestionPolicy  TEDA Tutorial in Module 12: – http://ibmstreams.github.io/streamsx.tutorial.teda/docs/2.0.0/Module-12
  • 20. 20 © 2016 IBM Corporation Thank YOU!!!