SlideShare a Scribd company logo
Logging at scale
Doing more with less
Presented by Andre Fucs de Miranda
Macquarie GovernmentMacquarie Government
1. A bit about me
2. A bit of context
3. Apache NiFi and the SOC
4. Demo
5. Questions
The agenda.
| Logging at scale – Doing more with less
2
Macquarie Government
A bit
about me.
| Logging at scale – Doing more with less
Macquarie Government
Manager @ Macquarie’s Security
Operations Center
20 years working in information cyber
security
Apache NiFi committer and PMC
member
| Logging at scale – Doing more with less
https://github.com/trixpan
https://twitter.com/trixpan
About me
Macquarie Government
A bit
of context
| Logging at scale – Doing more with less
Macquarie Government
A bit of context
About Macquarie Government
• 42% of Australian Government agencies are our customers
• 3+ billion events per day;
Our tool stack is diverse and busy:
• We generate TBs of data per day.
• Since 2015 we have been using “Big Data” (i.e. Hadoop ecosystem) for reporting and
analytics.
• We are constantly looking for ways to offer our customers with better insights over the
threats targeting them.
• We also felt that relying exclusively on traditional SIEM wasn’t enough anymore.
| Logging at scale – Doing more with less
6
Macquarie Government
A bit of context
Could we leverage “big data” solutions to improve our SOC further?
• Perhaps we could rationalise the way we collect and process log messages?
• Perhaps we could do enrichment against a more diverse set of sources??
• What else?
| Logging at scale – Doing more with less
7
Macquarie Government
So we went and evaluated lots of
tools and architectures looking to
map things like:
• Ability to integrate with SIEM pipelines
natively
• Ability to consume cloud services (IaaS,
PaaS and Saas)
• Ability to query odd stuff
• Inbuilt Security
• Ability to Scale out
• How easy to maintain and extend
| Logging at scale – Doing more with less
8
and many more…
A bit of context
All Apache project logos are trademarks of the ASF and the respective projects.
Logstash is a trademark of Elasticsearch BV, registered in the U.S. and in other countries.
fluentd is trademark by Treasure Data
And the winner was…
Macquarie Government
Sorry, there was a mistake…
| Logging at scale – Doing more with less
10
All Apache project logos are trademarks of the ASF and the respective projects.
Macquarie Government
Let’s talk tech.
| Logging at scale – Doing more with less
Apache NiFi and the SOC
Macquarie Government
A bit about Apache NiFi – A brief Prologue
When you are start shipping “data” seems like an “easy” task
| Logging at scale – Doing more with less
DC1
DB
12
Macquarie Government
A bit about Apache NiFi – A brief Prologue
But as the environment grows, complexity compounds…
…but you keep adjusting your environment
| Logging at scale – Doing more with less
DC1
DB
DC2
DB
HQ
ClientX
AZ1
AZ2
13
Macquarie Government
‘til the point you suddenly
realise your pipeline is
missing a bit of cheese.
Or worse…
| Logging at scale – Doing more with less
© Luca Nebuloni
https://www.flickr.com/photos/nebulux/10708289086/
14
Macquarie Government
A bit about Apache NiFi – A brief Prologue
| Logging at scale – Doing more with less
Source: https://goo.gl/xKoavI
15
Macquarie Government
“Apache NiFi supports powerful and
scalable directed graphs of data
routing, transformation, and system
mediation logic.”
Open sourced by the National
Security Agency in 2014[1] and
submitted to The Apache Software
Foundation for on-going stewardship
[1] https://goo.gl/aZxCIC
| Logging at scale – Doing more with less
• User friendly interface
• Flexible
• Data Agnostic
• Inbuilt mechanisms to balance
between latency and throughput
• Fine grain control of delivery
guarantees (e.g. discard a flowfile once
it becomes too old to be relevant).
• “Secure”
• Data provenance (from where, to
where, changed by, etc.)
• Authorization Policies, TLS, Kerberos,
Encryption and a handful of other
features
• Designed for Extension
A bit about Apache NiFi16
Macquarie Government
A bit about Apache NiFi
NiFi allows you easily move data between A and B (and B to A) in a controlled,
secure and reliable way, while still allowing you to process and granularly
apply logic to the data in motion.
| Logging at scale – Doing more with less
17
Macquarie Government
A bit about Apache NiFi
A few examples on how NiFi capabilities help a SOC:
• Rationalising the flows of data into your SIEM
• Do you truly need your SIEM to be ingesting all your logs?
• What happens when you run more than one SIEM (because it may well happen…)?
• Enrich data against a diverse range of sources
• ElasticSearch, REST APIs, DNS, Redis, Whois, GeoIP, SQL, MISP (via HTTP)
• (Pull|push) data (from|to) a diverse set of platforms
• Object based stores such as GCS or S3, FTP, SFTP, Mainframes via WebSphere MQ, Files,
SQL and Syslog of course.
| Logging at scale – Doing more with less
18
Macquarie Government
Let’s take a
closer look.
| Logging at scale – Doing more with less
Macquarie Government | Logging at scale – Doing more with less
DEMO Time
Macquarie Government
Let’s talk.
| Logging at scale – Doing more with less
Andre Fucs de Miranda
Macquarie Government
amiranda@macquariegovernment.com
1800 004 943
Thank you.

More Related Content

PDF
Big Data security: Facing the challenge by Carlos Gómez at Big Data Spain 2017
PPTX
Managing the Dewey Decimal System
PDF
Keeping your Enterprise’s Big Data Secure by Owen O’Malley at Big Data Spain ...
PDF
Elastic at KPN
PDF
Turning Evidence into Insights: How NCIS Leverages Elastic
PDF
Achieving cyber mission assurance with near real-time impact
PPTX
Using Hadoop to Drive Down Fraud for Telcos
PDF
Security Events Logging at Bell with the Elastic Stack
Big Data security: Facing the challenge by Carlos Gómez at Big Data Spain 2017
Managing the Dewey Decimal System
Keeping your Enterprise’s Big Data Secure by Owen O’Malley at Big Data Spain ...
Elastic at KPN
Turning Evidence into Insights: How NCIS Leverages Elastic
Achieving cyber mission assurance with near real-time impact
Using Hadoop to Drive Down Fraud for Telcos
Security Events Logging at Bell with the Elastic Stack

What's hot (20)

PDF
Siscale Lightning Talk: Automated Root Cause Analysis with Elastic Stack
PDF
Liferay cloud services lnlug-6-march-2014
PDF
CSX: Real-time Business Discovery with the Elastic Stack
PPTX
Ran Rothschild - CloudZone
PDF
PLNOG 13: B. van der Sloot, S. Abdel-Hafez: Running a 2 Tbps global IP networ...
PDF
Elastic on a Hyper-Converged Infrastructure for Operational Log Analytics
PDF
Countering Threats with the Elastic Stack at CERDEC/ARL
PDF
Keynote
PPTX
Risk Management for Data: Secured and Governed
PDF
Improving search at Wellcome Collection
PDF
MongoDB World 2019: A MongoDB Journey: Moving From a Relational Database to M...
PDF
Blockchain and Apache NiFi
PPTX
Strengthening critical internet infrastructure
PDF
Privacera and Northwestern Mutual - Scaling Privacy in a Spark Ecosystem
PDF
ProdSec: A Technical Approach
PPTX
Insight into Hyperconverged Infrastructure
PDF
Timothy Spann [StreamNative] | Using FLaNK with InfluxDB for EdgeAI IoT at Sc...
PDF
Managing Big Data projects in a constantly changing environment - Rafał Zalew...
PPTX
Data Governance and Management in Cloud pak nam
PDF
Divide & Conquer - Logging Architecture in Distributed Ecosystems with Elasti...
Siscale Lightning Talk: Automated Root Cause Analysis with Elastic Stack
Liferay cloud services lnlug-6-march-2014
CSX: Real-time Business Discovery with the Elastic Stack
Ran Rothschild - CloudZone
PLNOG 13: B. van der Sloot, S. Abdel-Hafez: Running a 2 Tbps global IP networ...
Elastic on a Hyper-Converged Infrastructure for Operational Log Analytics
Countering Threats with the Elastic Stack at CERDEC/ARL
Keynote
Risk Management for Data: Secured and Governed
Improving search at Wellcome Collection
MongoDB World 2019: A MongoDB Journey: Moving From a Relational Database to M...
Blockchain and Apache NiFi
Strengthening critical internet infrastructure
Privacera and Northwestern Mutual - Scaling Privacy in a Spark Ecosystem
ProdSec: A Technical Approach
Insight into Hyperconverged Infrastructure
Timothy Spann [StreamNative] | Using FLaNK with InfluxDB for EdgeAI IoT at Sc...
Managing Big Data projects in a constantly changing environment - Rafał Zalew...
Data Governance and Management in Cloud pak nam
Divide & Conquer - Logging Architecture in Distributed Ecosystems with Elasti...
Ad

Similar to Logging at scale: doing more with less (20)

PPTX
Integração de Dados com Apache NIFI - Marco Garcia Cetax
PPTX
Getting Started with Splunk Breakout Session
PPTX
Getting Started with Splunk Breakout Session
PDF
Ankus, bigdata deployment and orchestration framework
PDF
Building a Lightweight Discovery Interface for Chinese Patents, Presented by ...
PPTX
5 Things that Make Hadoop a Game Changer
PDF
Leaving the Ivory Tower: Research in the Real World
PDF
Leaving the Ivory Tower: Research in the Real World
PDF
The Great Lakes: How to Approach a Big Data Implementation
PPTX
Removing dependencies between services: Messaging and Apache Kafka
PDF
Big Data made easy in the era of the Cloud - Demi Ben-Ari
PPTX
Benchmark Showdown: Which Relational Database is the Fastest on AWS?
PDF
Getting Started with Splunk Enterprise
PPTX
How to Build Continuous Ingestion for the Internet of Things
PDF
Elastic Data Analytics Platform @Datadog
PDF
A Big Data Lake Based on Spark for BBVA Bank-(Oscar Mendez, STRATIO)
PDF
Searching Chinese Patents Presentation at Enterprise Data World
PDF
Ask bigger questions
PDF
Online Meetup #3 - Solo.io, Tidepool, Weaveworks, Buoyant
PPTX
David Knox: How do we Protect our Systems and Meet Compliance in a Rapidly Ch...
Integração de Dados com Apache NIFI - Marco Garcia Cetax
Getting Started with Splunk Breakout Session
Getting Started with Splunk Breakout Session
Ankus, bigdata deployment and orchestration framework
Building a Lightweight Discovery Interface for Chinese Patents, Presented by ...
5 Things that Make Hadoop a Game Changer
Leaving the Ivory Tower: Research in the Real World
Leaving the Ivory Tower: Research in the Real World
The Great Lakes: How to Approach a Big Data Implementation
Removing dependencies between services: Messaging and Apache Kafka
Big Data made easy in the era of the Cloud - Demi Ben-Ari
Benchmark Showdown: Which Relational Database is the Fastest on AWS?
Getting Started with Splunk Enterprise
How to Build Continuous Ingestion for the Internet of Things
Elastic Data Analytics Platform @Datadog
A Big Data Lake Based on Spark for BBVA Bank-(Oscar Mendez, STRATIO)
Searching Chinese Patents Presentation at Enterprise Data World
Ask bigger questions
Online Meetup #3 - Solo.io, Tidepool, Weaveworks, Buoyant
David Knox: How do we Protect our Systems and Meet Compliance in a Rapidly Ch...
Ad

Recently uploaded (20)

PDF
Assigned Numbers - 2025 - Bluetooth® Document
PDF
Getting Started with Data Integration: FME Form 101
PDF
Zenith AI: Advanced Artificial Intelligence
PPTX
SOPHOS-XG Firewall Administrator PPT.pptx
PPTX
Tartificialntelligence_presentation.pptx
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
A novel scalable deep ensemble learning framework for big data classification...
PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
Hindi spoken digit analysis for native and non-native speakers
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
DP Operators-handbook-extract for the Mautical Institute
PDF
DASA ADMISSION 2024_FirstRound_FirstRank_LastRank.pdf
PDF
Heart disease approach using modified random forest and particle swarm optimi...
PDF
1 - Historical Antecedents, Social Consideration.pdf
PDF
Encapsulation theory and applications.pdf
PPTX
cloud_computing_Infrastucture_as_cloud_p
PDF
Enhancing emotion recognition model for a student engagement use case through...
PPTX
TLE Review Electricity (Electricity).pptx
PDF
Approach and Philosophy of On baking technology
PDF
NewMind AI Weekly Chronicles - August'25-Week II
Assigned Numbers - 2025 - Bluetooth® Document
Getting Started with Data Integration: FME Form 101
Zenith AI: Advanced Artificial Intelligence
SOPHOS-XG Firewall Administrator PPT.pptx
Tartificialntelligence_presentation.pptx
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
A novel scalable deep ensemble learning framework for big data classification...
Programs and apps: productivity, graphics, security and other tools
Hindi spoken digit analysis for native and non-native speakers
Agricultural_Statistics_at_a_Glance_2022_0.pdf
DP Operators-handbook-extract for the Mautical Institute
DASA ADMISSION 2024_FirstRound_FirstRank_LastRank.pdf
Heart disease approach using modified random forest and particle swarm optimi...
1 - Historical Antecedents, Social Consideration.pdf
Encapsulation theory and applications.pdf
cloud_computing_Infrastucture_as_cloud_p
Enhancing emotion recognition model for a student engagement use case through...
TLE Review Electricity (Electricity).pptx
Approach and Philosophy of On baking technology
NewMind AI Weekly Chronicles - August'25-Week II

Logging at scale: doing more with less

  • 1. Logging at scale Doing more with less Presented by Andre Fucs de Miranda
  • 2. Macquarie GovernmentMacquarie Government 1. A bit about me 2. A bit of context 3. Apache NiFi and the SOC 4. Demo 5. Questions The agenda. | Logging at scale – Doing more with less 2
  • 3. Macquarie Government A bit about me. | Logging at scale – Doing more with less
  • 4. Macquarie Government Manager @ Macquarie’s Security Operations Center 20 years working in information cyber security Apache NiFi committer and PMC member | Logging at scale – Doing more with less https://github.com/trixpan https://twitter.com/trixpan About me
  • 5. Macquarie Government A bit of context | Logging at scale – Doing more with less
  • 6. Macquarie Government A bit of context About Macquarie Government • 42% of Australian Government agencies are our customers • 3+ billion events per day; Our tool stack is diverse and busy: • We generate TBs of data per day. • Since 2015 we have been using “Big Data” (i.e. Hadoop ecosystem) for reporting and analytics. • We are constantly looking for ways to offer our customers with better insights over the threats targeting them. • We also felt that relying exclusively on traditional SIEM wasn’t enough anymore. | Logging at scale – Doing more with less 6
  • 7. Macquarie Government A bit of context Could we leverage “big data” solutions to improve our SOC further? • Perhaps we could rationalise the way we collect and process log messages? • Perhaps we could do enrichment against a more diverse set of sources?? • What else? | Logging at scale – Doing more with less 7
  • 8. Macquarie Government So we went and evaluated lots of tools and architectures looking to map things like: • Ability to integrate with SIEM pipelines natively • Ability to consume cloud services (IaaS, PaaS and Saas) • Ability to query odd stuff • Inbuilt Security • Ability to Scale out • How easy to maintain and extend | Logging at scale – Doing more with less 8 and many more… A bit of context All Apache project logos are trademarks of the ASF and the respective projects. Logstash is a trademark of Elasticsearch BV, registered in the U.S. and in other countries. fluentd is trademark by Treasure Data
  • 9. And the winner was…
  • 10. Macquarie Government Sorry, there was a mistake… | Logging at scale – Doing more with less 10 All Apache project logos are trademarks of the ASF and the respective projects.
  • 11. Macquarie Government Let’s talk tech. | Logging at scale – Doing more with less Apache NiFi and the SOC
  • 12. Macquarie Government A bit about Apache NiFi – A brief Prologue When you are start shipping “data” seems like an “easy” task | Logging at scale – Doing more with less DC1 DB 12
  • 13. Macquarie Government A bit about Apache NiFi – A brief Prologue But as the environment grows, complexity compounds… …but you keep adjusting your environment | Logging at scale – Doing more with less DC1 DB DC2 DB HQ ClientX AZ1 AZ2 13
  • 14. Macquarie Government ‘til the point you suddenly realise your pipeline is missing a bit of cheese. Or worse… | Logging at scale – Doing more with less © Luca Nebuloni https://www.flickr.com/photos/nebulux/10708289086/ 14
  • 15. Macquarie Government A bit about Apache NiFi – A brief Prologue | Logging at scale – Doing more with less Source: https://goo.gl/xKoavI 15
  • 16. Macquarie Government “Apache NiFi supports powerful and scalable directed graphs of data routing, transformation, and system mediation logic.” Open sourced by the National Security Agency in 2014[1] and submitted to The Apache Software Foundation for on-going stewardship [1] https://goo.gl/aZxCIC | Logging at scale – Doing more with less • User friendly interface • Flexible • Data Agnostic • Inbuilt mechanisms to balance between latency and throughput • Fine grain control of delivery guarantees (e.g. discard a flowfile once it becomes too old to be relevant). • “Secure” • Data provenance (from where, to where, changed by, etc.) • Authorization Policies, TLS, Kerberos, Encryption and a handful of other features • Designed for Extension A bit about Apache NiFi16
  • 17. Macquarie Government A bit about Apache NiFi NiFi allows you easily move data between A and B (and B to A) in a controlled, secure and reliable way, while still allowing you to process and granularly apply logic to the data in motion. | Logging at scale – Doing more with less 17
  • 18. Macquarie Government A bit about Apache NiFi A few examples on how NiFi capabilities help a SOC: • Rationalising the flows of data into your SIEM • Do you truly need your SIEM to be ingesting all your logs? • What happens when you run more than one SIEM (because it may well happen…)? • Enrich data against a diverse range of sources • ElasticSearch, REST APIs, DNS, Redis, Whois, GeoIP, SQL, MISP (via HTTP) • (Pull|push) data (from|to) a diverse set of platforms • Object based stores such as GCS or S3, FTP, SFTP, Mainframes via WebSphere MQ, Files, SQL and Syslog of course. | Logging at scale – Doing more with less 18
  • 19. Macquarie Government Let’s take a closer look. | Logging at scale – Doing more with less
  • 20. Macquarie Government | Logging at scale – Doing more with less DEMO Time
  • 21. Macquarie Government Let’s talk. | Logging at scale – Doing more with less Andre Fucs de Miranda Macquarie Government amiranda@macquariegovernment.com 1800 004 943