SlideShare a Scribd company logo
guarding your data
Andrew Gilmartin
senior developer
agilmartin@crossref.org
don’t loose the data
•  Oracle holds the data in Boston
Oracle × 2 + tape + off-site tape

•  Oracle holds the data in Denver
Oracle × 2

•  Denver less than 20 minutes out of sync with Boston
syncing Boston & Denver
•  Oracle's
Oracle Streams …

•  Other's
Tungsten Replicator …

•  Our's
the tool’s context
•  use existing tooling
•  use a small set of dependent parts
•  use already monitored parts
•  Oracle
•  file systems
•  networks
•  …
the tool’s necessities
•  reliable
•  recoverable
•  chatty
•  alarming
Oracle’s archive-log
1  archive-log represent the changes to the database
since the last archive-log
2  discover created archive-log via Boston Oracle’s
tracing-log
3  transfer archive-log to Denver
4  register archive-log with Denver’s Oracle,
5  discover archive-log applied via Denver Oracle’s
tracing-log
a bash script
•  script & configuration are short at ~200 & ~10 lines
respectively

•  same script & configuration in Boston & in Denver
•  Boston based script relies on Denver based script and
vise versa

•  a known set of functions
significant event markers
•  events triggered by tracing-log files
•  events are recorded & coordinated via marker files
•  events are created, transferred, registered, & applied
•  markers are distinctively named, empty files in Boston
arch_1_640475_754538743.arc.created
archive-log’s name!

marker!
were what happens
•  Boston continuously watches for archive-log creations
invokes “transfer & register” process in Denver

•  Denver continuously watches for archive-log
applications

invokes “applied” process in Boston

•  Boston periodically investigate process’s status
•  Boston & Denver periodically clean up
some implementation notes
•  humans are are good at •  check command exit
recovery

status

•  use the operating system •  retry commands at
... ssh keys
... directory as database

•  use configuration files
rather than command
line options

•  Use command timeouts

lengthening intervals
… 5s 30s 300s …

•  use bash's -E option
some resources
•  spark.sh & spark.conf
•  ask Chuck for our tool, ckosher@crossref.org
•  Giuseppe Maxia’s

“Script it! Make professional DBA tools out of nothing”
https://www.percona.com/live/mysql-conference-2013/
sessions/script-it-make-professional-dba-tools-outnothing
2013 CrossRef Workshops System Update: Guarding Your Data, Andrew Gilmartin

More Related Content

PPTX
Logstash
PPTX
Monitoring Docker with ELK
PDF
Open Source Logging and Monitoring Tools
PPT
'Scalable Logging and Analytics with LogStash'
PDF
Logstash + Elasticsearch + Kibana Presentation on Startit Tech Meetup
PPT
Logstash
PDF
Logstash family introduction
PPTX
Introduction to ELK
Logstash
Monitoring Docker with ELK
Open Source Logging and Monitoring Tools
'Scalable Logging and Analytics with LogStash'
Logstash + Elasticsearch + Kibana Presentation on Startit Tech Meetup
Logstash
Logstash family introduction
Introduction to ELK

What's hot (20)

PPTX
Scaling an ELK stack at bol.com
PPTX
Node collaboration - sharing information between your systems
PDF
Logstash: Get to know your logs
PPTX
ELK Stack
PDF
Experiences in ELK with D3.js for Large Log Analysis and Visualization
PDF
Docker Logging Webinar
PDF
Central LogFile Storage. ELK stack Elasticsearch, Logstash and Kibana.
PDF
Monitoring with Graylog - a modern approach to monitoring?
PDF
Elastic{ON} 2016 Review - 김종민 님
PDF
Machine Learning in a Twitter ETL using ELK
PPTX
MySQL Slow Query log Monitoring using Beats & ELK
PPTX
Elastic - ELK, Logstash & Kibana
PDF
Logging logs with Logstash - Devops MK 10-02-2016
PPTX
PDF
Shipping & Visualize Your Data With ELK
PPTX
Puppetcamp Melbourne - puppetdb
PPTX
Customer Intelligence: Using the ELK Stack to Analyze ForgeRock OpenAM Audit ...
PDF
elk_stack_alexander_szalonnas
PDF
Monitoring NGINX (plus): key metrics and how-to
PPTX
Introduction to ELK
Scaling an ELK stack at bol.com
Node collaboration - sharing information between your systems
Logstash: Get to know your logs
ELK Stack
Experiences in ELK with D3.js for Large Log Analysis and Visualization
Docker Logging Webinar
Central LogFile Storage. ELK stack Elasticsearch, Logstash and Kibana.
Monitoring with Graylog - a modern approach to monitoring?
Elastic{ON} 2016 Review - 김종민 님
Machine Learning in a Twitter ETL using ELK
MySQL Slow Query log Monitoring using Beats & ELK
Elastic - ELK, Logstash & Kibana
Logging logs with Logstash - Devops MK 10-02-2016
Shipping & Visualize Your Data With ELK
Puppetcamp Melbourne - puppetdb
Customer Intelligence: Using the ELK Stack to Analyze ForgeRock OpenAM Audit ...
elk_stack_alexander_szalonnas
Monitoring NGINX (plus): key metrics and how-to
Introduction to ELK
Ad

Viewers also liked (18)

PDF
2013 CrossRef Annual Meeting Flash Update CrossCheck and CrossMark Rachael La...
PDF
2013 CrossRef Annual Meeting CrossRef Overview Ed Pentz
PPTX
CrossRef DOIs for Books
PPT
2013 CrossRef Workshops Boot Camp Introduction Carol Anne Meyer
PDF
2013 CrossRef Annual Meeting Flash Update Branding Carol Anne Meyer
PPTX
2013 CrossRef Workshops System Update Chuck Koscher
PDF
Text and Data Mining with CrossRef
PDF
2013 CrossRef Annual Meeting Flash Update FundRef Kirsty Meddings
PDF
ALPSP "New Metadata, New Identifiers" by Kirsty Meddings
PDF
Barcelona 2014: CrossRef Tools for Small Publishers by Geoffrey Bilder
PPT
Love and Metadata: CrossRef at the Hub of Scholarly Communications by Carol A...
PPTX
CrossRef DOIs for African Journal Partnership Journals
PDF
FundRef on the AAP/PSP panel: CHORUS: A Collaborative Approach to Public Access
PDF
Crossmark, an introduction for publishers
PPT
Sex and the Scientific Publisher: How Journals and Journalists Collude (despi...
PPTX
The Journal of Clinical Investigation DOI Workflow (2011 CrossRef Workshops)
PPTX
The Persistence of Error (2011 CrossRef Annual Meeting)
PPT
UKSG Transfer Update (2011 CrossRef Workshops)
2013 CrossRef Annual Meeting Flash Update CrossCheck and CrossMark Rachael La...
2013 CrossRef Annual Meeting CrossRef Overview Ed Pentz
CrossRef DOIs for Books
2013 CrossRef Workshops Boot Camp Introduction Carol Anne Meyer
2013 CrossRef Annual Meeting Flash Update Branding Carol Anne Meyer
2013 CrossRef Workshops System Update Chuck Koscher
Text and Data Mining with CrossRef
2013 CrossRef Annual Meeting Flash Update FundRef Kirsty Meddings
ALPSP "New Metadata, New Identifiers" by Kirsty Meddings
Barcelona 2014: CrossRef Tools for Small Publishers by Geoffrey Bilder
Love and Metadata: CrossRef at the Hub of Scholarly Communications by Carol A...
CrossRef DOIs for African Journal Partnership Journals
FundRef on the AAP/PSP panel: CHORUS: A Collaborative Approach to Public Access
Crossmark, an introduction for publishers
Sex and the Scientific Publisher: How Journals and Journalists Collude (despi...
The Journal of Clinical Investigation DOI Workflow (2011 CrossRef Workshops)
The Persistence of Error (2011 CrossRef Annual Meeting)
UKSG Transfer Update (2011 CrossRef Workshops)
Ad

Similar to 2013 CrossRef Workshops System Update: Guarding Your Data, Andrew Gilmartin (15)

PPT
PDF
Migration to Oracle 12c Made Easy Using Replication Technology
PDF
381 Pdfsam
PDF
PDF
Replication Tips & Trick for SMUG
ODP
Logitoring - log-driven monitoring and the Rocket science
PPT
Digital Destiny
PDF
Oracle forensics 101
PPTX
ADDO Open Source Observability Tools
PDF
MyersTessella_Dec2013
PDF
Mining Your Logs - Gaining Insight Through Visualization
PPT
3-27-12 Preservation & Archiving Highlights from ADR - Presentation Slides
PDF
Security Events Logging at Bell with the Elastic Stack
PDF
Python vs JLizard.... a python logging experience
PPTX
What’s New in Oracle Database 19c - Part 1
Migration to Oracle 12c Made Easy Using Replication Technology
381 Pdfsam
Replication Tips & Trick for SMUG
Logitoring - log-driven monitoring and the Rocket science
Digital Destiny
Oracle forensics 101
ADDO Open Source Observability Tools
MyersTessella_Dec2013
Mining Your Logs - Gaining Insight Through Visualization
3-27-12 Preservation & Archiving Highlights from ADR - Presentation Slides
Security Events Logging at Bell with the Elastic Stack
Python vs JLizard.... a python logging experience
What’s New in Oracle Database 19c - Part 1

More from Crossref (20)

PDF
Crossref LIVE: The Benefits of Open Infrastructure (APAC time zones) - 29th O...
PDF
Crossref LIVE Chinese网络研讨会——Crossref简介 – 14 Oct 2021
PDF
Seminario web ‘Crossmark’, en español
PDF
Working with ROR as a Crossref member: what you need to know
PPTX
Преимущества и варианты использования метаданных в Crossref / The Value and ...
PDF
Seminario web ‘Similarity Check’, en español
PPTX
Crossref LIVE Indonesia: One Search Platform (Drs. Muhammad Syarif Bando pres...
PDF
Crossref LIVE Indonesia: The Future of Indonesian Journal Policy (with Dr. Lu...
PPTX
Crossref LIVE Indonesia: The Value and Use of Crossref Metadata, CRLIVE-ID 15...
PPTX
Crossref LIVE Indonesia: Content Registration at Crossref, CRLIVE-ID 14 July ...
PPTX
Crossref LIVE Indonesia: An Introduction to Crossref, CRLIVE-ID 13 July 2021
PDF
Crossref İçerik Kaydı Webinarı, Türkçe | Content Registration at Crossref , ...
PDF
Los Metadatos Para la Comunidad de Investigacion
PPTX
تسجيل المحتوي مع كروس رف – ندوة عبر الانترنت باللغة العربية | Content Registr...
PPTX
Content Registration, Crossref ALJEBI, Indonesia
PDF
crossmark update
PPTX
Participation reports webinar December 2020
PPTX
Participation reports webinar November 2020
PDF
Introduction to Crossmark/Crossmark: O que é e como usar
PPTX
Crossref LIVE UK Online
Crossref LIVE: The Benefits of Open Infrastructure (APAC time zones) - 29th O...
Crossref LIVE Chinese网络研讨会——Crossref简介 – 14 Oct 2021
Seminario web ‘Crossmark’, en español
Working with ROR as a Crossref member: what you need to know
Преимущества и варианты использования метаданных в Crossref / The Value and ...
Seminario web ‘Similarity Check’, en español
Crossref LIVE Indonesia: One Search Platform (Drs. Muhammad Syarif Bando pres...
Crossref LIVE Indonesia: The Future of Indonesian Journal Policy (with Dr. Lu...
Crossref LIVE Indonesia: The Value and Use of Crossref Metadata, CRLIVE-ID 15...
Crossref LIVE Indonesia: Content Registration at Crossref, CRLIVE-ID 14 July ...
Crossref LIVE Indonesia: An Introduction to Crossref, CRLIVE-ID 13 July 2021
Crossref İçerik Kaydı Webinarı, Türkçe | Content Registration at Crossref , ...
Los Metadatos Para la Comunidad de Investigacion
تسجيل المحتوي مع كروس رف – ندوة عبر الانترنت باللغة العربية | Content Registr...
Content Registration, Crossref ALJEBI, Indonesia
crossmark update
Participation reports webinar December 2020
Participation reports webinar November 2020
Introduction to Crossmark/Crossmark: O que é e como usar
Crossref LIVE UK Online

Recently uploaded (20)

PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Electronic commerce courselecture one. Pdf
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PPTX
Machine Learning_overview_presentation.pptx
PPTX
SOPHOS-XG Firewall Administrator PPT.pptx
PDF
cuic standard and advanced reporting.pdf
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
PPTX
Group 1 Presentation -Planning and Decision Making .pptx
PDF
Machine learning based COVID-19 study performance prediction
PPTX
Programs and apps: productivity, graphics, security and other tools
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
20250228 LYD VKU AI Blended-Learning.pptx
The Rise and Fall of 3GPP – Time for a Sabbatical?
Digital-Transformation-Roadmap-for-Companies.pptx
Reach Out and Touch Someone: Haptics and Empathic Computing
Electronic commerce courselecture one. Pdf
Advanced methodologies resolving dimensionality complications for autism neur...
Diabetes mellitus diagnosis method based random forest with bat algorithm
Encapsulation_ Review paper, used for researhc scholars
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Machine Learning_overview_presentation.pptx
SOPHOS-XG Firewall Administrator PPT.pptx
cuic standard and advanced reporting.pdf
Unlocking AI with Model Context Protocol (MCP)
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
Group 1 Presentation -Planning and Decision Making .pptx
Machine learning based COVID-19 study performance prediction
Programs and apps: productivity, graphics, security and other tools
“AI and Expert System Decision Support & Business Intelligence Systems”
Mobile App Security Testing_ A Comprehensive Guide.pdf
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf

2013 CrossRef Workshops System Update: Guarding Your Data, Andrew Gilmartin

  • 1. guarding your data Andrew Gilmartin senior developer agilmartin@crossref.org
  • 2. don’t loose the data •  Oracle holds the data in Boston Oracle × 2 + tape + off-site tape •  Oracle holds the data in Denver Oracle × 2 •  Denver less than 20 minutes out of sync with Boston
  • 3. syncing Boston & Denver •  Oracle's Oracle Streams … •  Other's Tungsten Replicator … •  Our's
  • 4. the tool’s context •  use existing tooling •  use a small set of dependent parts •  use already monitored parts •  Oracle •  file systems •  networks •  …
  • 5. the tool’s necessities •  reliable •  recoverable •  chatty •  alarming
  • 6. Oracle’s archive-log 1  archive-log represent the changes to the database since the last archive-log 2  discover created archive-log via Boston Oracle’s tracing-log 3  transfer archive-log to Denver 4  register archive-log with Denver’s Oracle, 5  discover archive-log applied via Denver Oracle’s tracing-log
  • 7. a bash script •  script & configuration are short at ~200 & ~10 lines respectively •  same script & configuration in Boston & in Denver •  Boston based script relies on Denver based script and vise versa •  a known set of functions
  • 8. significant event markers •  events triggered by tracing-log files •  events are recorded & coordinated via marker files •  events are created, transferred, registered, & applied •  markers are distinctively named, empty files in Boston arch_1_640475_754538743.arc.created archive-log’s name! marker!
  • 9. were what happens •  Boston continuously watches for archive-log creations invokes “transfer & register” process in Denver •  Denver continuously watches for archive-log applications invokes “applied” process in Boston •  Boston periodically investigate process’s status •  Boston & Denver periodically clean up
  • 10. some implementation notes •  humans are are good at •  check command exit recovery status •  use the operating system •  retry commands at ... ssh keys ... directory as database •  use configuration files rather than command line options •  Use command timeouts lengthening intervals … 5s 30s 300s … •  use bash's -E option
  • 11. some resources •  spark.sh & spark.conf •  ask Chuck for our tool, ckosher@crossref.org •  Giuseppe Maxia’s “Script it! Make professional DBA tools out of nothing” https://www.percona.com/live/mysql-conference-2013/ sessions/script-it-make-professional-dba-tools-outnothing