SlideShare a Scribd company logo
A unified reporting & correlation tool for AHF Stack
Navigating Oracle Troubleshooting: AHF
Insights for Database 23ai
Sandesh Rao
VP AIOps and Machine Learning , Autonomous Database
@sandeshr
https://www.linkedin.com/in/raosandesh/
https://www.slideshare.net/SandeshRao4
AHF Stack
§ Purpose
§ Provides bird's eye view of the system from diagnostic perspective.
§ Offers insights for effective issue resolution with guidance & co-relation.
§ Unifies the AHF stack under a single user interface
§ Usage
§ Reactive, Proactive.
§ Target Users
§ Customers, Operations, Support, Development
§ To generate AHF Insights, run :
§ Collected by Default in AHF Collections from AHF 24.2
AHF Insights
AHF Insights (over command-line)
AHF Insights as part of AHF collection
AHF Insights as part of AHF collection
§ All Manual Collections
§ This is included in all AHF manual collections since 24.2
§ SRDCs (Subset)
Automatic Insights Report with AHF collection
§ dbinstancecrash
§ dbunixresources
§ crsdbhangperf
§ dbhangperflite
§ dbperf
§ dbrac
§ dbracperf
§ exadata
§ dbdataguard
§ dbhangperflite_auto
AHF Insights Overview
AHF Insights Overview
Need configurations details while
troubleshooting an issue
§ BareMetal system (DomU has access to query storage server and fabric switch details)
§ DomU has limited access
Configuration Details
Cluster Details
ASM Details
Database Details
Database Server Configuration
Database Server Configuration
Database Parameters
Kernel Parameters
Troubleshoot :
Issue due to inconsistent RPMs
§ Example Scenario - Inconsistent glibc Package Versions
§ Issue - Node Eviction and Clusterware Instability
§ Example
§ In a four-node Oracle RAC cluster, nodes 1 and 2 have glibc version 2.17-307 installed, while
nodes 3 and 4 have glibc version 2.17-307.el7.1 installed. This discrepancy can cause several
problems.
§ Impact
§ Node Eviction - Due to the different versions of glibc, nodes 3 and 4 might face eviction as the
clusterware detects inconsistencies in the environment.
§ Clusterware Instability - The inconsistency in glibc can cause instability in Oracle Clusterware,
leading to startup failures and communication errors.
Issue due to inconsistent RPMs
RPMS & Inconsistencies
Troubleshoot :
Issue due to software version lower than
MAA Software Recommendations
Recommended Software
All software should be updated regularly. Maintaining software at current or recent releases provides
the following benefits:
§ Better software security
§ More stable maintenance releases
§ Continued compatibility with newer related
software
§ Better support and faster resolution of issues
§ Ability to receive fixes for newly discovered
issues.
Troubleshoot :
Issue due to recent changes on the system
§ Issue due to Application of new Patch
§ Issue due to Changes on ASM / Database parameter
§ Issue due to New OS package installed
§ Issue due to New Oracle Software installed
Issue due to recent changes on the system
System Changes
Troubleshoot :
Space Usage Issues
Space Usage Issues
Troubleshoot :
Issue due to Best Practice Violations
Best Practice Violations
Troubleshoot :
Major Events happening across the cluster
§ For troubleshooting one needs to know :
§ What type of system does user have ?
§ What’s going on around the time of issue ?
§ Can I get a full picture across all nodes ?
§ Can I zoom into specific timeframe ?
§ Can I look at the data from various perspectives ?
Customer Complains of “Grid failure - CRS-8503 []” in SR
Customer’s System around the time of Issue
Major Events around the time of issue
Major Events around the time of issue
Major Events around the time of issue
Troubleshoot :
Operating System Issues
Customer’s System undergoes Node eviction
Customer’s System undergoes Node eviction
Customer’s System undergoes Node eviction
High Memory Pressure
Increase in RSS consumption
by ‘extract’ process
Customer’s System undergoes Node eviction
50GB RSS hogged by extract process
Troubleshoot :
Database Anomalies
Database Anomalies as observed in the Database Anomaly Advisor
Database Anomalies as observed in the Database Anomaly Advisor
Troubleshoot :
Node Eviction Problems
Node eviction due to Huge Page over-allocation
Node eviction due to Huge Page over-allocation
Node eviction due to Huge Page over-allocation
Node eviction due to Huge Page over-allocation
Node eviction due to Huge Page over-allocation
Node eviction due to Huge Page over-allocation
Node eviction due to Huge Page over-allocation
How customers can log SRs using
diagnostics
Sandesh_Rao_Navigating Oracle Troubleshooting- AHF Insights for Database 23ai - AIOUG July 2024.pdf
Sandesh_Rao_Navigating Oracle Troubleshooting- AHF Insights for Database 23ai - AIOUG July 2024.pdf
Sandesh_Rao_Navigating Oracle Troubleshooting- AHF Insights for Database 23ai - AIOUG July 2024.pdf
Demo
Thank you
46
Confidential – © 2019 Oracle Internal/Restricted/Highly Restricted

More Related Content

PDF
Beyond Metrics – Oracle AHF Insights for Proactive Database Management - DOAG...
PPT
Web Speed And Scalability
PPTX
Always On - Zero Downtime releases
PPTX
NCache Architecture
PDF
Application Scalability in Server Farms - NCache
PPT
Four Ways to Improve ASP .NET Performance and Scalability
PDF
Analysis of Database Issues using AHF and Machine Learning v2 - AOUG2022
PPTX
Design Best Practices for High Availability in Load Balancing
Beyond Metrics – Oracle AHF Insights for Proactive Database Management - DOAG...
Web Speed And Scalability
Always On - Zero Downtime releases
NCache Architecture
Application Scalability in Server Farms - NCache
Four Ways to Improve ASP .NET Performance and Scalability
Analysis of Database Issues using AHF and Machine Learning v2 - AOUG2022
Design Best Practices for High Availability in Load Balancing

Similar to Sandesh_Rao_Navigating Oracle Troubleshooting- AHF Insights for Database 23ai - AIOUG July 2024.pdf (20)

PPTX
Open source: Top issues in the top enterprise packages
PDF
SQL Server Alwayson for SharePoint HA/DR Step by Step Guide
PDF
Why MySQL High Availability Matters
PPT
Performance testing material
PDF
Design (Cloud systems) for Failures
PPTX
5 Quick Wins for the Cloud
PDF
Webinar slides: How to deploy and manage HAProxy, MaxScale or ProxySQL with C...
PPTX
Plan Your IaaS Environment for Optimal Performance
PDF
HA SOA Application with GlusterFS
PDF
How to build a winning solution for large scale VDI deployments
PDF
Analysis of Database Issues using AHF and Machine Learning v2 - SOUG
PPTX
OVHcloud – Enterprise Cloud Databases
PPSX
Basic Archive System overview
PDF
SQL AlwaysON for SharePoint HA/DR on Azure Global Azure Bootcamp 2017 Eisenac...
PPTX
ODW 2021 - Automated patching and compliance to improve database security.pptx
PPTX
Webinar: Five Problems Facing Business-Critical NFS Deployments
PPT
Building a Scalable Architecture for web apps
PPT
Securing Servers in Public and Hybrid Clouds
PDF
Building Robust, Adaptive Streaming Apps with Spark Streaming
PDF
X-Tour: Hochverfuegbare Anwendungen mit Nutanix bereitstellen
Open source: Top issues in the top enterprise packages
SQL Server Alwayson for SharePoint HA/DR Step by Step Guide
Why MySQL High Availability Matters
Performance testing material
Design (Cloud systems) for Failures
5 Quick Wins for the Cloud
Webinar slides: How to deploy and manage HAProxy, MaxScale or ProxySQL with C...
Plan Your IaaS Environment for Optimal Performance
HA SOA Application with GlusterFS
How to build a winning solution for large scale VDI deployments
Analysis of Database Issues using AHF and Machine Learning v2 - SOUG
OVHcloud – Enterprise Cloud Databases
Basic Archive System overview
SQL AlwaysON for SharePoint HA/DR on Azure Global Azure Bootcamp 2017 Eisenac...
ODW 2021 - Automated patching and compliance to improve database security.pptx
Webinar: Five Problems Facing Business-Critical NFS Deployments
Building a Scalable Architecture for web apps
Securing Servers in Public and Hybrid Clouds
Building Robust, Adaptive Streaming Apps with Spark Streaming
X-Tour: Hochverfuegbare Anwendungen mit Nutanix bereitstellen
Ad

More from Sandesh Rao (20)

PDF
AI Unleashed - Shaping the Future -Starting Today - AIOUG Yatra 2025 - For Co...
PDF
Accelerating Oracle Database 23ai Troubleshooting with Oracle AHF Fleet Insig...
PDF
Unlocking the Future- AI Agents Meet Oracle Database 23ai - AIOUG Yatra 2025.pdf
PDF
Oracle AI Vector Search- Getting Started and what's new in 2025- AIOUG Yatra ...
PDF
Will Oracle 23ai make you a better DBA or Developer?
PDF
Sandesh_Rao_Unlocking Oracle Database Mysteries AHF Insights and the AI-LLM D...
PDF
Whats new in Autonomous Database in 2022
PDF
Oracle Database performance tuning using oratop
PDF
AutoML - Heralding a New Era of Machine Learning - CASOUG Oct 2021
PDF
15 Troubleshooting tips and Tricks for Database 21c - KSAOUG
PDF
Machine Learning and AI at Oracle
PDF
Top 20 FAQs on the Autonomous Database
PDF
How to Use EXAchk Effectively to Manage Exadata Environments
PDF
15 Troubleshooting Tips and Tricks for database 21c - OGBEMEA KSAOUG
PDF
TFA Collector - what can one do with it
PDF
Introduction to Machine learning - DBA's to data scientists - Oct 2020 - OGBEmea
PDF
How to use Exachk effectively to manage Exadata environments OGBEmea
PDF
Troubleshooting tips and tricks for Oracle Database Oct 2020
PDF
Introduction to Machine Learning - From DBA's to Data Scientists - OGBEMEA
PDF
20 tips and tricks with the Autonomous Database
AI Unleashed - Shaping the Future -Starting Today - AIOUG Yatra 2025 - For Co...
Accelerating Oracle Database 23ai Troubleshooting with Oracle AHF Fleet Insig...
Unlocking the Future- AI Agents Meet Oracle Database 23ai - AIOUG Yatra 2025.pdf
Oracle AI Vector Search- Getting Started and what's new in 2025- AIOUG Yatra ...
Will Oracle 23ai make you a better DBA or Developer?
Sandesh_Rao_Unlocking Oracle Database Mysteries AHF Insights and the AI-LLM D...
Whats new in Autonomous Database in 2022
Oracle Database performance tuning using oratop
AutoML - Heralding a New Era of Machine Learning - CASOUG Oct 2021
15 Troubleshooting tips and Tricks for Database 21c - KSAOUG
Machine Learning and AI at Oracle
Top 20 FAQs on the Autonomous Database
How to Use EXAchk Effectively to Manage Exadata Environments
15 Troubleshooting Tips and Tricks for database 21c - OGBEMEA KSAOUG
TFA Collector - what can one do with it
Introduction to Machine learning - DBA's to data scientists - Oct 2020 - OGBEmea
How to use Exachk effectively to manage Exadata environments OGBEmea
Troubleshooting tips and tricks for Oracle Database Oct 2020
Introduction to Machine Learning - From DBA's to Data Scientists - OGBEMEA
20 tips and tricks with the Autonomous Database
Ad

Recently uploaded (20)

PDF
Empathic Computing: Creating Shared Understanding
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PDF
cuic standard and advanced reporting.pdf
PPT
Teaching material agriculture food technology
PDF
KodekX | Application Modernization Development
PDF
Unlocking AI with Model Context Protocol (MCP)
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PPTX
Spectroscopy.pptx food analysis technology
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PPTX
Cloud computing and distributed systems.
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PPTX
sap open course for s4hana steps from ECC to s4
PDF
Encapsulation theory and applications.pdf
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Empathic Computing: Creating Shared Understanding
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
cuic standard and advanced reporting.pdf
Teaching material agriculture food technology
KodekX | Application Modernization Development
Unlocking AI with Model Context Protocol (MCP)
MYSQL Presentation for SQL database connectivity
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Spectroscopy.pptx food analysis technology
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Cloud computing and distributed systems.
“AI and Expert System Decision Support & Business Intelligence Systems”
sap open course for s4hana steps from ECC to s4
Encapsulation theory and applications.pdf
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
The Rise and Fall of 3GPP – Time for a Sabbatical?
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
20250228 LYD VKU AI Blended-Learning.pptx
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows

Sandesh_Rao_Navigating Oracle Troubleshooting- AHF Insights for Database 23ai - AIOUG July 2024.pdf