Skip to main content

How a Global Energy Supermajor’s Network Team transformed network operations — and kept the lights on through it all.

  • May 1, 2026
  • 0 replies
  • 11 views

NetBrain Community Team
Forum|alt.badge.img

 

NetBrain Customer Success Story

Zero Change-Related Outages.

Twelve Months. 800+ Sites. One Small Team.

How a Global Energy Supermajor’s Network Team transformed network operations — and kept the lights on through it all

 

When leadership at one of the world’s largest publicly traded oil and gas companies asked how a single engineer had documented 30 remote oil field sites for a divestment handoff in 30 minutes — a process that used to take two weeks — the answer was straightforward: NetBrain. But that moment was just one data point in a much bigger story.

For the company’s unconventional energy division, the headline achievement is this: zero change-related network outages over a sustained twelve-month period — across a network of 800+ sites, most of them unmanned and scattered across remote locations. No incidents triggered by a failed change. No rollbacks that became outages. No post-change surprises.

That outcome didn’t happen by accident. It was built deliberately — through rigorous change validation, proactive automation, and a platform that became the backbone of how the team operates.

0

Change-Related Outages

Over a sustained 12-month period

30 min

Divestment Documentation

30 sites. 1 engineer. Was 2 weeks.

30 sec

Device / Site Lookup

Any IP. Any time. Was 4 hours.

 

The Challenge: Rapid Growth, High Stakes, Small Team

When the Permian Basin experienced its resurgence, the network team found itself deploying one to two new sites per week with same-day notice. Engineers received calls that a router had been pulled off a shelf and needed to go live by end of day. Documentation lagged. Some sites had no address — just GPS coordinates and names like “XYZ Tank Battery.”

The technology environment spanned Juniper, Palo Alto, Versa, Cisco, Azure, AWS, VMware, and Nutanix, with global teams across multiple continents and an MSP handling tier-one through tier-three support.

Making changes in this environment carried real risk. Without confident pre-change validation or automated post-change verification, any network change was a potential outage in waiting. In an environment where sites are remote and unmanned, a failed change doesn’t just affect productivity — it can halt the operational technology systems that support oil field operations.

 

How They Got to Zero: The Change Protection Model

The zero-outage result came from building a systematic, repeatable three-stage approach to change management — one that eliminated the guesswork at every stage of a change lifecycle.

PRE-CHANGE

Validate baseline · confirm safe to proceed

DURING CHANGE

Real-time runbook · instant rollback signal

POST-CHANGE

Continuous monitoring · catch what windows miss

 

Want to implement this in your own operations?

See the Change Protection Playbook below for a step-by-step flow, NetBrain capabilities at each stage, and actions you can start using today.

“By adding the PA and network intents, I can now monitor behaviour and catch things I just wouldn’t be able to see until something goes down.”
— Senior Network Lead, Unconventional Energy Division

 

The Operational Shift — Key Contrasts

Before NetBrain With NetBrain
Change validation based on engineer’s best knowledge of current network state. Data-driven pre-change baseline assessment — data-backed green light before any change proceeds.
Silent redundancy failures undetected until secondary path failed — sudden loss of 30 sites. Intent-based monitoring catches peering loss immediately, before it becomes an outage.
Post-change validation required manual CLI checks across multiple devices. Automated runbook validates configuration, reachability, and health instantly after execution.

 

From Tool to Team Habit

“Someone one day goes, ‘Oh, let’s check NetBrain for this.’ And then it becomes, ‘NetBrain is just my default to check.’ And now we’re at the point where I hear someone go: ‘Did you check NetBrain first?’”
— Senior Network Lead, Unconventional Energy Division

 

That confidence expanded well beyond the division. The company’s flagship refinery — one of the largest in the world — adopted NetBrain after seeing the results. The team then hosted a two-day NetBrain training event for 25 attendees from five companies. The people who built the most capability chose to share it.

“NetBrain’s drive to do the best work possible for the customer is what causes me to be willing to champion it internally. I don’t do that with other products.”
— Senior Network Lead, Unconventional Energy Division

 

Twelve months. Hundreds of changes. Zero outages.
That’s not a goal. That’s proof.


The NetBrain Change Protection Playbook

Global Energy Supermajor · Unconventional Energy Division · 800+ sites · Zero change-related outages over 12 months

 

3 Things You Can Start Implementing Today

① Start

Run a Quick Assessment

Snapshot device state before any change. Know what’s healthy before you touch it.

② Ask

Use Runbook Companion

During the change window ask: “Do I need to roll back?” — data answers, not guesswork.

③ Schedule

Set up PAF Monitoring

Turn your best validation intent into a daily recurring check. One intent. Ongoing safety net.

 

The Change Protection Flow

Each stage maps directly to NetBrain capabilities you can configure and reuse in your own operations.

① PRE-CHANGE VALIDATION  ·  Know exact state before touching anything
Establish Baseline
Snapshot device state, interfaces & routing adjacencies.
Quick Assessment Runbook
Config Compliance
Compare running config against your golden standard.
Golden Config Intents
Go / No-Go
Confirm app reachability — data-driven green light.
Runbook Companion by AI
↓ Change approved · execute with confidence
② DURING-CHANGE IMPACT ANALYSIS  ·  Confirm the change lands correctly — rollback ready
Execute Change
Push via NetBrain template — rollback commands pre-loaded.
Automate Network Change
Validate Real-Time
Pings, config checks, interface status run automatically.
Runbook validation nodes
Rollback Signal
Instant answer: “Roll back?” — data, not CLI guesswork.
Runbook Companion by AI
↓ Change complete · continuous protection engaged
③ POST-CHANGE CONTINUOUS MONITORING  ·  Safety net stays active after the window closes
Schedule Monitoring
Validation intents run daily — not just during the window.
PAF Proactive Automation
Alert on Drift
Know immediately if config drifts from post-change state.
Intent-based alerts
Catch Silent Failures
Monitor redundant links — know before the backup fails too.
Custom Intent + PAF