The System Crashed Hard
From the Project Management in Real Life Blog
by Drake Settsu
Sharing my Project Management adventures and some tips.
I like to keep my articles brief and to the point.
Project Management is an Art, Science, and Discipline.
Just keep it simple and have fun!
Recent Posts
Job Shadowing the Daily Work Routine
Mother Hen Leadership
Taming the Wild Wild West (Project Management) environment
The Hybrid-Plus called The LAW
Risk Register (Project Team Members)
Categories
Agile,
Blog,
Budget,
Budget Creep,
Budget Planning,
Business Analysis,
Career Development,
Communications Management,
Continuous Improvement,
Contractor,
Creative,
Data Center,
Deadlines,
Disaster Avoidance,
Disaster Recovery,
Educator,
Email,
Football,
Go-Live,
Hawaii,
Hybrid,
Hybrid-Plus,
Implementation,
Kaizen,
Kanban Board,
Kickoff Meeting,
KPI,
LAW,
Leadership,
Lean,
Lessons Learned,
Meeting,
Milestones,
MS Project,
New Release,
Options,
PMO,
Presentation,
Process Improvement,
Productivity,
Project,
Project Coordinator,
Project Management,
Project Manager,
Project Plan,
Project Tracking,
Projects,
Proposal,
Quarterback,
Real Life,
Requirements Management,
Risk Assessment,
Risk Management,
Risk Management,
Risk Register,
Scope Creep,
Scope Management,
Slideshow,
Software Development,
Software Updates,
Solutions,
Stakeholder Management,
Statement of Work,
Status Report,
Subject Matter Experts,
Subprojects,
Systems Administrator,
Teams,
Tips,
Training,
Transparency,
Vendor,
Waterfall,
Whiteboard,
Work Breakdown Structures (WBS),
Zen
Date

The Disaster
A server containing multiple disks managed by a vendor for the State of Hawaii’s driver's licensing programs had multiple hard disks crash. The storage media is encrypted and secured with some data not readable. There was no security or data breach. It will take a couple of months to determine if the data can be recovered.
The Vendor
The spokesperson gives a briefing at a press conference similar to "the dog ate my homework".
1) There was a backup system in place that's supposed to protect the data when the hard drives crash, it was not properly configured.
2) They were not aware that certain documents or images were not getting backed up properly.
3) The backups are checked to make sure everything is working properly. Don't know what the details are and what the plan was on checking the data.
Summary
When implementing a new system, I have few suggestions on how to make your system bulletproof. I used to be a Systems Administrator and I never lost any data on my watch.
1) The Project Manager in charge needs to have good Subject Matter Experts to recommend the technology, security, and procedures to be in place to ensure the system has the redundancy to withstand a disaster and prevent a security breach.
2) The Statement of Work needs to always include clear expectations.
3) Service Level Agreements need to be in place for the system.
4) Disaster Avoidance plan and Recovery Strategies to meet Service Level Agreements.
5) Test your systems periodically to ensure that the data is being replicated or backed up properly to a media that will go offsite.
6) Make sure you have a good hardware / software support contract in place.
7) Never ever trust the vendor. Make a checklist of your key deliverables to be reviewed and demonstrated when you begin the sign-off stage in the project. Any missing checks will result in a big missing check for that vendor to cash.
Posted on: March 27, 2018 10:18 AM |
Permalink
Comments (9)
Please login or join to subscribe to this item
Rami Kaibni
Community Champion
Senior Projects Manager | Field & Marten Associates
New Westminster, British Columbia, Canada
They obviously needed a better Risk Management Plan in place which includes making sure their contingency plan is solid and set up properly.
Anish Abraham
Privacy Program Manager| University of Washington
Auburn, Wa, United States
Good points, Drake and thanks for sharing this. We always have a blackout contingency plan when implementing a new system or during system upgrades.
Always test data recovery to ensure that what's written is actually complete & accurate. That's one of those "once bitten, twice shy" lessons. Thanks for the walk down memory lane, Drake!
Kiron
Thanks Drake, that reminds me of the long corridor of server racks in a major corp some 20 years ago. I saw it crash once too.
Thanks Rami, Anish, Kiron, and Sante.
I appreciate your comments.
Every project manager always needs to incorporate disaster recovery in a project. It's not rocket science.
Backup and disaster recovery strategy isn't just about being able to recover your data and get back to a normal state but it should enable your organization to continue performing operations before, during, and after an issue. And this requires help of SMEs to ensure that every loophole is plugged in.Frequent review of backup data may seem time consuming but ensures that the backup system will survive a crash.
Thank you Drake for great pointers.
Thanks Najam!
Vendors just want to sell their software and overlook the most important part that is Business Continuity.
Michael Delaney
Partner| Delaney Management LLC
West Chester, Pa, United States
I have been fortunate to have some exceptional technical people on my projects. In many cases they had suggested additional backups and other safeguards that prevent major project issues. Like any project the key to success is to have the right people and when it comes to areas of specific technical skills you need to have people who know better than you and that you trust.
Please Login/Register to leave a comment.