Morgan Chase blames Oracle for online bank crash

JP Morgan Chase's online banking services crashed in New York last week, and the outage has been laid at Oracle's door. The bank's online services were unavailable from Monday night through Tuesday, with service restored very early on Wednesday. The crash prevented the bank's 16.5 million online customers from carrying out online banking …

COMMENTS

This topic is closed for new posts.
  1. The BigYin
    Joke

    Yeah, get rid of Oracle...

    ...use MySQL instead. Wait....what?

    1. Anonymous Coward
      Stop

      "Engineered to work together"

      Whoever decided to put Oracle RAC on an 8-node cluster of T5420's should be fired.

      First of all, the T chips have terrible single-thread performance; the chips are made in China and the systems are made in Mexico.

      RAC has terrible scalability - a 4-system cluster gives you the performance of about 3 systems.

      I bet Oracle will view this as an opportunity to sell Exadata....the customer should look at a better class of hardware....or maybe even moving to the mainframe.

  2. Anonymous Coward
    WTF?

    Much as I enjoy a good Oracle-bashing...

    A problem in an Oracle database does not necessarily equate to a problem with Oracle software.

    Could be an operational screwup.

    Could be an application problem.

  3. Pete 2 Silver badge

    The folly of hot backups

    > and this corruption was replicated in the hot backup.

    Uh, well, duh! Yeah. These things only protect against hardware faults. Against buggy software or human error (or malice) they are useless, since whatever gets done to the production instance is automatically done to your copy (maybe if they'd used the word "copy" in their systems architecture, rather than "backup", the shortcoming would have been more apparent).

    It does sound like a strange choice of resilience - given that the system which failed was already clustered and presumably the EMC storage was RAIDed to hell and back. So any single hardware failure could already be detected and hopefully mitigated in the server cluster or the SAN. Sounds a lot like "management by glossy brochure" rather than a professionally designed _and_ _tested_ failover system.
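    To make the "copy, not backup" point concrete, here is a minimal, purely illustrative Python sketch (nothing Oracle- or EMC-specific; every name in it is invented): whatever gets written to the primary, good or garbage, is mirrored straight into the hot copy, while a read-only snapshot taken earlier stays clean.

```python
# Illustrative only: a toy "volume" with a continuously synchronised hot copy
# and read-only point-in-time snapshots. All names are made up.
import copy


class ReplicatedVolume:
    def __init__(self, blocks):
        self.primary = dict(blocks)
        self.hot_copy = dict(blocks)   # the "backup" that is really a live copy
        self.snapshots = []            # read-only, point-in-time images

    def write(self, block_id, data):
        self.primary[block_id] = data
        self.hot_copy[block_id] = data  # replication mirrors *every* write, good or bad

    def take_snapshot(self):
        self.snapshots.append(copy.deepcopy(self.primary))


vol = ReplicatedVolume({"users.dbf": "good", "profiles.dbf": "good"})
vol.take_snapshot()                     # taken while everything is healthy

# Buggy software (or a fat-fingered admin) corrupts a file on production:
vol.write("profiles.dbf", "garbage")

print(vol.hot_copy["profiles.dbf"])       # -> 'garbage': the copy is corrupt too
print(vol.snapshots[-1]["profiles.dbf"])  # -> 'good': the snapshot predates the damage
```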

    1. Anonymous Coward
      Anonymous Coward

      Not strange or unusual at all.

      Remote replication of data is standard - just ask those who have lost a building if it's a good idea.

      It's also mandated by regulation in a good many circumstances.

      It can make a good deal of sense to have multiple live copies of your databases - not only for Disaster Recovery or Business Continuity but also as further sources for backups, reports etc.

      However, it is certainly not a zero-cost administration configuration and does require a good deal of tech know-how.

      How is it a folly to run hot backups?

    2. gurner

      Folly?

      Not necessarily. If the production system suffered a corruption due to a memory issue, rebuilding the DB from an RMAN backup with little or no load on the system /may/ produce different results.

      Mind you, I've managed Oracle DB's for 18 years and never had one corrupt on me.

  4. Robert Taylor
    Alert

    Oracle "corrupted the data"...

    So let me get this straight, they restored the backup and rolled forward all the transactions and that cured the problem?

    Well, in that case the Oracle database didn't corrupt the records - if it had, replaying exactly the same transactions a second time (from the redo logs) would have resulted in the same corruption happening again.

    Sounds far more likely to be a hardware problem to me (memory, SAN controller, etc).
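    The reasoning above leans on determinism, which is worth spelling out. A minimal Python sketch (no real Oracle redo involved; the data structures are invented for illustration): applying the same ordered transactions to the same starting state always gives the same result, so a replay can only "cure" corruption that was injected from outside the transaction stream - faulty memory, a misbehaving controller, or someone editing files by hand.

```python
# Illustrative only: deterministic replay of an ordered transaction log.
def apply_transactions(state, transactions):
    """Apply an ordered list of (key, value) writes to a copy of `state`."""
    result = dict(state)
    for key, value in transactions:
        result[key] = value
    return result


saturday_backup = {"profile_1": "ok", "profile_2": "ok"}
redo_log = [("profile_1", "updated"), ("profile_3", "new")]

first_run = apply_transactions(saturday_backup, redo_log)
replay = apply_transactions(saturday_backup, redo_log)

assert first_run == replay  # same input, same output, every time

# So if the live database ended up corrupt but the restored-and-replayed one
# did not, something outside this deterministic path scribbled on the data.
```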

    1. Ian Michael Gumby
      Boffin

      @ Robert Taylor

      Not exactly.

      You could have data corruption that occurs independent of the raw data. So it could be software and not hardware.

      The issue is setting the database back to a good state and then rolling forward a series of transactions. These could come from the database log or from an external log of the transactions.

    2. Pete 2 Silver badge

      which is more likely?

      > Four files in the database were awry

      I wonder if "awry" is a code word for "accidentally deleted"?

      There are so many possibilities, such as a tablespace that couldn't auto-extend, database files that had permissions or ownership changed. I doubt that we'll ever be told what really went wrong unless this gets taken to court and all the gory details get reported in the technical press.

      Just look out for DBA, storage admin or sysadmin vacancies at Morgan Chase - that'll probably be the only clue we'll get.

    3. Anonymous Coward
      Anonymous Coward

      Unlikely a hardware issue....

      More likely a software problem.

    4. Matt Bryant Silver badge
      Boffin

      RE: Oracle "corrupted the data"...

      Agreed, I hate to say this but in my view this is not an Oracle error, but an operator error. If the Oracle DB was somehow at fault then merely replaying exactly the same transactions through exactly the same systems would generate the same error, or at least hold the risk of generating the same error, so you simply wouldn't do it. My guess is the DB admins or sysadmins screwed up and manually "corrupted" the file, probably with a hasty vi edit, then couldn't spot the problem as everything came back as status OK. A user-"corrupted" file will pass all tests as it is not corrupted, it simply has incorrect data in it. The interesting bit now will be how Larry responds - he doesn't like anyone saying nasty things about his products, so will he simply swallow it seeing as Chase must be a big customer, or will he let rip and sue them?

    5. Jesper Frimann
      Headmaster

      Well one thing is what the article says..

      Well, it's a cluster of 8x T5420's running Oracle, which means it's most likely a RAC cluster. Or perhaps even one of those new Exadata boxes.

      // Jesper

  5. JaitcH
    Unhappy

    Could be a big interbank bill

    Each weekday morning, members of the clearing house meet to clear debits and credits between each and every bank at the start of the business day.

    If a bank is late, for whatever reason, interest is incurred until settlement is made!

    Poetic justice, the banks screw the retail customer as well as their own kind!

    1. Ian Michael Gumby
      WTF?

      Different system.

      Your big bill comes at the end of term on a swap/swaption cap/caption or whatever hedging derivative they use to raise capital.

      So when you have a 50 million dollar deal and you're at end of term, that's where you start to pay big bucks.

      But I digress. That would be a different system than the one they use to manage their retail clients.

      At least it should be....

  6. Ian Michael Gumby
    Boffin

    If they are looking at IBM...

    Don't know why they would be looking at DB2.

    JPMChase already has Informix's IDS in house - IBM's *other* database, and actually a much better performer in terms of HA and OLTP processing.

    But then again, JPMC will take the cue from their IBM client rep and his cadre of simple minded sales reps who'll sell JPMC whatever they think will make them the most money, regardless of what is really best for the customer.

    Do I sound a bit jaded? Maybe. But then again, I know more than I care to talk about. ;-P

    -G

  7. Anonymous Coward
    Anonymous Coward

    Hahahahahah

    DB2 - good one!

    Hahahahahahahahahahahahahahaha

  8. Anonymous Coward
    Happy

    Hmmm...

    What exactly is a T4520? Certainly not a Sun/Oracle box. A quick google shows me that a T4520 is a toner cartridge from Toshiba... Might explain a bit.

    Seriously though, it amuses me how a company can blame a specific vendor like this and then state that they are actively looking for another vendor. Funny 'cuz the largest, most successful companies in the world seem able to use Oracle every day without issue. How strange.

  9. GG77

    Chase has bad IT anyway

    I have tried to deal with Chase several times and their IT department has always been suspect. For instance, it takes 3-5 days to reactivate an account, whereas most banks can do this in real time. Any time I have asked questions about IT policies and why processing takes so long, it has generally taken me 4 calls and up to 10 forwards to get an answer from anyone with a clue.

    In short, it seems that Chase is generally clueless about its own IT, has conflicting policies and in general does not care much about making it work, much less making it work better.

    I have worked with IT at several large banks, and Chase by far seems to be the most behind and screwed up of them.

    1. Matthew Elvey
      FAIL

      More evidence their IT and IS are 'challenged':

      Security Alert - they don't even bother to fix identified security flaws:

      http://www.elvey.com/it/spr/SPR-2008-08-16.html

      So I wouldn't expect their disaster recovery plans to be in tiptop shape!

  10. Version 1.0 Silver badge
    Happy

    I see this a lot

    Pretty much all of the companies out there will spend far more development time and money on "adding" and "enhancing" features to their products (be it hardware or software) than they will on making their hardware bulletproof, or on writing software that actually checks that the inputs and outputs are valid.

    If the hardware failed - then shouldn't both the hardware and the software have picked that up?

    If the software failed - then I'd have expected a "sanity" check would have flagged that too.

    So basically - something screwed up ... and didn't flag a warning. That's pretty much par for the course, a lot more often than you'd think.
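    For what it's worth, the kind of "sanity check" being described is cheap to sketch. A purely illustrative Python example (not how any particular database or SAN actually does it): store a checksum alongside each block on write and verify it on read, so silent corruption at least raises an error instead of being handed back as valid data.

```python
# Illustrative only: checksum-on-write, verify-on-read.
import zlib


class CheckedStore:
    def __init__(self):
        self._blocks = {}

    def write(self, block_id, data: bytes):
        # Keep a CRC alongside the data so later reads can be validated.
        self._blocks[block_id] = (data, zlib.crc32(data))

    def read(self, block_id) -> bytes:
        data, stored_crc = self._blocks[block_id]
        if zlib.crc32(data) != stored_crc:
            raise IOError(f"block {block_id} failed its sanity check")
        return data


store = CheckedStore()
store.write("profiles.dbf", b"user data")

# Simulate a bit flip that happens behind the software's back:
data, crc = store._blocks["profiles.dbf"]
store._blocks["profiles.dbf"] = (b"user dbta", crc)

try:
    store.read("profiles.dbf")
except IOError as err:
    print(err)  # the corruption is flagged instead of silently returned
```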

  11. Shamus110
    FAIL

    Crap EMC SAN

    Probably crap EMC to blame.

    Although Chase has outsourced a lot of back-office functions to India - say no more ;-)

  12. Radek
    FAIL

    "No Mr. Customer, you don't need snapshots" - said once EMC rep

    This is my take on this:

    I am not going to blame Oracle for this misery - quite the opposite, actually.

    The story unfolds:

    "El Reg is told that Oracle support staff pointed the finger of blame at an EMC SAN controller but that was given the all-clear on Monday night."

    However:

    "Monash subsequently posted that the outage was caused by corruption in an Oracle database which stored user profiles. Four files in the database were awry and this corruption was *replicated* in the hot backup."

    Sweet, isn't it? And this is what differentiates a decent snapshot from replication – it will not inherit corruption which has just hit the primary data, because it's read-only, full stop.

    And the rest is obvious:

    "Recovery was accomplished by restoring the database from a Saturday night backup, and then by reapplying *874,000* transactions during the Tuesday."

    So my bottom line is:

    Had they had proper snapshot protection in place (e.g. on NetApp storage), the extent of this outage would quite likely have been substantially smaller, as recovery to, say, an hour-old snapshot would mean replaying only a handful of logs, compared to the 874,000 transactions collected between the Saturday backup and the Monday crash...
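    A back-of-the-envelope Python sketch of that last point, using the 874,000 figure from the article and an assumed, invented average transaction rate - the exact numbers don't matter, only that the redo you must replay scales with the age of your recovery point:

```python
# Illustrative arithmetic only; the 48-hour window and flat rate are assumptions.
TOTAL_REPLAYED = 874_000          # from the article: Saturday backup -> crash
HOURS_SATURDAY_TO_CRASH = 48      # rough assumption, ~two days
avg_txn_per_hour = TOTAL_REPLAYED / HOURS_SATURDAY_TO_CRASH


def transactions_to_replay(recovery_point_age_hours):
    """Roughly how much redo must be reapplied after restoring a copy
    that is `recovery_point_age_hours` old."""
    return round(avg_txn_per_hour * recovery_point_age_hours)


print(transactions_to_replay(48))  # restore Saturday's backup: ~874,000 transactions
print(transactions_to_replay(1))   # restore an hour-old snapshot: ~18,000 transactions
```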

    1. The_Tripper
      Jobs Horns

      NetApp? Replace a SAN with a NAS box?

      More than likely, what they are using is EMC's SRDF (assuming it's on a Symmetrix) in asynchronous mode for replication. When that is done, a "snapshot" can be made on a BCV volume, which is then replicated to the secondary Symmetrix for playback onto another BCV, then to the R2 device. Since the BCV is a complete copy of the volume, there is a huge difference between doing that and making a simple snapshot on a NAS box.

      And besides, I have never met a database administrator that DIDN'T jump and scream "IT'S THE STORAGE THAT DID IT!" when something went wrong. It's something in their genes that makes them blame the hardware.

      NetApp? Puh-leeeze.

      1. Radek
        Thumb Down

        Re: NetApp? Replace a SAN with a NAS box?

        Hmm, interestingly enough, Oracle internally uses petabytes of NetApp storage - more precisely in their Austin DC, where they use NetApp "NAS boxes" to e.g. host Oracle on Demand.

        Saying that NetApp is "just a NAS" is a plain lie (they can actually do NAS and SAN), and interestingly enough, Oracle uses NFS on NetApp as its storage protocol of choice, because it works *best* for them.

        EMC may have a lot of fancy replication tools (with even fancier acronyms), yet the plain fact is they can't match NetApp's robust snapshotting technology, which addresses most day-to-day backup & recovery needs (including the example we are talking about here).

  13. Anonymous Coward
    Pint

    Side issue - Food chain

    Labour would-be government (Our Tony): No top-up fee charges -> Labour Government: Top-up fee charges -> Extra Student loans -> More business for J P Morgan Chase -> Ha Ha! Who is J P Morgan's High Profile recruit a decade later? Coincidence?

    Trebles all round, as Private Eye would say.

    1. Anonymous Coward
      FAIL

      re: Side issue - Food chain

      That's strange, generally amanfrommars does not post anonymously...

      WTF?

  14. Anonymous Coward
    Anonymous Coward

    CHASE: "a third party supplier's database software corrupted systems information"

    Here's where the details are important . . .

    It's always the database's fault - a common misconception.

This topic is closed for new posts.
