The Register® — Biting the hand that feeds IT

Feeds

section icon Servers

Sysadmins: Your best tale of woe wins a PRIZE

I'd like to share with you the plight of a good friend of mine. He's a systems administrator for a mid-sized American accounting firm. His story isn't particularly remarkable, but all the more important because of it. Bob has a bachelor's in computer science, and a lot more experience than I'd normally expect someone clutching …

This topic is closed for new posts.

I once was caught by a network outage whose ultimate cause was:

An X509 certificate, in a Java package, had expired. The Java package was part of a UPS monitoring software (and nothing you'd ever normally play with). The UPS monitoring software went absolutely crazy and just hung the entire machine (as in, your mouse couldn't move across the screen in under a minute). The machines affected were pretty important and all of them had the problem. Because they still responded to some queries, pings, etc. (just about) nothing bothered to failover anyway but even the failovers had failed. And you couldn't log into the damn things or use any of them directly.

And obviously, the entire network slowed to an absolute crawl affecting the entire place it was in. Sure, we could force failover to a clean machine but those servers had apps running and we needed those apps (data - pish, we had that, but if you can't submit the payroll without that heavily-locked-down smartcard tied only to those machines you were stuffed).

Sounds like nothing but go through your standard procedures - everything responds, everything responds to ping, just some things are very slow. Can't narrow down to one server. Try to log in to them remotely and it timeouts. Go find the physical machines. Try to log into them. LITERALLY could not get the mouse to move. Suspect hung servers. Wait suitable length of time. Force a reboot. Server boots up at normal speed, does disk check, nothing wrong, gets to login screen, hangs like hell.

Okay. Weird. Safe mode. Same thing. Okay. Disconnect from network. Same thing. Restore from latest backup (which takes HOURS). Same thing. Okay, restore from Known Good backup (more hours!). Same thing. Confirm the machine is fully operational with install from clean Windows CD (but too much software on there to just replace every server in the space of a day or so from nothing). Perfectly working, no problems at all. Okay, restore from ancient backup taken just after the Stone Age and which DEFINITELY, DEFINITELY, DEFINITELY works. Same thing. WTH? Try other machines. Same thing.

Restore a backup to an identical machine acting as tepid-spare. Same thing. Bugger. Complete reinstall looking like only way to get it back and working. Clean Windows install on clean, isolated machine. Start reinstalling software from disks (known-good versions of everything that worked for YEARS). The reboot AFTER the UPS software is installed starts displaying the same symptoms. Damn! (But at least we can narrow down). After a couple of installation retries (and rebooting after literally every step), narrow it down to the UPS software installation. Update available? No. Reported problems? Yes, with same symptoms. No solutions.

Fall back onto the last resort of IT: Phone random IT people and see if they've had the same thing. Couple of them experiencing it, one of them in the advanced stages of diagnosis. After much jiggery-pokery and literally just turning the software on under heavy debugging/monitoring (and LOTS of rebooting) and trying to identify cause, narrow it down to accessing a certain Java package and then, inside that, find an expired certificate. How do we know it's the cert? Can you fool it by putting the clocks back? Actually, yes. Before the expiry time, it works, after it doesn't. Can you replace the cert? Yes, but the software refuses to work even if it doesn't hang. Can you fake it? No. Can you remove it? No, the software won't work. Can you run your processes with an inaccurate clock? No. Can you run your server without UPS monitoring? Not really.

So it takes this long to get to knowing what's happened and what you can do about it. The affected servers can have their filesystem accessed by something else (Linux boot disk) and have the UPS software disabled, because there was literally no other way to get the software turned off once Windows had started (and actually, I moved the UPS monitoring to a Linux machine that then would issue shutdown notices to the Windows servers instead!).

Eventually, after about a week, the UPS manufacturer issue an update and pretend it never happened (and NOBODY installed that update without first installing clean or working out what the problem was). But you try diagnosing that bugger while having a whole company breathing down your neck and silence in the server rooms.

What saved the day? Linux, contacts, careful investigation, no blind reinstalling, and not relying on the UPS manufacturer to actually DO anything about it.

The last thing you expect when your whole network goes down (and only the servers, not the clients) is some UPS software that's been working FOREVER, on known-good versions, on known-good servers to suddenly stop working even on restoration of older, working backups (whose restoration had ALWAYS worked before that point when testing backups) and then hang the machine to the extent that you literally could not do ANYTHING on it at all. I was just about ready to kill Microsoft until we saw that no updates had taken place in the weeks leading up to it and the servers had been rebooted and backed up and test-restored since then.

Bronze badge
Megaphone

As a warning to the rest of us, which UPS manufacturer are we talking about?

Bronze badge
FAIL

RE: which UPS manufacturer

Yes, enquiring minds want to know!!!

Especially considering that we have 6 UPS systems at WROK PALCE, one for each of 5 floors, and one for the server room.

Most appropriate icon I can think of.

Anonymous Coward

Not a sys admin but...

I've done something which might just about qualify - it's more hardware related than admin related to be honest. Actually, it's stupidity related. And as I'm not a sysadmin, I'll remain anon and discount myself from the prizelist.

I was, err, playing kinda "bit of stick and a few rolls of paper cricket" in the server room with a colleague (yeah I know, I know...)

My colleague scored at least a 4 as the "ball" sailed over my head and went behind a cabinet. In my hurry to ensure it didn't become a 6, I rushed round the back of the cabinet. And not just any cabinet, oh no. This was the cabinet where every terminal in the entire organisation had an RS232 patch lead going from the back of the racks into a patchboard on the wall.

I tripped, and landed in amongst all the cables. Very few of them had covers, just three wires soldered onto the pins, and most of them were a pretty tight fit into both the router-thingy (I can't remember what it was called) and the patch-board. So it just ripped the wires straight from the pins. I must have destroyed about 80 or more cables.

We had to sit soldering for at least 4 hours solid.

To add to the fun, none of the cabling was labelled, so we had no clue which machine-connector went to which patch-connector, so we were plugging people into someone else's still-logged-on account right left and centre.

You might imagine the first thing we did was panic. Wrong. The first thing we did was get rid of the bit of stick and the rolls of paper. Then we panicked.

Anonymous Coward

Re: Not a sys admin but...

this is exactly why the helpdesk are not allowed in the server room, none of them can be trusted. Next time they ask why i shall forward them to this story...................

Silver badge
Headmaster

Re: Not a sys admin but...

The greater FAIL was whoever connected and configured it in the first place. With the right jumper on the D25 (or D9) connectors, unplugging the connector would have been treated the same as a modem hangup, and that should have terminated the logged-in session if the software config had security in mind.

Re: Not a sys admin but...

Sod that - why do you have 80 or more unlabelled cables just plugging into an apparently vital device?

Get the Dymo out, ffs!

Anonymous Coward

Re: Not a sys admin but...

this is exactly why the helpdesk are not allowed in the server room, none of them can be trusted. Next time they ask why i shall forward them to this story.

I'm not helpdesk, I'm a 26-and-counting-years-experienced dev. But, and far more importantly, I'm also a big kid, and I wouldn't have it any other way :)

The greater FAIL was whoever connected and configured it in the first place. With the right jumper on the D25 (or D9) connectors, unplugging the connector would have been treated the same as a modem hangup, and that should have terminated the logged-in session if the software config had security in mind.

This was pre-internet, so no IT threats from outside. As far as internal security goes, the payroll office got armed-robbed the morning of pay-day (twice, if memory serves), and the social club regularly got broken into and all the alcohol nicked. They plastic-injection moulded approximately 4 tons of a mixture of key-rings, cups, plates, and other trinkets for an open day, the entire lot of which got stolen from the factory floor the night before the open day. It was regularly seen for sale at market stalls for months afterwards. Security was something that was to be provided by big ugly burly men, the concept of information security would have sailed right over the MD's head like, well, a paper cricket ball. To be honest, the only threat from a keyboard would have been someome clocking you over the head with it to steal your lunch money.

Sod that - why do you have 80 or more unlabelled cables just plugging into an apparently vital device?

Get the Dymo out, ffs!

No need. The entire factory and office block is now taking up valuable space in someone's landfill site.

Bronze badge
Devil

Re: Not a sys admin but...

Hmm. I bet you some of those thefts were internal.

During my first week as an ICT Techie at a school I encountered a problem with MS Office on one of the machines. Reading round a few of the knowledge base articles and support tickets the answer was to re-allocate the MS Office package to the workstation, basically un-installing and re-installing the whole of Office. Being as I'm such an efficient person I thought "Heck, why not re-allocate the package to the entire domain therefore fixing any other machines that may have the issue but no-one has noticed yet". Fantastic idea, especially 2 days later when people could finally use their computers again as it brought the whole network to a crawl.

First week in a new job not knowing enough about what your doing is a bad combination.

On the other hand, Office worked pretty well after that.

---

I also recently had a colleague panic that one of the displays on a laptop had packed in, he went through the usual processed including contacting the manufacturer to check it's still in warranty before someone said "Have you tried the key combinations to increase the brightness" to which he replied "Of course I have, but strangely it's just started working fine" - hmm...

Cheers

I once crashed an entire school network (it was RM, so the classification of it being a computer, or a network, is dubious at best) by allocating an MSI package (I say that, but the RM ones are pretty butchered - embrace and extend again!) that happened to have a space in the filename. It was only a tiny MSI of something minor but I'd called it something like "packagename update" so I could distinguish it from the base package. I'd tested it on some machines and it was fine, all I did was rename it and deploy it for installation that night (you can't really deploy stuff during the day in a school).

Every single client crashed at reboot (which obviously didn't happen until the next morning) and just hung there before the login screen because their crappy software didn't know how to handle a space (despite the fact that their network management tool on the server did to allocate you to allocate it in the first place). And there was little you could do about it except login and clean-install or (if you know the magic words) wipe out their package databases after removing it from the server.

I made the RM guys fix it in the end, because I certainly wasn't going to be held responsible for their crappy software. And even though they told me the problem was "fixed" in the next hotfix, I never deployed another package with a space in the filename, just in case.

Anonymous Coward

It's a fairly safe assumption that spaces in a path will fuck something up at some stage, even with proper operating systems.

Alert

"The printer isn't working!"

"Did you switch it on?"

"Oh."

Anonymous Coward

Re: "The printer isn't working!"

I once got woken up by a phone call at 5am - someone had gone into work early and the building was "dangerous" because it was too dark. "Have you tried turning on the lights?" "Oh."

Alert

The day the M.D. acted as a doorman...

A long time ago (this was before XP) I was running maybe five minutes late one day (co-incidences are so unfair), and as I approached the building, from inside the building the Managing Director opened the door eager and worryingly pleased to let me in...

Expecting something about being late I approached. nervously, saying good morning, and he was strangely not sounding bothered about me being late. What he WAS bothered about was that no-one in the office was able to login to their desktops as he calmly explained the situation saying he hoped I would be able to help.

The other local techie could login to our (then single) server, not his machine, and had found rebooting the server didn't help... I poked around for five minutes and saw a DHCP configuration change from about 2/3 days before, reverted it, and as if by magic everyone could login!

An offsite techie had changed something which SHOULDN'T have caused an issue, but somehow did. He was confused because of the time delay, thinking any changes should've been immediate if there had been a problem. Every DHCP request was failing (never worked out why), so no machines were getting IPs for the network (having lost the rights to the IPs from the Monday), so couldn't communicate with the server to validate the login details.

Took me WEEKS before I stopped nervously looking for the M.D. ready and waiting to open the door for me each morning.

Re: The day the M.D. acted as a doorman...

I've had techies come in and start up their own DHCP server (presumably accidentally by sheer incompetence) by leaving things like the DHCP options turned on on, say, an ADSL router or similar, and then leave it running. Things can work fine for a while until either a) machines leases expire, b) the server detects the other DHCP and shut down its own service (thus making you reliant on that little ADSL router to do EVERYTHING or sometimes c) machines "closer" (network-wise) to the DHCP server get one response, the rest a proper response from the server and sometimes the two overlapping and issuing leases for each other's IP's to each other's clients.

Then I would get a phone call (I was doing the rounds for local schools and spent a day per week in each for many years) that nothing was working and had to work out what was happening when no documentation or even help from the original installers existed for the networks they were running (is the ADSL *supposed* to be giving out addresses? In what range? What ranges are the firewall etc. configured to use? etc.). But, of course, "nobody's touched the network in 28 days" and no problems are noticed until the DHCP leases die or the ADSL goes off or whatever.

The nicer switches let you block DHCP from all but one port, which is a lovely option until you're not told that and try to replace a server that's falling over. And the places that have those nicer switches tend not to have the problem in the first place (though it is funny when someone replaces that switch and you find that there's dozens of DHCP servers all other the place in WAP's etc. and nobody was previously affected because of that safety barrier that they've just removed).

Lights, camera, silence

First week running the bank branch on my own. Walked in as usual one morning to change the tapes, turned the lights on, and had the gut churning chunk sound of every machine in the server room losing power at once.

Several hours of mainframe IPL later and we're finally back up with me on the phone to our electrical guys to work out how the hell the room lights were wired into our UPS power.

Everyone assembled onsite the next day to prove that it wasn't possible. Took everything down just in case and brought them up in standby. Most satisfying junior admin moment ever when after 3-4 attempts by the sparky I walked up, hit all four switches at once and the room promptly crashed to dropped jaws from all inside.

Turned out to be a bit of dodgy wiring in an unlabelled sealed over switchbox in that section of wall - it was an old power cutoff wired under the floor to the master cutoff switch on the far side of the room. Since the wall was just a cheap partition, the vibration of the door closing over time had jolted a wire loose, and turning the lights on at once was just enough vibration to make the wire jump free and take everything down from inside the UPS supply.

Good lesson that even completely disconnected systems can be linked, if not in the fashion you would have expected.

Gold badge

Sorry for the off-topic

"Good lesson that even completely disconnected systems can be linked, if not in the fashion you would have expected."

I remember reading in the 1980s about someone who'd bought a Ugo. Communist block car, manufactured in Yugoslavia. It was converted to right-hand drive for the UK. As it turns out, simply by sawing the old pedals off the left hand side, and covering them in carpet.

Going down the motorway at 70. Passenger yawned, and stretched, and stretched, and BRAKES!!! Could have equal amounts of 'fun' with the accelerator at junctions...

Anonymous Coward

Re: I call bollocks

They might have left the pedals in, but they wouldn't have been hooked up.

Stop

Re: I call bollocks

Wouldn't be too sure, I remember a Watchdog piece about a modern European car (I believe a Renault but not sure) where they had converted it to right hand drive by leaving the break linkage on the left hand side and bolting a bar behind the carpets to the right.

The passenger could happily apply the brakes (the real brake pedal moved) by pushing hard on the carpet. I remember this only being a problem with the brake pedal, so helpful when the other half thinks braking is a silly idea when a tree will suffice.

Re: I call bollocks

Actually, being a total cheapskate I bought my (ex)wife a Yugo a few years ago. I discovered that you *could* brake and accelerate from the passenger seat. So, your call of bollocks on this occasion is wrong. Probably why she is an Ex :))

Bronze badge

Re: I call bollocks

Good cars for driving instructors then?

Bronze badge
Meh

The I Something Department

Rescued a member of staff from a portakabin office after a door access system (Which was dumped on us after the person who brought it and someone asked how do we use this) was blaimed for keeping a door shut.

After going to check and finding the physical lock was to blame we then had to help out the trapped member of staff through a window (and then raise lots of H&S forms). Still have no idea what this has to do with the IT (Sorry don't deal with technology anymore I S(omething) department) but makes some days interesting.

Bronze badge
Devil

Not really related an IT cockup...

In my first IT job, I ended up being promoted to the network and sys admin above the trainee techs that I used to look after. There was only 3 of us in the office always bombarded by teaching staff and kids wanting their laptops fixed day-in-day-out.

Anyway; one day, I called my boss about something that a member of staff had done to their laptop quite amazingly by the excuse they gave me. During this conversation where I was explaining things quite angryily, one of my junior techs and the stage technician were messing around with a can of compressed foam (that we used to clean up the old bits of kit and resend them out). Whilst throwing this can around, I will still on the phone and my boss agreed with my concerns. Suddenly, I heard this mini-explosion behind me. Only did the stage tech throw the compressed foam onto the corner of a metal file cabinet and I turn around whilst mid-conversation to find the room fully covered in foam (including my back!). I had to keep the conversation with my boss going which was luckily ending whilst the junior tech and stage tech saw me turn round to see what the hell went on. Let's just say that the evidence was still apparent on the posters we'd put up after we manically tried to clean up the office pronto before my boss walked in who only was based next door to us.

We did tell him in the end, but that was the last time my junior tech ever got the near the compressed cleaning foam!

Anonymous Coward

Our VPN box...

[user@server ~]$ date;date;date;date;date;

Thu Oct 21 10:12:23 BST 2010

Thu Oct 21 10:12:18 BST 2010

Thu Oct 21 10:12:23 BST 2010

Thu Oct 21 10:12:18 BST 2010

Thu Oct 21 10:12:23 BST 2010

Disabled anything remotely to do with ntp. Rebooted. Didn't make a difference.

[user@server ~]$ ntpdate pool.ntp.org

All back to normal. For five minutes. A day later it was a couple of minutes out again.

Called the hosting company's support line. Three days of head scratching later, they say, "We'll swap the drives into another box." One day after that, they say, "It's all fixed". One more day and it's buggered again.

VPN box became ftp box, ftp box became VPN box. Almost two years later:

[user@server ~]$ date;date;date;date;date;

Thu Jun 28 13:43:35 BST 2012

Thu Jun 28 13:50:09 BST 2012

Thu Jun 28 13:43:35 BST 2012

Thu Jun 28 13:50:09 BST 2012

Thu Jun 28 13:43:35 BST 2012

Re: Our VPN box...

That smells like a badly configured Virtual Server mate. I've had VM's doing that in the past.

Facepalm

There's a big data centre in Wapping, with about 100 racks per floor. As it should be, this is access controlled to prevent the cleaning lady from wandering in with her polish. However, no one thought to tell the new security guard that you need to swipe your card to get out again. He just saw the big (red) button and thought that was the unlock button. He'd never seen an emergency power shutoff switch before.

1000(ish) servers down in less than 2 seconds.

Silver badge

Smoking boots and server outages

I was once paged by a major client in order to deal with an 'outage' of nearly a hundred mid-to-high end UNIX servers at around 4am. A quick wander around revealed that every server had suffered a power outage that lasted nearly an hour, and nobody would tell me what the root cause was, and looked really shifty when queried about it. I started running file system checks and getting everything up and running again, and tried to find out what the story was.

Eventually I discovered why everything had dropped and why the Ops Bridge crew were looking guilty and keeping quiet: A UPS engineer had been working on the system overnight. Due to the vendor being tight-wads and cutting contractor rates to the bone, said sparky did not have a mate with him and was working alone on the systems, and the vendor did not arrange a full outage because the systems were too critical to business. The guy fried himself and was killed and cooked over a period of time. If someone had been there with him, they could have killed the power in time and he'd have lived (he was still alive when the first medical guys arrived, but they wouldn't touch him as the power was still on). There was some delay in power-down due to concerns that HA systems would be affected. Which shows you the value placed on human life by some companies.

So I fsck'ed a bunch of machines in the wake of this mess.

The kicker: The department in question had a great safety record and all of the vendor's staff got free stuff while the record was maintained. In order not to jeopardise the clean health and safety sheet and cause awkward questions, the death officially went down as a suicide.

Because obviously you don't work on a UPS alone unless you are suicidal...

Anonymous Coward

Re: Smoking boots and server outages

and this is still a secret and no-one did/is doing time for this?

You shouldn't be telling us, you should be telling the peelers. >:(

Silver badge

Re: Smoking boots and server outages

It wasn't in the UK.

Life is cheap when it's outsourced and you're overseas.

This post has been deleted by its author

This post has been deleted by its author

Silver badge

Another classic...

Because I worked night-shift looking after a mainframe and a UNIX system, our glorious leader decided that our section would be ideal for offering overnight PC helldesk and tech support to every muppet on the airfield who had a PC. So my mid-shift naps were interrupted by a series of idiot-calls from the Air Force's finest on a routine basis.

Picture a winter's eve. The runway is closed to to bad weather. The snow is coming down and settling heavily. It is now 4am.

The phone rings.

His monitor doesn't work. The light on the front isn't on.

"Is it plugged in and switched on?"

*suspicious lack of any pause to check this before replying*

"Yes, of course."

I offer to wander up to his office to take a look, and he now reveals that he is sat in a building over the other side of the runway. It's either a ten minute walk in the snow, or a nine-minute job of defrosting a vehicle and two minutes of driving over there.

Great.

"Are you really sure that it's plugged in and switched on"

"YES!"

"Ok then. I'll be over in fifteen minutes. I'm going to bring a screwdriver and a hammer. If it's broken, I will use the screwdriver to fix it. If it's not switched on at the wall, I will use the hammer to hit you. Now: Is... it... switched... on... and... plugged... in?"

*actual sounds of rummaging*

"Oh... it's working all of a sudden..."

*hangs up*

Silver badge

I used to hate handing out root passwords or any kind of admin rights to users, regardless of the number of letters from their boss telling me that the said user 'needed' it. I cannot count the number of times I've had to rebuild stuff that users screwed up.

The worse example was when I gave someone rights to install software on his machine. The next day, I was called to his office. He'd tried to reinstall Windows for some reason that was never clear (I suspect he'd deleted the files labelled 'Windows' because they didn't appear to do anything useful). The problem was that he couldn't fit the floppies into the drive any more.

The 'any more' should have been my warning.

A visit to the office showed me that the machine was asking for the user to 'insert disk 4'. The user had successfully inserted disks one through three when requested. Concurrently. Without ejecting prior ones. They were all jammed in the drive.

This was a guy who only 24 hours had received a written testimonial from his boss tersely telling me that he 'knew what he was doing with computers'. Clearly the guy DID know what he was doing on some kind of zen level of pure stupidity which I can never understand... because he'd actually got as far as disk 4 in the procedure successfully!

A few

Story 1: I work at a small regional bank in the US.

As is often the case in a smaller IT shop like mine things seem to either be completely dead quiet or insanely busy, often with no rhyme or reason as to why since the problems that come in are often not related. A PC goes down in one office and not a few minutes later someone else has a problem in another part of the building. After several hours of people calling in with frantic requests for help - since their problem is obviously the most important thing - it can get to even the most seasoned of sysadmins resulting in a increasingly short temper.

So near the end of a particularly rough day, I get a call from a teller who insists that they can not print out a transaction receipt for a customer to their little serial printer. I run through what I think is a pretty good checklist. Is the cable connected to the device? Is the print session to the AS400 open properly? Does it have power? I check the print device on the AS400 host and its online too. This of course all takes time to do but the employee is having none of that and is becoming increasingly agitated because of course if we don't fix it in 20 seconds then we're incompetent.

"Fine I'll be out in a minute to replace it," thinking I'll just swap the stupid thing and worry about it in the morning.

Arrive out there, and the teller and the teller's boss are fuming about this because they had to hand write the receipt, oh-heaven-forbid. I open the top of the little Epson receipt printer to swap out the paper roll and see the problem. "There's no paper on the roll." I tilt the printer so they can see, shut it, and walk out without saying another word.

Story 2:

"My keyboard won't work anymore" the person in the mail room tells me, "So I can't punch in to the time clock." There's few things that upset a user more than not being able to punch in immediately when they get to work. The idea of staying an extra minute or two at the end of the day to make up for it is often met with the shrieks and cries of the damned. Fortunately he called early in the morning so I'm in a relatively good mood and his pained cries for help are easily brushed aside.

"Okay, probably just broke down, we have plenty of spares. Maybe you could punch in from someone else's desk and I'll be right down." Go to the storage closet and grab a spare, hoping the rest of the day will be this easy.

Get there, spare keyboard in hand and disconnect his old one. As I go to throw it in the trash can next to his desk a stream of water comes out of the keys, resulting in a puddle of water on the carpet. "Um, did you spill something on this keyboard?"

"Well, yeah, I spilled some water this morning. Does that break them?"

"Well, computers run on electricity and these are electronics, so yeah."

"oh.."

Users..

Story 3:

Problem with the AS400 host system of some nature and users are in a panic. They can't process loans, can't open accounts, can't do anything. After the first few calls from users we're all getting a little bit nervous I send out an email to all users saying the AS400 is having a problem and we're working on it. The stress starts to increase when a few more issues crop up on other systems since those systems can't talk to the AS400 now either. So the stress factor starts to multiply.

So we're making calls left and right to various vendors and support personnel for assistance. While trying to do these productive things people in another department start bombarding my email with messages about how the AS400 is down. I email them back tell them we're working on it. But the emails keep coming.

After the problem subsides I go up to tell them personally and directly that everything's working now. They are all laughing and said they know they were just pulling a little joke on me.

So the next morning before they came in I went up there with some tools and pulled all but one wheel off their rolling chairs. This resulted in security being involved in trying to figure out what happened and multiple emails going around asking for information. I let those emails circulate for a while and then told security what occurred and where to find the wheels. I left them behind the desks trash can of each user. I wasn't written up and that group didn't call the help desk for about two weeks after that.

In my first job technically before I became a sysadmin, I worked at a top UK university. One day a female computer science professor came in holding a rather smelly and damp-looking Apple keyboard. 'It's broken' she said. 'How' I asked. 'Well, I threw up on it this morning and I figured the acid might damage it so I hastily tried to clean it off' she replied. Okay... 'What did you use to clean it with?' I asked thinking she'd dunked it in water or something marginally less stupid. 'Cooking oil', she replied. 'Hmmm, maybe we need to get you a new one then...'.

Don't you love academics? I had a few people wash laptop keyboards under a tap as well, with the laptop still attached of course.

Don't you love academics?

No.

competition is only for US citizens

which is a shame, as I can sympathise with your mate Bob

Anonymous Coward

No RAID

This one almost sounds like an urban legend. I wouldn’t believe it myself, had I not run into the handy work of the ‘IT Guy’ in question myself. It happened before I started working where I am now, so isn’t my own story.

When taking over a large amount of outsourced IT work from another company, it was discovered that some of the servers our company were now looking after happened to be running on single disc drives. When the previous ‘IT guy’ was questioned about this lack of redundancy he absolutely insisted he’d done his homework and fitted mirrored discs. He even went to the trouble of opening up one of the systems in question and pointing to the polished (as opposed to brushed or painted) finish of one of these ‘mirrored’ drives......

"It happened before I started working where I am now"

is another way of saying that you swallowed somebody's no-shit story all the way up to the sinker. Hey, we were all young once.

Pint

Sales Guys

So I've always had a mish mash of roles but in one role where I was doing Server and desktop support we had the sales guys running the show, selling projects we couldn't possibly deliver in the time frames they promised without days of unpaid over time and the blame on US if the sales failed despite our constant protests.

Karma however, paid a visit one day.

One of the most troublesome Sales managers pops over one day, the same guy who pocketed a 6 digit bonus for selling something (read, spent a fortune on the company card at titty bars and the pub) which us IT guys worked 3 months of stress filled overtime to deliver without so much as a cheers guys....came round at 10am.

He DEMANDED a brand new laptop, built to X,Y,Z Spec, with PowerPoint installed and thoroughly tested with a VITAL sales presentation he needs to present at 2PM today, leaving the office at noon.

We got it all setup, tested, spare batteries, spare power supplies etc etc.

Handed it over but he insisted on powering up to test, fine, he does so and is happy.

2PM I get a call, Answer to instantly get a 100 decibel stream of abuse down the line from Sales Nubbin that the laptop isn't working.....

Apparently he's arrived on site, has me on speaker phone in front of the clients and wants me to explain to them why he can't show them the demo...

OK whats the laptop doing.

Nothing, it just beeps.

Beeps? ok hold the phone to the laptop so I can hear it...

That sounds like power...I'm guessing you didn't power it down when you put it in the car, is it plugged in?

OF COURSE IT'S **** plugged in!! do you think i'm a complete idiot!!

And where it's plugged in, is that switched on..

OF COUR.........*hangs up*

Well that made my day.

Bronze badge
Facepalm

Re: Sales Guys

I've always enjoyed laying a little karma down on the salesdroids.

Major PC maker I worked for owned a crap video card maker -- they couldn't write video software to save their lives, and it was clearly unsuitable for corporate use (or any other for that matter). Naturally, businesses stayed away from it in droves. Naturally, since the margins on crap video cards is so good, they'd offer the corporate salesdroids spiffs to move them. (I remember one spiff was a speedboat -- the margins were good indeed) In support, we'd get hourly ass-chewings over these cards, and in desperate cases, the other techs would ship out video cards made by competent manufacturers to replace them. Then they'd endure the ass-chewings from management about containing costs.

Not me. I'd carefully explain to the user about spiff's, crap video, what makes good video, track down their salesdroid, and drop the hot customer on the 'droid's line. It didn't take long for the salesdroids to decide that no spiff was worth that kind of epic ass-chewing.

It was good for another teaching moment, too. One tech I talked to was going to order his own PC, with one of these cards. I asked him: "WHY, for the love of G0d, WHY?!?!" He said "Because I know every way possible to *fix* these cards. I asked him why he didn't use the other brand of card. He said "Because I never get calls on them." I just sat there smirking at him until he figured it out.

Thought for a second you said 'spliffs'

But that's not really a salesman's drug, is it? More cocaine or methamphetamine, really.

Many years ago, I worked in 3rd Line Support for a reseller. We had internal IT for our internal systems, but as usual, when there's a problem, "you're a techie - fix it" is usually the way forward.

Arrived at work before 9 o'clock one Monday morning to be greeted by the Sales Director to inform me that no-one could access their email. Popped into the server oom, and found that an entire rack was down. I quickly disvovered a faulty power strip, and as there was already a spare the other side, began moving all plus across to it.

I had to connect the new power strip under the floor of the server room, so called one of the junior members to help with the lifting of tiles, and to make sure no-one fell down the hole when the floor was up. It all went well, and a short time later the rack was booting up nicely, so set off to leave the server room with the junior member to inform everyone that all would be well in the next few minutes.

Now, to leave the server room you had to hit the door release button next to the door. Unbeknown to the junior guy, next to it - under a cover that had to be lifted - was also an emergency shutdown panic button that would kill power to the whole room. Yep, you guessed it - he lifted the cover and hit what he thought was the door release. 20 server racks powering off left the room feeling ever so silent!

Cue spending the next three hours getting all of the systems up from scratch. :(

Silver badge
Facepalm

You made me jump!

I once worked in a freezing cold server room, manning a few terminals with a co-worker. It was all UPS protected of course, but the lighting wasn't, and there were fire control systems. One such file control system was some form of fire-door in the air conditioning system. It was clearly large, heavy, and metal. It was also clearly located right above our desks somewhere.

As a result, every time there was any form of power outage and UPS kicked in, there was this enormous *BANG* above our heads, and we were plunged into darkness for a few seconds.

My oppo and I were doing some fairly critical work. I can't remember what exactly. But I remember the darkness... I remember the crash of the fire-door... but the most terrible sound of all was - in the wake of this cacophony - the quiet "oops" of my co-worker, as he'd jumped at the noise and spasmed on the keyboard.

He'd only broken the whole thing with an inadvertent combination of key-presses. It took us the rest of the shift to fix.

Anonymous Coward

Re: You made me jump!

A very quiet 'oops' is when you know things have gone properly tits up.

Re: You made me jump!

Either that or a loud OH SHIT followed by frantic keyboard noises.

I like the idea of a steel fire door as a "file control system" -- apply it to the problem fileshare users often enough, and those gigabytes of MP3s just vanish like magic...

General hospital

I once had to spend a weekend installing servers at a hospital. The on-site tech told me the story of their nice new shiny server room.

They got all the servers in racks, rolled them in, and went to turn them on. Hmm, something not quite right here - no-one's installed the power yet!

OK, roll them all out, install power. Roll them in.

Hmm, no network! Bugger. Roll them out, install network, back in.

Hmm, it's getting rather warm in here. Ah, maybe we need air-conditioning?

A few weeks later, air conditioning installed. Now we're getting somewhere.

Except the air-con wasn't installed properly and flooded the server room several feet deep in water.

Roll the servers out, dry them off, install a drain in the floor just in case, roll them back in. Yes, the dried-off servers still worked. They're braver than I am re-installing flood-damaged servers but that's maybe that's the way they do it in the NHS?

Oh the joys of good server room design!

Re: General hospital

Well, i hope someone had this story on hand when my boss insisted on a drip pan under the pipe joint running over our ERP rack in just built server room. Everyone looked at him like he's crazy.

Facepalm

Oh well....

I work in a school as the "Network Support Officer" That means I am the SOLE IT bod in the secondary school with 16 servers and 300+ workstations, I maintain AD, Hyper-V, Exchange, IIs, VLE etc etc etc.

I am also the first responder....

I was once called to a classroom by a pupil who said "Miss can't use her mouse!!" I asked for more specifics but none were given. I gave the child a brand new USB mouse and off they ran. 10 minutes later while I was cleaning out a laptop that had clearly been dumped in the sandpit, the child came running back. "That mouse does not work either" so I plugged the mouse into my test rig, and it worked!! I then sent the child back to the classroom with instruction as to WHERE the mouse needs to be inserted.

10 Mins later, the teacher calls me

Teacher: "that mouse is broken",

Techie: "Are you using the laptop?",

Teacher: "Yes"

Techie: "have you tried the touch pad?"

Teacher: "yes"

Techie: "what happens?"

Teacher: "Well the mouse kinda moves then snaps back to the centre of the screen".

Techie: "What happens with the mouse plugged in?"

Teacher: "Exactly the same!"

Techie: "Ok I am on my way"

So I am now proceeding to the classroom to determine why the cursor would snap back to the centre of the screen. I arrive at the classroom with the teacher and whole class watching me with anticipation as I enter the room.

I take one look at the classroom and immediately identify the problem, but as I have been dragged away from my desk i feel I do not need to share my findings just yet.

I proceed to the desk, moving past the SMART board, and the object that was leaning against the SMART board. I had to move the object, while not making it obvious that is what I was doing!!

I then proceeded to demonstrate that not only did the mouse function correctly, but the touch pad worked fine as well. Having done so, I then proceeded to allow the teacher to try it, and they were indeed happy. I therefore left the room as I had found it, complete with object leaning against the SMART board.

5 minutes later, a slightly exhausted child returns to my office declaring that the mouse was broken again!!

So I repeated my actions, moving the object that was leaning against the SMART board, and showing that the mouse was functioning correctly. The class now seemed to think I had magical powers and that I was some kind of messiah towards computers. Surely I could not be fixing things by simply laying my hands upon them? I kept my smile to myself as I replaced the object that was leaning against the SMART board.

5 Minutes later the child appeared at my office once more. Feeling sorry for the child, I decided to pass on my mystical powers. I passed on the ancient secret of the SMART board.

The child returned to the classroom and moved the object that was leaning against the SMART board. He moved it all the way to the back of the room, held it above his head and said to the teacher "try now miss" and it worked. The whole class clapped the young boy.

Later that night, 4 bottles of Peroni arrived at my office with a note from the embarrassed teacher, "Sorry, and thanks for not making me look an idiot"

I drank the Peroni without any guilty feeling. That teacher will now always check to see if anything is leaning against her "touch sensitive SMART board" before she blames her mouse or lack of ICT support.

This is tale 1 of too many!!

Paul

This topic is closed for new posts.