Follow Slashdot blog updates by subscribing to our blog RSS feed

 



Forgot your password?
typodupeerror
×
Technology

The New IT Crisis 354

Matey-O writes "You've succeeded in delivering 5 9's, your server farm is a well oiled machine, the helpdesk lines lie dormant. No? Well then how do you get credit for the work you do, when all that's noticed is the downtime? When the IT budget has to be justified, and you're overworked, undermanned, and you have to apply three patches to 100 servers before Close of Business, what has to change in IT before we melt down? Marc Andreessen has an interesting article on what has to happen to IT next."
This discussion has been archived. No new comments can be posted.

The New IT Crisis

Comments Filter:
  • by SteweyGriffin ( 634046 ) on Saturday December 14, 2002 @01:16PM (#4887441)
    Well then how do you get credit for the work you do, when all that's noticed is the downtime?

    I feel that if I work hard (and smart enough), then I deserve free time every once in awhile. After all, I earned it.

    But, managers don't understand this. So, I relax by reading The Onion or Freshmeat at work, but always make sure my hand can quickly hit ALT+TAB to get back into my work window (usually Emacs).
    • My old approach was to say to the boss "What ya gonna do? Fire me? I'm too low paid and I claim slacktime NOW." Wierdly, it usually worked and I got my slack off time. Turns out he needed me more than I needed him.
      • "My old approach was to say to the boss "What ya gonna do? Fire me? I'm too low paid and I claim slacktime NOW." Wierdly, it usually worked and I got my slack off time. Turns out he needed me more than I needed him."

        I usually just try to always look busy. I have such a reputation for hard work, that in the (rare) slack times I can usually screw off and get away with it.

        But you get to that point by ACTUALLY working hard when there is work to do.

        Thing is, when you are a network admin for an enterprise, there is ALWAYS work that you can be doing.
        • by Dysan2k ( 126022 ) on Saturday December 14, 2002 @03:53PM (#4888169) Homepage
          God.. I pity you peeps..

          Seriously, any time I go to work for a new company, the first 2 months are spent automating everything across the board. I'll generally get no sleep quite a few nights writing software, and when I'm done, I just tell management that because of errors in the past, all work orders will be processed through this new interface. I give 'em an internal web addy and then go play Quake/EQ/etc. for the rest of my time there (which has never been less than 1 1/2 years which I get vested options from). The websites handle all the dummy checking, logs all processes to another system to check periodically to make sure all works well, and performs whatever request they want.

          I order spares for about every piece of equipment in the building including spare switches and 1-2 spare servers for the occassional *frying* motherboard. If the servers are set up in non-redundant fashion, I make sure load balancers (or happy cisco 6500) are ordered, every server has a backup, all backups are automated and working properly, and put in the DR (Disaster Recovery) report JUST incase they want to go that route.

          Frankly, I have MAYBE 1 hour of downtime a year, and that's usually attributed to my tripping over a cable that some numb-nutz (who's gonna get chewed out for an hour) layed outside of the wire maintenance tray. Only reason I move jobs is because I just get WAY too bored. Admining is easy for those who aren't inept.
          • by jobugeek ( 466084 ) on Saturday December 14, 2002 @06:00PM (#4888672) Homepage
            Aparently you have done a pretty good job, finding companies with fat budgets and no politics.

            Seriously, I don't get the money to buy redundent Cisco routers. I don't get to buy extra switches and motherboards.

            And the little internal web addy, that has been in place for 2 years? Some tool of a manager who is having a hissy fit, just walked into my office and asked why I haven't fixed his problem that he never told anyone about.

            Please......

            Whenever you are dealing with people who have almost as many deadlines as you do are involved. Things are never that easy.

            Now where I will agree with you are scripts. Learn them. Use them. Love them. But do you have an IDS system? Who verfiys those alerts? Who checks your server logs for descrepencies? Admining is rarely easy.

          • I'm busy... (Score:4, Insightful)

            by phorm ( 591458 ) on Saturday December 14, 2002 @09:02PM (#4889508) Journal
            Patching servers, fixing machines, crawling under desks. If you want a job with a fair bit of action, work at a school. If there's anyone who can find a way to screw something up, or do something unexpected, it's a 15-yr-old with a keyboard and mouse. That, or it's my 57-yr-old grandparents that somehow magically manage to screw things up that should be unscrewupable.

            Combine that with archaic computers, underfunding, etc etc... and it's an exciting job.
            • Re:I'm busy... (Score:3, Insightful)

              by buffy ( 8100 )
              Allow me to add to that...work at a university. The "abilities" of tenured professors and the sort are fairly impressive.

              Of course, by abilities, I'm referring to them in their destructive and or disruptive potential. :(
    • by LostCluster ( 625375 ) on Saturday December 14, 2002 @01:57PM (#4887660)
      If you're doing an IT system properly, whoever's assigned to supporting it should have some of those "Maytag Repairman" moments where they have nothing better to do than surf the web just to stay alert. This isn't because he's goofing off, it's just that he's done everything on the preventative maintenence checklist, and during his time alloted to responding to crisis, there is no crisis to deal with. The system is working, nobody is complaining of a downtime or slowdown.

      What you're really doing is not wasting time, you're sitting in an "on call" position waiting for your phone to ring with the next big crisis. If that call never comes, then you must have been doing something right when you were working hard.
    • Mind you virtually no one is fooled by the alt-tab: When everytime you walk near somebody their screen flickers as they swap windows, or they fervently hit the keys whenever someone nears, you know that they are slacking, but to many people the thought process will be that they're looking at porn or playing a game.

      If you aren't willing to do it as "research" (for instance as funny as it sounds Slashdot can be a work-related website for many tech firms. It is "putting your ear to the ground" in a sense), then I recommend what another recommended which is that you browse in a manner that is inconspicuous and requires no screen swapping: Lynx, or grahics-less.
    • by The Spoonman ( 634311 ) on Saturday December 14, 2002 @07:28PM (#4889110) Homepage
      I know exactly what he's writing about. I was just fired from my job of two and a half years. The reason? Check my sig. That's right, we started a personal website in our own time, with our own reasources, and because the CEO didn't like it, we were out. "Decreasing company morale" was the reason we were given. They fired me, who was the Sysadmin, the webmaster, and the only PC technician.

      Forget the fact that for the last two and a half years, I haven't had a real vacation, because I got a call EVERY FREAKIN' DAY about some minor issue.

      Forget the fact that I worked a minimum of 2-3 hours every night on company stuff, 'cause I wasn't allowed to make ANY changes during the day. (The night before I was fired, I spent 3 hours writing a script to fix a problem that was affecting only 1% of the users. No real problem, but I didn't want them to have to deal with ANY issues.)

      Forget all of that, just get the fuck out. I take some solace in the fact that two days after they fired us they sent out a global e-mail of "Please bear with us, it'll take up to five days for your calls to be answered." And, e-mail was down for almost a week because no one but us knew how it was setup!

      IT needs to get the respect it deserves. In this era of decreasing budgets, the only way companies will be able to make any money is to increase efficiency, and that means automation.

  • by PissingInTheWind ( 573929 ) on Saturday December 14, 2002 @01:20PM (#4887458)
    capitalize on other's work!

    It's the easiest way to success.

    Details here [chrispy.net].

    Quote: it's not as if he needs to sit down in front of his screen and busy himself with the notoriously arduous task of hacking out a few lines of software, which, astonishingly, is something he has never done in the short but spectacular history of Netscape Communications Corporation [...] Not a single line of computer code. Never.
  • by Gabrill ( 556503 ) on Saturday December 14, 2002 @01:21PM (#4887463)
    Blame it on management. If that fails, then blame it on the budget. If that fails, then blame it on HR. If, by that time, everything comes back to your fault, then you should have had enough time to land a new job.
  • by MoceanWorker ( 232487 ) on Saturday December 14, 2002 @01:22PM (#4887464) Homepage
    and you have to apply three patches to 100 servers before Close of Business

    Not flaming, but a fact.. Microsoft SMS will get that done for you with a couple of clicks.. literally..

    And we're expecting to receive a beta copy of SMS 2003 (incl. Active Directory, the big feature).. so i'm kind of looking forward to that..

    I have to say.. even though we run 2000 on our servers.. SMS is a dam good product and a timesaver!
    • by mcrbids ( 148650 ) on Saturday December 14, 2002 @01:52PM (#4887639) Journal
      Another solution? RHN!

      Red Hat Network allows you to do the same - a couple of clicks, and away you go!

      The advantage is that RHN is here now, and works very, very well. I've never seen a Red Hat Network update break anything. Ever.

      And they are quick about getting updates ready!
    • by LostCluster ( 625375 ) on Saturday December 14, 2002 @02:10PM (#4887715)
      Why don't we just go back to the mainframe way of doing things?

      When users had dumb terminals on their desk, they had the illision of a full power computer, but it was actually a small box with few moving parts that was linked to high-powered computers (or cluster of computers) who were actually doing the heavy lifting. Since all of the functional components were in the Computer Room, there was rarely a need for tech staff to touch the dumb terminals, and the tech people could work in their own distraction-free environment.

      What's more, failures could sometimes be abstracted away from users. Hard drive failures happen, that's a fact of the technology. However, if a HD fails on a user's desk, it means that user has lost the use of their computer until it's fixed. If an HD fails in the datacenter, there's usually a backup of the data which can be put into play immediately by mounting the backup on a good drive that's already spinning. To the users, the disk crash can be practically invisible.

      There's already tech technologies such as X-Windows and Windows Terminal Service with which to create GUIs on a dumb terminals. Why does the common secretary need a full-powered PC on her desk anyway?
      • Re: (Score:3, Interesting)

        Comment removed based on user account deletion
      • "When users had dumb terminals on their desk, they had the illision of a full power computer, but it was actually a small box with few moving parts that was linked to high-powered computers (or cluster of computers) who were actually doing the heavy lifting. Since all of the functional components were in the Computer Room, there was rarely a need for tech staff to touch the dumb terminals, and the tech people could work in their own distraction-free environment"

        This exists in the PC world. Citrix. I have 2 clients of my company that I support who uses this.

        One uses Citrix running off two powerful new 2K servers to avoid replacing armies of Pentium I desktops.
      • by Catbeller ( 118204 ) on Saturday December 14, 2002 @05:57PM (#4888662) Homepage
        As a guy who got his first start programming on an IBM mainframe, I just want to break down in hysterical laughter.

        Early nineties: everything was to be decentralized. Get rid of that antique iron! Go Intel/Microsoft! Can the tech Luddites, bring in the Bright Yunguns, who will show us the way.

        Mind you, I was one of those calling for the company to modernize on a client-server model.

        But I've watched for, what, 20 years since this all started. I watched the expensive mainframes go into the landfill, and the new servers take their place.

        I've watched those servers get more expensive. I've seen the communications hardware become massively expensive. I've watched the new Data Priesthood come into being. I've watched companies being wiped out converting to, and maintaining, the new paradigm.

        Not that it would have been good to stay on the old iron. BUT...

        We've come full circle. The hounds are baying for simpler solutions, and most importantly, for the elimination of the new mainframers, the admins.

        I predicted it over ten years ago. The PC's will cluster into data centers, pseudo-dumb clients will spread onto everyone's desks and laps, the services will automate, and the present IT industry will convulse. A shakeout is coming. Time to get that teaching career started.

        As a sidebar: in my experience, far more money was spent on executive games and perks than was ever lost on IT spending. Cost accounting is applied where management wants application.
  • My 2 cents (Score:4, Insightful)

    by WetCat ( 558132 ) on Saturday December 14, 2002 @01:22PM (#4887466)
    Create presentations, monitoring
    systems with a lot of 3d and 2d diagrams
    and histograms.
    Bosses love that.
    • You only get two cents for that? Indeed, the situation for IT workers is getting worse...
    • MRTG combined with SNMP, essentially. I showed two of my bosses this and they flipped out at all the pretty lines. I guess that's all I need to do! ;)
    • Just be careful not to show him the process. Our IT dept showed me how he generates network presentations and documentation with an SNMP based program that polls all the devices on the network and automatically spits the results to a Visio doc. A complete network inventory across four cities to a CAD diagram in less than five minutes. The real IT crisis might be the terrific software tools being developed to make your job easier.
    • Re:My 2 cents (Score:3, Informative)

      by Mr_Person ( 162211 )
      Well then you should try Nagios [nagios.org] (Used to be Netsaint). Does all sorts of monitoring and 2D maps of systems (and their status) or a 3D VRML map! Doesn't do histograms though... In all seriousness though, it really is a neat piece of software and very handy.
  • by silvaran ( 214334 ) on Saturday December 14, 2002 @01:23PM (#4887474)
    Since when do telephone systems maintain themselves? Last I heard, my voice was still running along wires and branching about in switching stations. Unless I'm mistaken, they still require maintenance. Granted that the operating costs have been reduced (fewer operators = fewer wages), but you could take a similar stance on IT. What about self-help forums where you can search a knowledge base to find answers? These can replace a lot of man hours of technical support work. There's always going to be some kind of human element to whatever equation. We're never going to find empty power plants that can generate their own electricity indefinitely, because there's always going to have to be human intervention.

    Some businesses demand complex solutions, and I fail to see how these complex solutions are going to be met by turnkey solutions -- where a manager can go out, purchase a server, turn it on, and have it run his business for a year without any kind of customization whatsoever.
    • If you are, so am I. (Score:5, Interesting)

      by __aaahtg7394 ( 307602 ) on Saturday December 14, 2002 @01:38PM (#4887564)
      I think a better point to critique on his phone analogy is the implied point that the phone system isn't held together with "bailing wire" or "chewing gum." Nope, it's all pretty standardized, well-integrated equipment. Why is the phone service so much more "professional" than IT services?

      Because phone service is a relatively well-defined, consistent, limited problem domain. Internet servers, dynamic web sites, and local security are loosely-defined, constantly shifting, open-ended problem domains. They're very different, and you can't compare one to the other.

      However, for certain applications, there are well-defined standards, and well-defined practices. Still, for a lot of IT, it's a matter of custom engineering and architecture. For example, online content management: you can buy one of the management engines off-the-shelf, which will probably do most of what you need in a structured manner. For CRM, well, there's about a dozen of those. These packages are well-behaved in that it provides a well-defined interface, but that's not always an option (i know, i used to do data migration for small- to medium-sized businesses. at the low end, when you change systems, you'd better damned well know perl or some other text processing language to massage the data--that is, you need to be good with your bailing wire).

      In the future, this situation will hopefully be better with standardization (mostly using XML it seems, even though the actual encoding doesn't really matter.. we could have standardized years ago, but nobody saw any benefit then). Having done data migration in the past, i'm all for keeping things disparate and non-standard, but that's because the work pays well and is fun ;^)

      A better analogy might be a pool of corporate autos. Except that you don't have to interconnect the cars to get them to share load dynamically, or access content generated on one to form a report on the other, etc. A lot of IT is like trying to drop a big old hemi into a metro, or getting a suburban to go anywhere with just metros to provide power (two in front and one in back, it might go up a hill!).

      Overall, I was not impressed with this article, but I'm afraid it's going to carry more clout than it should. oh well.
      • by wfrp01 ( 82831 ) on Saturday December 14, 2002 @09:08PM (#4889537) Journal
        Posolutely. I couldn't agree more.

        I'd even take this a step further. Let's just say, for the sake of argument, that we reach Marc's promised land of "it just works" hands off IT administration. Then what?

        Marc is missing the point. IT is not a necessary evil, it's a competitive advantage. I've sometimes told people who spend too much time attributing their problems to their computer that they should stop using it then. If you can do your job better without the computer, then by all means do.

        I have yet to see anyone take me up on this challenge. They know, we all know, that despite their occasionally infuriating peccadillos, that computers make us more productive.

        You cannot remain competitive by sitting on your hands. Marc's world of "it's all better so we can rest now" is pure fantasy. The problem with this article is that it portrays IT as a problem, rather than a solution and a competitive advantage. Who wouldn't like to compete against the company that decides it's achieved all essential IT objectives? You want to get your ass kicked, then stop trying to figure out better ways of doing things.
    • Bummer. IME, that's what managers are looking for in these economic times. Of course, it doesn't exist. Just ask Sales, they'll be glad to tell you.
    • by mickwd ( 196449 ) on Saturday December 14, 2002 @02:20PM (#4887754)
      "Since when do telephone systems maintain themselves?"

      It's not just telephone systems.

      Think about almost any complex engineering system. Many are computer-controlled nowadays.

      But almost all of them, even those that aren't computer-controlled, are taken for granted by everyone. Transport, power generation and distribution, warehousing, running industrial plants, you name it.

      When was the last time you praised the people that keep all these services running ?

      But when they go wrong - the power goes off, there's no produce on the shelves to buy, roads are closed or flights are badly delayed - there's millions of people jumping up and down and demanding the situation be sought out straight away.

      When IT isn't a hobby or an "entertainment medium" it's nothing more than a tool to get a job done. I know how difficult it can be to work in IT, but I've got to look at some of the responses here, compare them to the jobs of people working in some other industries, and think: "what a bunch of whingers".

  • Meatless drivel (Score:5, Insightful)

    by iceT ( 68610 ) on Saturday December 14, 2002 @01:24PM (#4887476)
    Ok. Just becuase it Marc Andreessen doens't mean that it's news.
    This is an editorial, not an article.

    How about talking about Grid computing. Or Organic IT. Developing systems and monitoring capabilities that go beyond telling you things are down, or when they're too busy. They actually add capacity on the fly through virtualization, taking from inactive systems to cover for the active system.

    I think Marc once had a vision, but I'm not sure how strong of a visionary he is these days...
    • Re:Meatless drivel (Score:5, Informative)

      by sql*kitten ( 1359 ) on Saturday December 14, 2002 @01:40PM (#4887573)
      Ok. Just becuase it Marc Andreessen doens't mean that it's news. This is an editorial, not an article.

      You mean Marc Andressen of Loudcloud, who sell server and datacentre management services, and who are desperate for revenue? That Marc Andressen?

      It isn't even an editorial, it's free advertising disguised as a story. I hope they're paying whoever owns Slashdot this week for it.
      • Useless Drivel (Score:3, Informative)

        by hemp ( 36945 )
        LoudCloud sold all of the data center hosting business to EDS. His company now develops software to 'manage more efficiently' those datacenters they couldn't make any money on. Obviously, the business model is 'if I couldn't do it and make $$$, then no other company in the world can either and they must be willing to pay me lots of $$$$ to allow them to'.

    • Re:Meatless drivel (Score:4, Interesting)

      by selectspec ( 74651 ) on Saturday December 14, 2002 @02:03PM (#4887685)
      No doubt. Andreesen is trying to fabricate a crisis so he can save his worthless company, Opsware (LoudCloud), from insolvency and his career from the where are they now file.
    • Re:Meatless drivel (Score:3, Insightful)

      by HamNRye ( 20218 )
      s/Meatless Drivel/Total Bullshit/;

      Aside from this essentially being an advertorial, there remains the fact that he presents a problem, (kindof) and then presents no solution.

      Auto Mechanics, tired of only being appreciated when that car breaks?? Try fixing it before it breaks! Don't ask how.

      Firemen, tired of only being appreciated when someone rams a plane into a building?? Try just driving around town hosing everything down.

      Now, lets talk "DataCentre Management tools". My experience is this: You need to hire two extra geeks just to get it running, It never really works that well, and spending that money of some clustering and a good perl guru would have a more drastic effect on your uptime. I personally don't have a Unix box that has not maintained 5 9's in availability over the last 4 years. I have a few NT servers that can boast the same.

      Products like these are only supposed to appeal to the shirts in a business. You know, the same guys who get freaky about "free" software.

      And to think MS once feared this guy....

      ~Hammy
    • by llywrch ( 9023 )
      Isn't this the same Marc Andreessen who once responded to a criticism of all of the new bugs in Netscape with ``Hey, we don't have the time to do it right"?

      And if you use software that is written to ``do it right", won't uptimes of 5 nines happen by itself?

      I'm just one guy who spent a year trying to get Andreesen's software to work on hundreds of computers. I really don't have any opinion about this multi-millionaire.

      Geoff
  • Opsware? (Score:5, Interesting)

    by RoyBoy ( 20792 ) <roy&sanwalka,org> on Saturday December 14, 2002 @01:24PM (#4887478) Homepage
    Ok seriously, does anyone RTFA anymore? How about the comments? This is a clear PR stunt aimed at producing more leads for Marc's new company. And ZDnet, that fine bastion of even-handed IT reporting, has once again saved us all by printing only the relevant facts. Just once I'd liek to believe that one of my old IT heroes didn't sell out and become a corporate whore (can you say RMS anyone?).
    • Moderated Articles (Score:5, Interesting)

      by md17 ( 68506 ) <`gro.drawsemaj' `ta' `semaj'> on Saturday December 14, 2002 @01:51PM (#4887628) Homepage

      I am not typically supportive of /. bashing, but recently the number of quality articles has gone way down. To go one step further than just plain old bashing, I have a suggestion... Can we start moderating articles themselves so that I can browse articles at +3 on a normal day, +4 on a busy day and +5 on those insane days?
    • True, this is PR for his company (which WAS Loudcloud, a mega-ASP, until it seems they decided that market sucked, and now they want to make large-scale server management tools). Frankly, I think there is a very real market for their product, though, and I think the issue goes beyond PR. Look at some of the big players (IBM for example) with initiatives in what they call Autonomic Computing. It's pretty clear all around that as IT becomes a commodity, if you can lower the costs of managing servers by reducing the amount of human intervention required, you can make a big win in this space.


      However, as I see it, there are two categories of features: those that are fairly generic (like all the features the Opsware guys have) seem naturally amenable to integration into the Operating System - patch deployment, app deployment, app tracking, and so on. Making these products for Windows seems like a losing proposition since MS will just assimilate the features into the next server OS release. But I would say there's a robust market for such products for the Unices.


      And then there are features of software itself that needs to be self-managing and self-monitoring. This is substantially harder - we tried to make an application at my last company that would control its own clustering, assigning workload to application instances in an app cluster appropriately (it was the kind of thing not amenable to some kludgy EJB-type solution - it needed real workload partitioning at the application level). Better software toolkits are needed to support this, J2EE just doesn't come close. But I don't know if anybody will make money making such a toolkit. Perhaps.

    • Re:Opsware? (Score:3, Insightful)

      Glad you mentioned it. Yes, this is just Mark pushing his own product (Datacenter management tools). In fact, he already has a few big customers. And this is the same direction Sun, for example, is going with N1. But it is going to have problems.

      You can't abstract away problems. And software is designed to be customized more than plug-in and exchange like lego blocks. Further, the 'leaky metaphor' problem hits these kind of systems big. (An example from a previous article, "UDP does not guarantee data delivery, and certainly not in the correct order, but TCP/IP does." So where is that guarantee if I cut a 1 foot break in the ethernet cable?)

      Software is going to have to get a lot more dumb for this sort of thing to work.
      • Software is going to have to get a lot more dumb for this sort of thing to work.

        And why shouldn't it?

        Having the ability to view VBA-enabled e-mail in Outlook is a feature that few office workers appreciate, but enables so many headache-causing virus situations.

        Having a 20 GB HD on everybody's desk allows them to save their data locally intead of on network shares where it is easier to back up. HD failures are much easier to abstract when their is a RAID system running in the back room instead of a HD on every desk.

        In fact, how many clock cycles is the average company wasting by having a gigahertz chip on every desk? Wouldn't it be easier if all the high-powered chips were in the data center, timeshared among those who need them at the moment?
        • Re:Opsware? (Score:3, Interesting)

          > Having a 20 GB HD on everybody's desk allows them to save their data locally intead of on network
          > shares where it is easier to back up. HD failures are much easier to abstract when their is a RAID
          > system running in the back room instead of a HD on every desk.

          Great. Now you've got to abstract the network reliability and performance problems. Abstract the lack of privacy, too, while you're at it.

          I think these 'magical solutions' try to avoid the appearance of trade-offs. They give the false impression that whatever is the method used is going to be perfect. And local storage vs remote storage is a good example. Assume a magical datacenter management tool gave me an option between the two. How would it abstract the weakness of one vs the weakness of the other?
      • You can't abstract away problems. And software is designed to be customized more than plug-in and exchange like lego blocks.

        You are exactly right. I believe Grady Booch said it best: The task of the software development team is to engineer the illusion of simplicity. The corollary of that is that you can never eliminate complexity, you can only shuffle it from one part of a system to another.
  • by PhysicsGenius ( 565228 ) <physics_seeker.yahoo@com> on Saturday December 14, 2002 @01:25PM (#4887484)
    You've succeeded in delivering 5 9's, your server farm is a well oiled machine, the helpdesk lines lie dormant. No? Well then how do you get credit for the work you do...

    If your servers aren't up with 5 9's and people are always calling in with problems, why do you assume that work as been done that needs credit?

    My servers are always up. I proactively manage them to keep them up (rather than "bring them back" up). My users are contented because a) I've trained them to bring me projects before they need them and b) I give them what they need to do their jobs.

    If you aren't doing the same, you suck as a sysadmin, there's no two ways about it.

    • by Kissing Crimson ( 197314 ) <(moc.edahsnosmirc) (ta) (ysenoj)> on Saturday December 14, 2002 @01:31PM (#4887521) Homepage
      (troll my ass...)

      I agree. In every sysadmin job I've had, I made sure from the day I was hired that people understood that the less they noticed me - or my work - the better I was doing my job.

      Sysadmin fix problems. Good sysadmins make sure problems don't happen.
      • by marshac ( 580242 ) on Saturday December 14, 2002 @03:28PM (#4888056) Homepage
        Where I work as a sysadmin, our 'business' lives and breathes on our IT infrastructure. Despite this, we are down several times a month. Why is this? Shoddy systems administration? No. I work for the government, and everything we do has to be approved. The chain of command extends into the server room. Every dime we spend has to be justified and even then, is usually denied. I need to buy new DLT tapes, but can't get any money. I had a 9GB 10k SCSI drive fail, and it took 2 weeks to get the money to get a new one!

        And what happens at the end of the year? We go on a spending spree. Our boss goes crazy spending.... last year, they bought iPaqs up the wazooo.... not because we needed them, but because they had tons of money left.

        I have written memos, proposals.... and it all falls on deaf ears. So I ask you, how am I supposed to give everyone else the tools to do their jobs, if they don't give me the tools to do mine?
    • I think maybe I oversimplified for the blurb at the top of the story. In the last three years, our 12 member team has:

      1. migrated to a firewalled, fiber-gigabit, switched network for 1200 machines and servers from a flat-unprotected-hub based one covering multiple locations

      2. migrated said machines from win95/W2k to Windows XP

      3. Brought up Active Directory to manage the enterprise.

      4. grown from a dozen servers to more than 100

      5. brought up a 2TB NAS system with backup, migrating from Novell.

      6. Gone from a single static webserver to multiple clustered web servers, with data served up by clustered SQL servers.

      7. Migrated away from Lotus Notes to an Exchange based email system.

      (on a pro-linux note, we ARE investigating how/if it could assist. Starting in Intrustion Detection and Monitorin)

      Now that we've done this, most servers have a great record for uptime, but the maintenance is starting to get hecktic, and it doesn't look like it's going to slow down any time soon.

    • by sql*kitten ( 1359 ) on Saturday December 14, 2002 @01:48PM (#4887607)
      If your servers aren't up with 5 9's and people are always calling in with problems, why do you assume that work as been done that needs credit?

      I can honestly say that none of the servers I look after (running Oracle databases) are anywhere near to 5 9's... and no-one cares. Why? Because no-one who doesn't explicitly have to cares about servers, they care about application availability. If you're on Unix or NT, aiming for 5 9's at a server level is counterproductive, because it means you fall behind on regular maintenance like applying patches that require reboots (yes, Unix has to be rebooted a lot too). Aiming for 5 9's application availability through clustering should be your goal, and if your clustering solution is any good it will happily tolerate slight version mismatches across its nodes so you can smoothly roll out upgrades without compromising service.
      • As a former UNIX Systems Manager, I can tell you that when I started in the position, *yes*, the damn UNIX boxes were rebooted alot. To "clean out the system", to "reset the database", to "run a scheduled job weekly at Friday 5pm when the server restarts".

        One by one, I eliminated these.

        Packaged software (read: binary only, proprietary) that leaks memory gets fixed - or gets tossed out. Same for those wonderful "zombie creators".

        Ridiculous measures (read: rebooting) to correct *symptoms*, rather than performing root-cause analysis/identification, go the way of the dinosaur - those were sometimes tough sells, but if it takes a seminar on enterprise systems management of a uniVerse database server, then so be it! Every one of those budding acolytes can be trusted to carry forth the message in the future - you do not need to reboot enterprise class servers to perform software or application "maintenance"!.

        Taking the time to identify root cause pays off, when you are able to stop rebooting the server after you've fixed the problem (and, amazingly enough, the symptoms disappear, too :-/ ).

        In some cases, it was easy: no one working there remembered why some servers were rebooted. Asinine - but easily corrected.

        You still need regular maintenance downtime - aside from urgent patches (security, availability etc) and hardware problems. These are scheduled, off-peak, and redundancy (where used) means no hit to "availability" (which applies to the service offered, not to each node).

        To sum up: what my gray hairs and the bags under the shadows under my eyes have taught me is that 99.999% of reboots that contribute to "lack of availability" are in reality lack of skill/experience, talent, or both. With a mild dose of business people making the decision to reboot servers daily or weekly, like their PCs...

        Just my experience, and my $0.02.

        • you do not need to reboot enterprise class servers to perform software or application "maintenance"!.

          OK, so you're running an Oracle database server on Solaris, in dedicated connection mode. You need to connect another application server node to it, which will push it over its permitted number of processes. To do this, you need to add more semphores in the kernel and reboot, because Solaris can't allocate them dynamically.

          So you cleanly fail over onto your second node, reconfigure and reboot node 1, rejoin the cluster and fail node 2, reconfigure it and reboot, rejoin the cluster and you're good to go without compromising application availability. If you know a way to do it without rebooting, you'd better go tell Sun, because they don't know it.

          To sum up: what my gray hairs and the bags under the shadows under my eyes have taught me is that 99.999% of reboots that contribute to "lack of availability" are in reality lack of skill/experience, talent, or both.

          A lot of people think Unix is the be-all and end-all of operating systems. But it's a long way from that.
  • I do not work in the IT department, I work in Advanced Services as a Broadband Service Supervisor, what this means is that not only am I responsible for a department of over 120 ppl I also do a lot of programming for mine and other departments in the building (NOT BECAUSE I VOLUNTEER). While any department outside of mine, that has any problems with our internal programs should be addressed via IT, very often either IT refers them to me, or I get them directly. So I guess that's how they get things done where I work. (Note: I love the IT guys, and I love my employees, just feels like sometimes I'm overworked underpaid, but hey that's the american way :)
  • by JakiChan ( 141719 ) on Saturday December 14, 2002 @01:26PM (#4887491)
    ...is the software his company is developing. He's telling us, of course, what the big problem with IT is and interestingly enough he's got a solution he can sell you.

    I'm sure that Bill Gates could write an article about the crisis facing the Internet, and how .NET will solve it...don't you love the objectivity?
  • Nothing (Score:5, Insightful)

    by halo8 ( 445515 ) on Saturday December 14, 2002 @01:30PM (#4887514)
    what has to change in IT before we melt down ? Lots

    What WILL change? nothing

    for every burnt out admin thats going to quit theres 5 more waiting to take his place

    good companies keep good employees
    eather your not a good employee
    or your not working at a good company
    • Re:Nothing (Score:3, Insightful)

      by Jerf ( 17166 )
      for every burnt out admin thats going to quit theres 5 more waiting to take his place

      I've seen that comment several times but it finally occurred to me what's wrong with it: KM. (That's "Knowlege Management".)

      When an admin leaves, he takes a tremendous amount of knowlege away with him or her, which must be painstakingly re-aquired by the next admin. This can easily take in excess of a year, depending on how well the original admin did his job. (It takes less if everything's broken; well-oiled parts of the machine may not require attention for a long time, so those take longer to learn.)

      Changing admins is far, far from free, and business will eventually notice that. (In fact, they are starting to, but my impression is that KM is still a "kooky" field, not yet mainstream. Corrections from those closer to that community welcome.)
  • by Helpadingoatemybaby ( 629248 ) on Saturday December 14, 2002 @01:32PM (#4887522)
    I have a lot of experience in this matter.

    I've worked in a few large outfits and in my own small business, and I can testify there is almost zero correlation in a large office between the work that you do and people's perception of what you do.

    The people who have the most problems, the ones who have a terrible catastrophe which just always seems to happen to them, are seen as the problem solvers. Despite the fact that their own lack of organization, incompetence, or laziness often brings these things upon them. No matter, they can proudly trumpet how they once again "saved the company" and worked 30 hours straight. The ones like me who prevent the problems, who organize their day so that nothing exciting happens if it can be avoided, and quietly solve problems on their own without assistance before people notice them, are seen as either invisible or lazy.

    And no, I don't work a 30 hour day. Ever. I don't need to. I'm not bitter... gak!

  • Quit (Score:4, Insightful)

    by Rai ( 524476 ) on Saturday December 14, 2002 @01:35PM (#4887545) Homepage
    Well then how do you get credit for the work you do, when all that's noticed is the downtime?

    The very reason I left. When something crashes, who gets blamed? When users forget their passwords (which are usually something as simple like their friggin' username), who gets hasseled? When admin lays down an impossible time table with ridiculous performance expectations, who gets told "make this work or else?" When the company starts loosing money due to poor business decisions and/or the economy being in the toilet, who's the first dept to get cut?

    Not as appealing as those tech school commercials make it out to be, huh? :)
  • "New crisis"? (Score:2, Interesting)

    by matt-fu ( 96262 )
    You've got to be kidding me. This problem is as old as IT itself.
  • by Soft ( 266615 ) on Saturday December 14, 2002 @01:44PM (#4887590)
    Marc Andreessen proposes automating systems administration, arguing that humans are already at their limits, so scaling up the network require that the machines take care of themselves as much as possible. I fully agree. Basically, he just rediscovered what the Infrastructures.org [infrastructures.org] people have been saying: that a network should be thought of as a single infrastructure, not a number of individual machines.

    I recommend the paper, "Bootstrapping an infrastructure [infrastructures.org]."

  • by starseeker ( 141897 ) on Saturday December 14, 2002 @01:46PM (#4887604) Homepage
    a) The phone industry had one task to do - get data over wires from point A to B, and however they did that was fine - users wouldn't have to be retrained if you replace old cables with fiber optic. That task is relatively simple, compared to the complexity of what is running and being distributed over the internet today. Automation works best when the target is static and clearly definable. I'm not sure either applies with servers/IT/internet.

    b) A significant amount of trouble with maintaining systems comes from having to figure out lots of different pieces of hardware. Lots of random equipment makes IT support a great deal more difficult. There are two solutions:

    1. Standardize all company hardware on a small number of systems/components, say one type of desktop, one type of server, and a few special purpose machines, and then only support those. Tools like VASystemImager then can make tasks like upgrading and bug fixing vastly simpler.

    2. Use inexpensive thin clients interfacing to some powerful central server, ala Largo, and only have to maintain that central machine and swap out cheap, dumb clients. Also simplifies things tremendously.

    People will no doubt point out that you have to run different types of OSs for different jobs and so on, but you can still use the central server/thin client approach and just make the connection to whichever OS you need transparent. It takes thought to set up, but once it is working you don't have 4000 individual support headaches to deal with. Only a few machines to upgrade, support, etc.

    Unfortunately, this won't happen. First, you would have to have a truly MASSIVE infastructure upgrade, which replaces a working system. Riiight. Second, you need to have management willing to try something new and be patient to wait for the long term results. That's not how they think - they think next quarter profits. There is also sheer mental inertia to contend with.

    It would be much easier for new companies to adopt this idea from the get go, than for older companies to adopt it. That may be where new, useful IT principles get applied.

    The only way current companies will do something is if the system BREAKS, and I mean just totally stops functioning. Thats when they will wake up to the fact that significant changes are needed.
  • by WCMI92 ( 592436 ) on Saturday December 14, 2002 @01:50PM (#4887623) Homepage
    My company does IT outsourcing for several local companies, inluding one that has several remote divisions.

    We are NEVER allowed enough time to do more than barely keep things running. Which always leads to things breaking that could easily have been prevented had someone been there to do routine maitenance.

    New deployments (like a Linux server that does remote access for terminal emulation) that I do are compromised by the fact that I'm never given more time than is needed to barely get it working. I never truly get to finish a job to my own high standards.

    My employer views this service contract as "nonbillable hours" despite the fact that they pay us $9K a month for it. The boss wants myself and our other engineers working on other "billable" projects that bring in far less than that $9K. That money doesn't get this company a SINGLE dedicated contract employee (despite the fact that our whole tech staff's monthly salaries don't equal $9K)

    Not only that, our company recently fired our most talented engineer without cause (the week before Thanksgiving) becuase he could hire a "paper tiger" (ie: MCSE) for $5K a year less, and a H1-B who he brought in and treats like a slave.

    It's definately the dark side of IT oursourcing, and something that companies considering doing this should think about.

    If I were going to outsource an IT department for a company with multiple locations and servers, I'd keep at least ONE in-house guy and use the outsource company soley for the "it's broken" crises which need more manpower.
    • If one of my co-workers had been fired for that reason, I'd have quit on the spot. And so would most of my colleagues. Why? Because of loss of faith in the leadership (boss not understanding morale concept, and not knowing who'd be next, leads me to prefer to dictate my own future).

      The boss would be left with an empty department to explain to his VP and justify his bad decisions. Not that it would happen - I have the total faith in my boss and the morale is good, people help each other out.

      You Americans need to learn to fuck people back and not take shit from nobody. The SINGLE reason managers (in IT and elsewhere) can treat people like this, is that the people being screwed react with bending over and asking for more.

  • by IGnatius T Foobar ( 4328 ) on Saturday December 14, 2002 @01:50PM (#4887624) Homepage Journal
    Ok, I read through the article and came away with absolutely no information. He says some things we already know: data centers are expensive, IT people are overworked, and the rest of an organization only notices the technology folks when something breaks. So, what should we do about it?

    Well, here's where you expect an innovator like Andreesen to come up with a brilliant idea that's going to begin the next IT paradigm shift, but all he says is that we need to find some revolutionary way to automate our own stuff -- basically, to automate the act of automating things. And how? Well, he doesn't really know. He makes some vague reference to sending out automatic updates to hundreds of servers at a time, and that's it.

    Real bright there, Marc. Automatic patches and updates. As if that's the answer. In the real world, you don't have a huge farm of servers that all run the same patchlevel of the same operating system. I've got a few hundred boxen behind the glass, for example, that are a mix of Linux, Solaris, FreeBSD, Windows 2000, and Windows NT. And I'd guess that at least 50 percent of them would experience some sort of problem if we were to just push updates out to them unattended -- different applications require different patchlevels and break on others.

    Let's not forget the fact that there's more than just servers. There's infrastructure such as routers, firewalls, and switches. And of course there is the dreaded desktop, which is probably the source of 90+ percent of IT headaches. Until the IT world wakes up and gets the hell off local desktops, the maintenance nightmare will continue. Seen what Microsoft is doing lately? Their vision of the future is one in which applications are loaded through a browser and executed in a local .NET environment. It's basically the same as Java applets, but they call it "Smart Clients" to give you the impression that it's something they invented. Sounds a lot like Network Computing to me -- which simply means that Network Computing is a good idea after all! And now that Microsoft has "invented" it, the idiots who make up most of the world may finally start to adopt the idea. Make the desktop a stateless device like it was 20 years ago when we all had dumb terminals on our desks, and IT overhead will drop like a rock.

    The other trend you're going to start to see is outsourcing. People are realizing that it's expensive to build and run a data center. Fortunately, you don't have to. All you have to do is run your servers at a hosting center [xand.com] that knows how to do outsourced IT (as opposed to just hosting web sites, like the first generation of centers like Exodus did).

    There are ways of streamlining IT after all. Unfortunately, Marc Andreesen didn't touch on any of them. I give this article a "C minus."
  • An accross the board problem in business these days, not just in IT, is that bosses refuse to spend money on anything that does not have a direct effect on the bottom line. Things like IT cost money, but don't directly add revenues to anything. IT is an overhead cost.

    The financial rewards of having a good IT department cannot be expresed in real numbers. The cost of a downtime that didn't happen is impossible to accurately measure, while it's easy to measure the cost of having a quality IT team working to prevent downtimes rather than to fix them.

    Managers need to understand that cutting their IT department will cause lost productivity in other departments in small units. Those 10 minutes of work time Sally loses while trying to find an admin to remind her what her password is, the lost sales that result when e-mail goes down and customers instead turn to your competitor, the custom product that has to be thrown out because a wrong quanity made it on the paperwork that went down the production floor, etc.

    In the end, the business that "don't get it" will slowly become victims of natural selection and close their doors.
  • by Hubert_Shrump ( 256081 ) <cobranet@g[ ]l.com ['mai' in gap]> on Saturday December 14, 2002 @01:53PM (#4887646) Journal
    Going Postal?

    Yes, going Postal. It's hot, it's new, it's... Satisfying.

    "Cool and smooth, I don't know how I dealt with those bastards that kept asking 'this mouse?' without going Postal" -- Satisfied customer.

    And now you too can try it, for the low low price of a single aluminum hardball bat. Spun aluminum with a non-slip rubber grip means never having to say "For all that is good and holy it does not optimize your connection!"

    • > And now you too can try it, for the low low price of a single aluminum hardball
      > bat. Spun aluminum with a non-slip rubber grip means never having to say "For
      > all that is good and holy it does not optimize your connection!"

      You can't properly go postal without automatic weapons. If you're going to do something, so it right!

      dave
  • by sterno ( 16320 ) on Saturday December 14, 2002 @01:54PM (#4887648) Homepage
    Wow, that was insightful. So you mean IT is overworked and that the it'd be nice if IT could become a more automated process? Wow Marc, you've really stunned me with your insights!

    DUH! I mean c'mon, the reason why IT is such a mess is because all the IT staff are being expected to do more with less and do it faster. They are overworked fighting fires which means they aren't given the resources to do advanced planning and put together systems that would really get things right. He makes it sound like this is the responsibility of the IT people, but it's really the responsibility fo the business as a whole to have some foresight and help these IT people out.

    Overworked sysadmins do what has to be done to keep something working RIGHT NOW. Sysadmins with some free time will spend their efforts writing scripts, automating tedious tasks, and making sure fires don't happen. When fires do happen, they have the time to deal with them effectively because they've had a chance to automate a lot of the other tasks.

    I have known many a sysadmin and I have never met a one who wasn't constantly pissed off because he lacked resources. A friend of mine was sysadmin for a company that wanted to have 24 hour uptime for their systems. He was the ONLY sysadmin. That sort of crap happens constantly in the IT world because the other members of the company have the wrong atttitude about IT, that it's an expense. If they looked at it like they look at factories and buildings, as an infrastructure investment, then you'd probably see a lot more happy IT managers out there.

  • by defile ( 1059 ) on Saturday December 14, 2002 @01:55PM (#4887650) Homepage Journal

    Marc Andreessen has an interesting article on what has to happen to IT next.

    I'm going to hazard a guess that this will forecast the overworked, underpaid endgame of IT gruntwork and usher in a new era where companies with CUSTOMIZED SERVICES and NEXT GENERATION TECHNOLOGY come in and automate IT and drastically reduce every admin's workload. He happens to run just a company that provides these services.

    I will now read the article and be amazed if that isn't exactly what he says.

  • by krow ( 129804 )
    I thought that the entire article was fluff. He points out a problem, says things must change, but doesn't give us an answer, doesn't offer a solution, and just prooved that if someone yells fire people will flock around the building to watch.

    Vendor applications rarely work without issue and most companies want to find a way to link every system to every system. They then pay large sums of money out to have someone integrate the applications for them. In the process they create something which is very hard to manage.

    Saying that the bubble is over doesn't mean anything in IT. Your competition is still alive and you still need to be doing that one extra thing that makes you the better buy. If anything this means at the moment that you need to be pushing more not less.
  • by minitrue ( 213792 )
    Remember the September story on how IBM, Sun, etc. wants servers to administer themselves? [slashdot.org] Remember how sysadmins had either lukewarm or negative reactions to it across the net?

    So how do you soften people up to the idea? Wait a couple of months, release a low key but suggestive "article" to get the concept back in peoples' heads, then launch the offending software/hardware/schema again about a year later onto a public, now resigned to seeing the new 'feature' as inevitable. It's a standard pr tactic.

    We saw it with Intel's P3 PSN fiasco, numerous webmail service privacy policy changes, and the XP activation scheme. And I'm not saying that all this is the work of evildoers, just that this is what it is.
  • by melonman ( 608440 ) on Saturday December 14, 2002 @02:08PM (#4887706) Journal

    Telephony is a mature technology that doesn't completely change the way it works every five years. For 80 or so years the way the signals got routed didn't change much at all. Then exchanges went digital, and, for the transition period, it was all bailing wire like the article says, except that the telephone companies had - dare I say it - telephone number budgets to pay for the changeover.

    By comparison, the rate of change in IT is still very high. We've gone from mainframe to micro, from thin client through peer networking and back to thin client, from standalone to the Internet, we've done dial-up, ADSL, wireless...

    ... and one of the main reasons for all the bailing wire is that no company can afford to throw away all it's infrastructure every 24 months. If telephone systems stored data, and if handsets ran bespoke software, there would still be a few manual exchanges in use for backward compatibility.

  • Management... (Score:4, Interesting)

    by Chicane-UK ( 455253 ) <chicane-uk@ntl w o r l d . c om> on Saturday December 14, 2002 @02:10PM (#4887713) Homepage
    One of the biggest problems as I see it is that management dont appreciate how important IT has become in their company. Looking at my company, I think they still relate the IT department to the same IT department of five years ago when to be fair the technology was a little easier to grasp and there were much less computers in offices.

    Take networking for example - it used to be in our place BNC and the occasional run of UTP cable - all attached to relatively unintelligent devices. Now its all Cisco switches, fiber and cat5e - and it really is a full time job in itself managing a large network with so many 'intelligent' devices.

    Also taking into account the addition of so many more servers (SQL, Mail, Finance stuff, Student Records, DNS, Proxy..) - the list is endless. Again, these systems have really bloomed in the past 4 or so years, at least for where I work.

    I guess they dont see how much goes on behind doors when it comes to this business..
  • by Anonymous Coward
    Well, I work as a sysadmin at a hosting co, and the farm of FreeBSD and Red Hat machines I administer is running beautifully. Automatic upgrades, email notifications of all the important stuff, distributed shell program to run commands on all machines at once, Tripwire and firewalls to keep an eye on the hackers. If I'm not working on any projects, I only actually "work" maybe 4 hours a day.

    How do I justify my existence? Easy, the colo machines that I don't administer get hacked or broken almost once a month. Each time, I send a report and I also tell my boss "by the way, I already took care of this, we won't have this problem".

    (Maybe I should send a report each month: "security holes that DIDN'T affect us this month".)

    After a while he realized that I'm like an insurance policy (as well as my Windows counterpart who does basically the same thing).

    Now I just hope that all these colo customers don't sign up for our monitoring service, that will really make me invisible.

    Another less ethical option is to leave a few unsolved but safe problems in the machines so that you have a small fire to put out each week, to make yourself look busy.
  • by defile ( 1059 ) on Saturday December 14, 2002 @02:13PM (#4887729) Homepage Journal
    "...and you have to apply three patches to 100 servers before Close of Business..."

    No sweat.

    # for i in `cat servers`; do scp *.rpm $i:~/; done
    # for i in `cat servers`; do ssh $i rpm -Uvh ~/*.rpm; done
  • The issue is this: Upper management and end users only see the end product. I is either there or it is not. It works, or it does not. They look at that product as the end of the line for IT costs. It is out the door, gone gold -- end of story. What they don't see is the process behind the scenes that makes everything "appear" to be working as promised. It takes (surprise) people!!! Funny how an ROI on any given system may reduce 5 jobs of monkeys at the end of the line making $30K a year...However, to do it right will require (to do it right) 3 high end IT people and maybe a help desk jocky or two. And guess what -- that ROI that cut 5 jobs and $150K per year, take $200K and 3 or 4 people from IT to keep it going and maintained. Now roll forward a few years -- upper level management wonders why 7 or 8 percent of the companies investments are going into the IT "black box" costs. And now look what is happening -- they are starting to take the same chainsaw to the people who have made it possible to chainsaw through the monkeys the last 10 years.....Sad really.
  • Credibility (Score:3, Insightful)

    by Alethes ( 533985 ) on Saturday December 14, 2002 @02:15PM (#4887735)
    I'm really not trying to troll here, and even though he makes some interesting points, it's very difficult to trust the business opinions of a man that has one major dropped ball [netscape.com] in his experience and is now trying to push his automation software [loudcloud.com] as the Next Big Thing in IT. I don't honestly think he has any clue what the NBT is, but neither does anybody else.
  • I was the sole member of the IT shop for many years at a small company. Over the years, I ordered systems, software, services, did all of the installs and maintenance, wrote glue code, etc. Basically, I did it all. My philosophy was to prevent problems, rather than have to fix them.

    When we brought in a new CEO, she began looking for ways to cut costs. I was slated to be on the chopping block because she "never saw me fixing anything". Luckily, I was saved by the management who realized that I was the only one who held things together (of course, they still didn't pay me squat since I was the only person who didn't have a degree and they were mostly PhDs and Masters of whatever). Once I ended up moved on to a higher paying job, they pretty much fell apart.

    The lesson I suppose is that even if you've prevented all of the problems and the organization runs smoothly because of your efforts, you need to make your efforts visible. Sometimes, bragging isn't a bad thing.
  • You compare your IT spending to other companies.
    You point out that, though perhaps your IT guys are paid a relatively high salary compared to the rest of the company, your time between failures, and your overall spending are much less than that of similar companies. THAT is what the suits understand.

    If your IT guy is some kind of stinky zen monk who does nothing all day but medidate or work in the zen garden he build in his office, the suits will be happy if they are spending 1/10th as much as the competition, and everything just plain works.

    Believe me... every manager out there, ever CEO, gets to hear from every company he deals with, in the news, and his mother in law about how computers don't work, the network screwed up, etcetera. When he sits back and thinks "Man, mine works fine..." he will have better faith in his IT guys.
  • by mrobinso ( 456353 ) on Saturday December 14, 2002 @02:49PM (#4887908) Homepage
    Marc Andreessen sounds an awful lot like a lot of IT managers these days. You know, the people who say lots and know nothing. Boardrooms are filled with the notion that IT people are an automaton army that needs to be micromanaged right to the last char and nanosecond. Andreessen sounds like the propogator of this notion.

    In two paragraphs he pronounced to the world the basically all IT infrastructure and paradigms are broken. He later suggests that in order to keep up and succeed, things will have to change.

    Duh.

    The article is so short on details of the failure and possible solutions, I don't know why he wasted the space and bandwidth to deliver this most elegant piece of fluff.

    Fortune 2000 and enterprise in general has been raping its IT payroll for well over a year, probably close to two years. In that space, technology has changed, service delivery ramped up to top up the cuts-enhanced bottom line, media reinvented itself 4 or 5 times, and customers got a whole lot pickier and smarter.
    So the one piece of non-fluff in the article was the mention that a lot of data centers are being held together with spit and string. Well, this is what happens when you whip 5 people to do the work of 20. Seems like thats ok to do as long as the victoms have ballpoint pens in their shirt pockets and hornrimmed glasses perched on the noses.

    The biggest problem in the IT field these days is entrenched in the problems Tim Perdue experienced at SF. Every time an achievement is approached, 45000 know-it-alls with 6 digit incomes glom on, take credit, and micromanage at the DNA level. The suits, blissful in their ignorance and trusting of middlemanagement, believe the stuff that spews out of these ninnies mouths. The solution is for upper management and grassroots IT people, the folks in the trenches, to get together.

    Upper management, in order to be able to do this, needs to be sensitized to the machinations of IT people. They need to know what makes us tick, or they risk finding out what makes us ticked off. They best be doing this quick too. The downturn will end, business will pick up, and a lot of these companies will be up a creek with their infrastructure decay and miniscule overmanaged IT budgets.

    Revenge of the nerds indeed.

  • by jjohnson ( 62583 ) on Saturday December 14, 2002 @02:59PM (#4887946) Homepage
    In the 90s, I saw endless screeds about how technology professionals were a different breed, and how the best thing you could do was get a hacker and leave him alone. Things like The Hacker FAQ [setec.org] fed the self-image of geeks as lone gunman who were above petty business concerns.

    This is what feeds Andreeson's IT crisis today: the fact that technology professionals took their apparent suffering and feelings of being not understood, and used it to isolate themselves. They refused to act like businesspeople in an organization that lives and dies by its profit and loss statement. They complain about how management doesn't appreciate them, but how many learn to do a cost analysis that will show the business reasons for buying software X or hardware Y? In other words, the glorification of the geek in the 90s gave a lot of geeks the idea that they didn't have to learn the language of business to survive there. That's why they're underfunded, underappreciated, and harassed.

    I've had the benefit of a boss who demanded a business analysis for any significant technology initiative at the company. He doesn't get computers, but he understands ROI. He understands a well-presented business case for anti-virus software. We have a wireless network in our new facility in Texas for a real-time inventory management system for one reason: my cost analysis showed that the implementation costs would be recovered within a year because of labor saved from eliminating batch-mode downloading, and that the cost over five years of our wireless system was ~15% of the batch-mode system.

    When geeks figure out that they have to speak the language of the business, then the IT department gets properly funded, properly respected, and properly treated.

    • There is a problem with this mentality. It's kinda
      once sided.

      "If these damn smelly hippy UNIX admins could learn
      to act more like good corporate lapdogs, we'd make
      a bundle"

      Not to say there isn't some validity in there. But
      the other side of that coin is:

      "If these retarded, non-tech savvy idiots gave me
      a boss that understands technology, didn't let
      the morons in marketing make promises they didn't
      check if we could keep. If HR didn't have so much
      power they effectively ruin the company..." and
      so on adnauseum.

      The real problem stems from the "us" versus
      "them" philosophy. "Gotta have a strong manager
      watch over them there technogeeks or they'll walk
      all over us". And the "You don't need a technical
      person to manage technical people" mentality also.
      It has to do with the level of fear a
      non-technical person has about technical people.
      Because technical people "know things". And they
      are probably "trying to trick you". So the ball
      starts in their court. Most of the time upper
      managment fumbles that ball all the hell, and
      deserves the treatment they get from their
      overworked, underpaid wage slaves that they don't
      listen too or appreciate. That's just my opinion
      after 14 years in the business from both sides
      of the fence.
  • Nagios (Score:4, Interesting)

    by Dunkirk ( 238653 ) <david.davidkrider@com> on Saturday December 14, 2002 @03:01PM (#4887956) Homepage
    If you haven't checked into nagios [nagios.org] yet, you owe it to yourself to do so. Now. It's a monitoring application that can take action on problems. That's the first step to automating things in the datacenter. It's open-source, and it's highly useful, if a little tricky to get working.

    I moved out of a group running a lot of big Sun machines (I set up an E10K for them) because of managerial issues. Before I left, we had a budget item for about $250,000 to set up a monitoring and job-scheduling application. It was going to take *another* Sun box to run, and we were being told that it would take 3 months to get it all set up and configured.

    With Nagios, I can do everything we they were talking about implementing. I spent 3 weeks, and it cost me nothing. I employed a dual PII 266 that was collecting dust. (I also used an old P166 as a dedicated kiosk for showing the web page.) My boss and my co-workers think it's great. I'm dying to show it to my old group...
  • machine? (Score:3, Funny)

    by BreakWindows ( 442819 ) on Saturday December 14, 2002 @03:14PM (#4888006) Homepage
    your server farm is a well oiled machine

    Pretty weak-ass server farm, if you ask me..
  • Automation nation (Score:4, Interesting)

    by wytcld ( 179112 ) on Saturday December 14, 2002 @03:53PM (#4888174) Homepage
    Back in the early 80s (when Marc was in what grade?) the business press ran articles about how software was getting so good that soon we wouldn't need programmers, because writing software would itself be handled by software.

    I predict that we'll have software that can write major software at just the same time as we have software that can write convincing novels. In both cases you have the task of putting together language that respresents a broad swath of messy reality.

    Now, systems administration may be more like writing a good technical manual than writing a good novel. Ever notice how many good tech handbooks there are out there? You haven't? Maybe it's because novels are easier. Good systems administration is about leveraging people strengths with machine strengths, and vice versa. Automation without the human element is as uncompetitive as, well, the human element without automation these days.
    • by sql*kitten ( 1359 )
      I predict that we'll have software that can write major software at just the same time as we have software that can write convincing novels. In both cases you have the task of putting together language that respresents a broad swath of messy reality.

      Actually, we do already have software that can write software, it's called CASE, Comupter Aided Software Engineering.

      Unfortunately, it's takes about as much effort to explain to CASE what you want your application to do as it would to write the application yourself anyway!
  • by miffo.swe ( 547642 ) <daniel.hedblom@nOSpaM.gmail.com> on Saturday December 14, 2002 @03:55PM (#4888181) Homepage Journal
    Why?

    Well because the software we use today doesnt lend itself volontarily to extencive automation. You can automate patch installations, users added to all relevant systems at one click of a button, backup and all such things. The problem that some people seems to have a hard time grasping is that software sucks mostly. It is ridden with faults that make any automation fail randomly no matter how well it is implemented. Thats where most IT staff is doing their job, straighten out faults in the software and installing it. Support is also very hard to automate.

    Before any automation can be used on a daily basis software must get much better and have much less bugs at shipping date than today. Its a very wrong approach to go backwards and automate fixing of faults related to bugs. Fix the bugs instead.

  • by gelfling ( 6534 ) on Saturday December 14, 2002 @04:32PM (#4888338) Homepage Journal
    Your Loudcloud was supposed to do this for us and it failed.

    Why did it fail?

    1 No one really knows how to do it
    2 The infrastructure is too expensive
    3 Customer requirements are too dissimilar from one another
    4 No one has the balls to tell customers their requirements are crazy and impossible
    5 Transition costs are poorly understood
    6 Exeutives are measured by overhead and customer satisfaction and not doing the right job the best way
    7 People are not a resource they are an overhead item
  • by clunis ( 62681 ) on Saturday December 14, 2002 @04:50PM (#4888407) Homepage
    My team runs just under 6 dozen web and database servers ( Solaris and Linux ) for the University of Michigan using an open source system management suite called 'radmind [umich.edu]' and I can't say enough good things about it ( I'm not one of its developers, so I can get away with this ): fast, secure, stable, standards based, and makes a little thing like patching several dozen servers a breeze ( though ... what kind of freak patches in the middle of the day? ).

    Incidentally, the CTO of loudcloud ( a.k.a. opsware ) is Tim Howes, of LDAP fame and formerly of the UMich RSUG ( the same group that has since developed radmind ). small world.
  • by ellem ( 147712 ) <ellem52@NOSPaM.gmail.com> on Saturday December 14, 2002 @05:48PM (#4888632) Homepage Journal
    is the one you never see. That idiot running desk to desk doesn't have any idea what they are doing. The one that shows up at 8 and leaves at 6 and appears to be reading screen after screen of pr0n. That SA is the bomb.

    Worship the SA, do not replace the SA with a small shell script.
  • by scottme ( 584888 ) on Saturday December 14, 2002 @07:12PM (#4889027)
    It all sounds a lot like what IBM's Sam Palmisano was preaching back in October [slashdot.org], about "eBusiness on demand".

    The idea that impressed me then was the thought that nobody would seriously consider generating their own electricity now that it's a utility. But back in the early days companies and communities did just that. Same thing today with computing, but tomorrow...

    It strikes me there's a shade more to IBM's vision than there is to Andreessen's, though. Check out the IBM version [ibm.com] here, with links to some more in-depth material.
  • Comment removed (Score:3, Insightful)

    by account_deleted ( 4530225 ) on Saturday December 14, 2002 @09:48PM (#4889710)
    Comment removed based on user account deletion
  • by stinky wizzleteats ( 552063 ) on Sunday December 15, 2002 @03:29AM (#4890770) Homepage Journal

    There's just a few gotchas. These revolve around consistency.

    Are your servers consistent? Are your OS's consistent? Are your applications deployed the same way in each of your remote offices? Do your users have the same applications on their computers?

    Or is your IT infrastructure a giant ad hoc hairball resulting from IT decisions being made by non-IT personnel? Do you use anything because "that's what everyone else uses, so it must be good enough, therefore opinions to the contrary are wrong"?

    Have you implemented things like network management and application/workstation management only to find that the investment is worthless because your organization cannot adhere to a set of standards long enough to make such systems effective, resulting in such an expanse of policy variance that the "management" systems themselves become a net drain on your management resources?

    If so, then you've come to realize that the IT problem does not have a technical solution, rendering the entire premise of the article false.

    The problem is cultural. IT people are not trusted to make IT decisions, such as meaningful policies with regard to how technology is implemented and what resources are required to deliver a given level of service.

    What all of this means is that your Great Answer may not come from the vendor/consultant of the month, but by simply asking your people what is going on, listening to them, and giving them the leeway to make things right. And yes, this was written by a consultant.

Reality must take precedence over public relations, for Mother Nature cannot be fooled. -- R.P. Feynman

Working...