Thread: Forum Downtime - Power Outage at the webhost

  1. #1

    Forum Downtime - Power Outage at the webhost

    Sorry for the downtime last night. Our webhost, Amazon EC2, suffered a massive power outage at its eastern storage facility. That took out Netflix, Instagram, Pinterest... and Universe Today/BAUT.

    We rebuilt the server and got it running again as soon as we had power.

    It was, indeed, a bad night.
    http://www.theverge.com/2012/6/30/31...tagram-go-down

  2. #2
    Join Date
    Dec 2004
    Posts
    11,219
    I noticed. I figured it was you doing some kind of
    maintenance.

    Can you explain briefly why you would need to rebuild
    the server after a power outage? I'd think a reboot
    would be completely automatic, and fairly routine, not
    requiring any human intervention, and certainly not by
    anyone not onsite.

    Also, I just asked a couple of questions in the name-
    change thread.

    -- Jeff, in Minneapolis
    http://www.FreeMars.org/jeff/

    "I find astronomy very interesting, but I wouldn't if I thought we
    were just going to sit here and look." -- "Van Rijn"

    "The other planets? Well, they just happen to be there, but the
    point of rockets is to explore them!" -- Kai Yeves

  3. #3
    Join Date
    Mar 2004
    Posts
    16,659
    Presumably it's a regeneration of the server image. This is in a cloud environment, and apparently all the power was lost (I thought they were supposed to have redundant backups), so it wouldn't be routine.

    I say there is an invisible elf in my backyard. How do you prove that I am wrong?

    The Leif Ericson Cruiser

  4. #4
    Join Date
    May 2008
    Posts
    9,396
    Quote: Originally Posted by Jeff Root
    Can you explain briefly why you would need to rebuild
    the server after a power outage? I'd think a reboot
    would be completely automatic, and fairly routine, not
    requiring any human intervention, and certainly not by
    anyone not onsite.
    For an ordinary reboot, where all processes are warned about the impending action, yes, it usually is quite a routine event. Things change when there is a severe power failure, with virtual servers and storage even more so. It is much more complex than in the old days, when every server was a machine with a couple of hard drives.
    ____________
    "Dumb all over, a little ugly on the side." -- Frank Zappa
    "Your right to hold an opinion is not being contested. Your expectation that it be taken seriously is." -- Jason Thompson
    "This is really very simple, but unfortunately it's very complicated." -- publius

    Moderator comments in this color | Get moderator attention using the lower left icon:
    Recommended reading: Board Rules * Forum FAQs * Conspiracy Theory Advice * Alternate Theory Advocates Advice

  5. #5
    Even the simple servers we set up for doctors' and dental practices to store their patient records and digital X-ray images have to run a consistency check on the database if it stopped all standing.
    Rules For Posting To This Board
    All Moderation in Purple
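
As a rough illustration of the kind of consistency check mentioned in post #5: the thread doesn't say which database those servers use, so this minimal sketch assumes SQLite purely because its built-in `PRAGMA integrity_check` is easy to run from Python. The function name is hypothetical.

```python
import sqlite3


def database_is_consistent(path: str) -> bool:
    """Return True if SQLite's built-in integrity check reports no problems."""
    conn = sqlite3.connect(path)
    try:
        rows = conn.execute("PRAGMA integrity_check").fetchall()
    finally:
        conn.close()
    # A healthy database yields exactly one row containing the string "ok".
    return rows == [("ok",)]


if __name__ == "__main__":
    # Hypothetical in-memory database, used only so the sketch runs standalone.
    print(database_is_consistent(":memory:"))
```

After an unclean shutdown you would point this at the real database file before letting users back in; anything other than a single "ok" row means the file needs repair or a restore from backup.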

  6. #6
    Normally, you're able to just reboot the server, but in this case the main server wouldn't restart. So we had to create a brand new server instance matching the old server. And then I couldn't attach the active drives to the new server, as they were still attached to the corrupted server and wouldn't detach. So we actually had to create a new server, pull the most recent backups from a snapshot and use that instead. We might have lost a few minutes. I think we take snapshots every 5 minutes.

    So this event was very different from a normal service interruption, and there was a lot of hairpulling and fretting involved. I would think that Amazon would be more insulated from a major power outage, but I guess not.
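
To put a number on "we might have lost a few minutes" from post #6, here is a small sketch of the snapshot arithmetic. It assumes the 5-minute snapshot interval mentioned in the post; the timestamps and function names are made up for illustration, and it doesn't touch any real Amazon API.

```python
from datetime import datetime, timedelta


def latest_snapshot_before(snapshots, failure_time):
    """Return the most recent snapshot time at or before the failure."""
    candidates = [s for s in snapshots if s <= failure_time]
    if not candidates:
        raise ValueError("no snapshot predates the failure")
    return max(candidates)


def data_loss_window(snapshots, failure_time):
    """Return how much recent activity is lost by restoring from a snapshot."""
    return failure_time - latest_snapshot_before(snapshots, failure_time)


# Hypothetical timeline: one snapshot every 5 minutes for an hour.
start = datetime(2000, 1, 1, 20, 0)
snapshots = [start + timedelta(minutes=5 * i) for i in range(13)]
failure = datetime(2000, 1, 1, 20, 58, 30)

lost = data_loss_window(snapshots, failure)
print(lost)  # 0:03:30
```

With snapshots every 5 minutes, the worst case is just under 5 minutes of lost posts, which fits the "few minutes" estimate.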

  7. #7
    Join Date
    Aug 2005
    Location
    location
    Posts
    10,118
    Quote: Originally Posted by Fraser
    So this event was very different from a normal service interruption, and there was a lot of hairpulling and fretting involved. I would think that Amazon would be more insulated from a major power outage, but I guess not.
    Yeah, I thought that was the whole point of the cloud.
    Et tu BAUT? Quantum mutatus ab illo.

  8. #8
    You would think that "the Cloud" means that you can just run your programs in some nebulous scaling server infrastructure, but we're not there yet. You have to tell Amazon what size server you want to run, and that's that. The servers themselves are actually very fragile and run from a specific place. I really wish I could just pay a fee per pageview or something, but that's just not possible right now. A huge opportunity for Google, I think.

  9. #9
    Join Date
    Jun 2004
    Location
    The Great NorthWet
    Posts
    5,110
    Isn't this the second such instance in the last year or so? Or am I thinking of some other site I visit?
    Cum catapultae proscriptae erunt tum soli proscript catapultas habebunt.

  10. #10
    It's the second instance. The previous one was bad, though. It lasted for a couple of days. This was only about 8 hours. Some day we'll have a real cloud.

  11. #11
    Join Date
    Sep 2003
    Location
    The beautiful north coast (Ohio)
    Posts
    35,277
    Quote: Originally Posted by Fraser
    It's the second instance. The previous one was bad, though. It lasted for a couple of days. This was only about 8 hours. Some day we'll have a real cloud.
    And I always assumed astronomers hated clouds....

    At night the stars put on a show for free (Carole King)

    All moderation in purple - The rules

  12. #12
    Join Date
    May 2008
    Posts
    9,396
    Quote: Originally Posted by Fraser
    You would think that "the Cloud" means that you can just run your programs in some nebulous scaling server infrastructure, but we're not there yet. You have to tell Amazon what size server you want to run, and that's that. The servers themselves are actually very fragile and run from a specific place.
    Hmm.. the virtual environment that I'm currently having "my" servers put into is like that: specify server size, some storage, and the resulting server instance can be running on any of the host servers in "some nebulous scaling server infrastructure", in any of the datacenters. The hosting software itself decides where it runs, and it may move a running instance from one datacenter to another, and the attached storage may or may not move with it. It sounds real scary, but it does seem to work.

  13. #13
    @slang - It's the "specify a server size/storage size" that I have a problem with. I'd really only like to pay for what I use, and be able to scale up smoothly when there are spikes.

  14. #14
    Join Date
    May 2003
    Location
    of Greatest Eclipse, Aug. 21 2017 (Kentucky, USA)
    Posts
    4,417
    Quote: Originally Posted by Swift
    And I always assumed astronomers hated clouds....

    I was under the impression that real clouds were the problem, unless the storms were virtual, too.

  15. #15
    Join Date
    May 2008
    Posts
    9,396
    Quote: Originally Posted by Fraser
    @slang - It's the "specify a server size/storage size" that I have a problem with. I'd really only like to pay for what I use, and be able to scale up smoothly when there are spikes.
    Ah, I see what you mean now. I'm sure even that can be done, within limits, but it will probably be expensive. You'd have to balance peak usage against lower usage on other boxes. You've got me curious now; if there's an opportunity I'll ask our virtualisation guys, or the storage people.
    Last edited by slang; 2012-Jun-30 at 10:45 PM. Reason: add quote cuz tobin wedged in!

  16. #16
    Join Date
    Dec 2004
    Posts
    11,219
    Quote: Originally Posted by Fraser
    It's the "specify a server size/storage size" that I have a
    problem with. I'd really only like to pay for what I use, and
    be able to scale up smoothly when there are spikes.
    I don't understand. There are spikes in numbers of users
    trying to connect, but it sounds like you are talking about
    spikes in storage size, which I wouldn't think would ever
    happen except when you are doing something like you just
    did and briefly have multiple copies of everything stored.

    -- Jeff, in Minneapolis
    http://www.FreeMars.org/jeff/

  17. #17
    More users means more processor power, more memory being used etc.

  18. #18
    Join Date
    May 2008
    Posts
    9,396
    Quote: Originally Posted by captain swoop
    More users means more processor power, more memory being used etc.
    For us, yes, probably mainly those. And bandwidth. Other types of clients buying hosting might also have a more dynamic need for storage. And if you can't dynamically adjust your usage to spikes, you must acquire enough resources to deal with spikes, even if 75% of those resources are idle 90% of the time.
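
The "75% idle 90% of the time" point in post #18 can be turned into a quick back-of-the-envelope calculation. This sketch assumes the server is fully used during the remaining 10% of the time (that assumption is not in the thread, and the function and parameter names are hypothetical): 0.90 x 0.25 + 0.10 x 1.00 = 0.325.

```python
def average_utilization(idle_share, quiet_time_share, peak_utilization=1.0):
    """Time-weighted utilization of a fixed-size server.

    idle_share: fraction of capacity sitting idle during quiet periods.
    quiet_time_share: fraction of the time that counts as a quiet period.
    peak_utilization: assumed utilization during spikes (not from the thread).
    """
    quiet_utilization = 1.0 - idle_share
    spike_time_share = 1.0 - quiet_time_share
    return quiet_time_share * quiet_utilization + spike_time_share * peak_utilization


utilization = average_utilization(idle_share=0.75, quiet_time_share=0.90)
overpayment = 1.0 / utilization
print(f"average utilization: {utilization:.1%}")
print(f"fixed sizing costs about {overpayment:.1f}x pure pay-per-use")
```

Under those assumptions only about a third of the paid-for capacity is doing work on average, which is the gap that per-use pricing would close.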

  19. #19
    Join Date
    Dec 2004
    Posts
    11,219
    I assumed that when you go from 10 users doing stuff
    simultaneously to 100 users, the number of connections
    needed would be multiplied by 10, but the disk storage
    needed would only increase by maybe 1%.

    -- Jeff, in Minneapolis
    http://www.FreeMars.org/jeff/

  20. #20
    Join Date
    Jan 2005
    Location
    Olympia, WA
    Posts
    25,722
    Quote: Originally Posted by Trebuchet
    Isn't this the second such instance in the last year or so? Or am I thinking of some other site I visit?
    The last one was a year ago Easter. By coincidence, I was at a con that weekend and didn't know about the severity until I got home Sunday. This has not stopped me from using it as an example of why I'm not a fan of getting rid of physical copies of things.
    _____________________________________________
    Gillian

    "Now everyone was giving her that kind of look UFOlogists get when they suddenly say, 'Hey, if you shade your eyes you can see it is just a flock of geese after all.'"

    "You can't erase icing."

    "I can't believe it doesn't work! I found it on the internet, man!"

  21. #21
    Join Date
    Feb 2003
    Location
    Depew, NY
    Posts
    4,817
    Can't you ask for a server far, far away from Netflix and the other companies that attract bad luck and trouble?

    (I love Netflix, but they are the first people mentioned when the cloud goes down.)
    Solfe

    -----------------------------------------------------------------------------------

    'That was tops! Who's not good at math? I was all, "Four!"' - Finn, Adventure Time.

  22. #22
    Join Date
    Nov 2002
    Posts
    13,990
    Quote: Originally Posted by Solfe
    Can't you ask for a server far, far away from Netflix and the other companies that attract bad luck and trouble?
    The power failure that caused this still has a few million people out of power, by the last news reports. Maybe let's all keep this in a little perspective, huh? BAUT is BAUT, but it's still just an internet message board.

  23. #23
    Join Date
    Feb 2003
    Location
    Depew, NY
    Posts
    4,817
    <Tongue>They won't know we are back up for at least a week. </cheek>
    Solfe

  24. #24
    Join Date
    Oct 2001
    Posts
    28,697
    Quote: Originally Posted by Moose
    The power failure that caused this still has a few million people out of power, by the last news reports. Maybe let's all keep this in a little perspective, huh? BAUT is BAUT, but it's still just an internet message board.
    I'm still waiting for the server for my primary email account to come back up - it's in Rockville, MD, northwest of DC, which is right about where the worst of the storm hit, at least in this area.

    Meanwhile, my parents - who live 250 miles to the southwest - are still waiting for their power to come back on after going through the same storm.
    Everything I need to know I learned through Googling.

  25. #25
    Join Date
    Jan 2005
    Location
    Olympia, WA
    Posts
    25,722
    Well, we know whose problems are more important . . . .
    _____________________________________________
    Gillian

  26. #26
    Join Date
    Oct 2001
    Posts
    28,697
    Quote: Originally Posted by Gillianren
    Well, we know whose problems are more important . . . .
    Well, my parents found out yesterday morning their power was back and promptly emailed me to let me know - at the account that wasn't working. So it took me a while to find out.

  27. #27
    Join Date
    Jan 2005
    Location
    Olympia, WA
    Posts
    25,722
    Okay, that's funny. And very much a modern problem.
    _____________________________________________
    Gillian

