The Joyent Community

A place where the Joyent community can gather, help each other out, and stay informed.

You are not logged in.

#1 2007-11-29 10:01:13

jason
a chief (i started this place)
From: San Francisco
Registered: 2004-06-01
Posts: 8814
Website  Expertise

Performance issues for some (shared) accelerators and aggregate I/O

Some shared accelerators and accelerators are experiencing I/O issues on the backend storage nodes.

The typical symptoms are slow shells and longer page load times for web pages and applications. The appearance/experience of these symptoms are seemingly random, and they're not affecting everyone.

Fundamentally the cause is a lack of spindles/drives to handle the wide fluctuations for some of our users on "older" setups. The solutions are that we're adding more spindles/drives in hot spots to eliminate them, and we're migrating some users (typically with workloads that are fine but simply different from most of us) to an infrastructure that'll provide what they need. Also "older" doesn't mean "old", I'm trying to say that we've recognized this issue, and that recent and new deployments are not going to be experiencing this.

My apologies to those of you experiencing these issues, and thank you for your patience.

Offline

 

#2 2007-11-29 13:16:23

someguy
Moderator
Registered: 2005-09-13
Posts: 577
Expertise

Re: Performance issues for some (shared) accelerators and aggregate I/O

By "older" do you mean shared accelerators that were some of the first to be deployed? If so, does it make sense to update the shared accelerator configuration or just redeploy those on the "older" setups? Do you have a sense for when these issues will be resolved? (N hours, N days, N weeks, or N months, where N<10)

I'm sure the Joyent staff has been putting in extra hours and effort to try to get to the bottom of this. THANKS!

Offline

 

#3 2007-11-29 13:33:36

Eugene
Member
Registered: 2004-06-05
Posts: 823
Expertise

Re: Performance issues for some (shared) accelerators and aggregate I/O

Thanks for the announcement and all the work from the you and team behind the scenes.

Offline

 

#4 2007-11-29 15:13:59

nriley
making it happen
From: Champaign, IL
Registered: 2005-07-31
Posts: 248
Website  Expertise

Re: Performance issues for some (shared) accelerators and aggregate I/O

Yay! Thanks.

Offline

 

#5 2007-11-29 15:19:41

madams
 
From: Edinburgh
Registered: 2005-05-11
Posts: 2067
Website  Expertise

Re: Performance issues for some (shared) accelerators and aggregate I/O

My haven't had any problems on my Shared Accel, but I've been reading about problems others have been having for a few weeks.

Thanks, Jason. This is the type of post we like seeing from you. It clearly states the problem, who is effected, and what steps are being taken to resolve the issue. The only thing you need to add is an time frame for when you plan to have the solution in place.


Mark
Live in the city, work in the country. | OpenSolaris Immigrant

Offline

 

#6 2007-11-29 17:28:43

jason
a chief (i started this place)
From: San Francisco
Registered: 2004-06-01
Posts: 8814
Website  Expertise

Re: Performance issues for some (shared) accelerators and aggregate I/O

madams wrote:

The only thing you need to add is an time frame for when you plan to have the solution in place.


From yesterday til it's done.

Offline

 

#7 2007-11-29 17:58:21

lee
Member
From: France
Registered: 2004-06-21
Posts: 477
Expertise

Re: Performance issues for some (shared) accelerators and aggregate I/O

Kristiewells said yesterday: "We are working to resolve it and expect there to be some positive changes shortly (within days)."

Offline

 

#8 2007-11-29 18:11:00

mrdale
Member
Registered: 2005-03-02
Posts: 209
Expertise

Re: Performance issues for some (shared) accelerators and aggregate I/O

Hope this relates to bolinas and johnson. Cuz they're slower than molasses right now. I don't know many users who are willing to watch a progress bar for 30 seconds.

But I do appreciate the update. Thanks


★ nelson (fare-thee-well, old unreliable) ★ johnson ★ bolinas ★ Sx2 ★

Offline

 

#9 2007-11-29 18:23:26

jotto
New Member
From: Wisconsin, USA
Registered: 2005-05-19
Posts: 135
Expertise

Re: Performance issues for some (shared) accelerators and aggregate I/O

Jason wrote:

From yesterday til it's done.


that is the only kind of time frame joyent should be announcing. i like it better than when you guys set unrealistic expectations on yourselves.

Offline

 

#10 2007-11-29 19:09:09

gtcaz
Raconteur
From: Tucson, AZ
Registered: 2005-01-21
Posts: 1605
Website  Expertise

Re: Performance issues for some (shared) accelerators and aggregate I/O

Yes, for sure was seeing this on Myrtle and Magnolia. Great to hear you're working on in.


You're gonna have to answer to the Coca-Cola company.

Offline

 

#11 2007-11-29 19:34:34

bretthoerner
...
Registered: 2006-12-24
Posts: 843
Expertise

Re: Performance issues for some (shared) accelerators and aggregate I/O

Hope it's easy to throw the drives in, Bridgeway is even worse today. Reid's homepage just took 1m33.681s to fetch (when it doesn't throw a 50x). I'm using his because I'm still rewriting mine to a plain "we're down" page. :(

Offline

 

#12 2007-11-29 19:54:30

reid
Pixel Wrangler
From: Atlanta, Georgia
Registered: 2004-06-01
Posts: 2115
Website  Expertise

Re: Performance issues for some (shared) accelerators and aggregate I/O

bretthoerner wrote:

Reid's homepage just took 1m33.681s to fetch (when it doesn't throw a 50x).


Yes, I've had the role reversal of three clients contacting me to report that my site is down. Even my 80 year old Mom wants to know what's up with my site. My niece isn't happy either. Being 2.5 years old, that's not uncommon, though.

Me? I'm calm about it only because I don't have the time to be any other way.


"Welcome to Hell. Here's your accordion."

Offline

 

#13 2007-11-29 20:15:47

anoop
Member
From: Charlottesville, VA
Registered: 2005-03-01
Posts: 357
Website  Expertise

Re: Performance issues for some (shared) accelerators and aggregate I/O

jotto wrote:

Jason wrote:

From yesterday til it's done.


that is the only kind of time frame joyent should be announcing. i like it better than when you guys set unrealistic expectations on yourselves.


+1

Good estimates > No estimates > Bad estimates

Also, could this be added to the Server Status page:

https://help.joyent.com/index.php?pg=fo … s&id=1

Right now the servers affected are not listed.

Offline

 

#14 2007-11-29 20:35:24

ronp001
#@$%^@?!
Registered: 2005-06-19
Posts: 40
Expertise

Re: Performance issues for some (shared) accelerators and aggregate I/O

One more thing that wasn't mentioned on this thread was the severity of the problem.
I'd like to contribute a few statistics that I collected today.

To measure performance, I ran "time ls > /dev/null" repeatedly on a directory which contains less than 30 files. I did this on two machines (cardero and bridgeway).

Some highlights of the results:

1. Most of the time (>55%), 'ls' on cardero (FreeBSD, shared hosting) takes less than 5ms to complete.
2. On bridgeway (the Solaris Premier container), only about 16% of the commands take <5ms.
3. A very significant portion of the time (>25%), the command on bridgeway takes over 100ms to complete
4. About 1% of the time the command on bridgeway took between 10 seconds and a full minute. On 2 instances the command took over 2 minutes to execute!

Here's some detailed info on what I did. To start, I used the following little piece of bash code:

Code:

while true; do echo "----"; date; time ls > /dev/null; sleep 5; done


I collected the output, did some parsing on it using a small php script, and did a histogram analysis of the time elapsed.
I did this it on both bridgeway (shared container, Solaris, premier) and on cardero (an old FreeBSD, shared hosting machine).
Hopefully there aren't too many bugs in my analysis script ;)

The results I got were as follows:

Code:

Results for cardero (FreeBSD shared)

Total measurements: 5288 Measurements taken from Thu, 29 Nov 2007 -- 09:16:04 GMT to Thu, 29 Nov 2007 -- 20:15:31 GMT less than 5ms: -- 55.862% (2954) 5ms - 10ms: -- 11.876% (628) 10ms - 50ms: -- 17.114% (905) 50ms - 100ms: -- 4.274% (226) 100ms - 1sec: -- 7.451% (394) 1sec - 5sec: -- 2.156% (114) 5sec - 10sec: -- 0.756% (40) 10sec - 20sec: -- 0.473% (25) 20sec - 30sec: -- 0.038% (2) 30sec - 60sec: -- 0.000% (0) 1 - 2 minutes: -- 0.000% (0) over 2 minutes: -- 0.000% (0)

Results for bridgeway (Solaris shared container, Premier)

Total measurements: 7076 Measurements taken from Thursday, 29 November 2007 -- 9:03:54 GMT to Thursday, 29 November 2007 -- 20:16:46 GMT less than 5ms: -- 15.941% (1128) 5ms - 10ms: -- 7.292% (516) 10ms - 50ms: -- 34.483% (2440) 50ms - 100ms: -- 15.941% (1128) 100ms - 1sec: -- 20.548% (1454) 1sec - 5sec: -- 3.477% (246) 5sec - 10sec: -- 1.074% (76) 10sec - 20sec: -- 0.565% (40) 20sec - 30sec: -- 0.212% (15) 30sec - 60sec: -- 0.254% (18) 1 - 2 minutes: -- 0.184% (13) over 2 minutes: -- 0.028% (2)


I am extremely hopeful that 'til it's done' will be very very quick.

Last edited by ronp001 (2007-11-30 14:45:19)

Offline

 

#15 2007-11-29 21:05:57

alexxale
New member
Registered: 2007-11-29
Posts: 4
Expertise

Re: Performance issues for some (shared) accelerators and aggregate I/O

We are using shared accelerator and we experience awful timeouts.

What if we move to dedicated accelerator - is this issue also happens on it?

Offline

 

#16 2007-11-29 21:15:21

alexxale
New member
Registered: 2007-11-29
Posts: 4
Expertise

Re: Performance issues for some (shared) accelerators and aggregate I/O

I mean "L", "XL" and "XXL" hosting plans

Offline

 

#17 2007-11-29 21:58:13

bretthoerner
...
Registered: 2006-12-24
Posts: 843
Expertise

Re: Performance issues for some (shared) accelerators and aggregate I/O

The problem we're having is unrelated to whether you're on a Shared or Solo Accelerator. We just had really bad luck with our storage infrastructure, it seems.

Offline

 

#18 2007-11-29 22:07:00

bretthoerner
...
Registered: 2006-12-24
Posts: 843
Expertise

Re: Performance issues for some (shared) accelerators and aggregate I/O

Err, now, is this a recent problem? It's also possible that your site/app is slow regardless of this I/O problem and that you DO indeed need more CPU/RAM. A non-shared Accelerator can't hurt, for sure.

Offline

 

#19 2007-11-29 22:21:44

ichigo
panem et circenses 2.0
From: Vienna, Austria, Europe
Registered: 2005-02-25
Posts: 591
Website  Expertise

Re: Performance issues for some (shared) accelerators and aggregate I/O

reid i just stumbled upon your 500 error message. hilarious. though i hope the situation improves soon.

Offline

 

#20 2007-11-29 22:26:58

alexxale
New member
Registered: 2007-11-29
Posts: 4
Expertise

Re: Performance issues for some (shared) accelerators and aggregate I/O

bretthoerner wrote:

Err, now, is this a recent problem? It's also possible that your site/app is slow regardless of this I/O problem and that you DO indeed need more CPU/RAM. A non-shared Accelerator can't hurt, for sure.


We have no traffic at all on our website and it loads very slow, even timeouts sometimes.

Offline

 

#21 2007-11-29 22:37:10

bretthoerner
...
Registered: 2006-12-24
Posts: 843
Expertise

Re: Performance issues for some (shared) accelerators and aggregate I/O

Then that's the current I/O problem Jason is posting about... we can just hope they fix it soon.

Offline

 

#22 2007-11-29 22:38:48

ronp001
#@$%^@?!
Registered: 2005-06-19
Posts: 40
Expertise

Re: Performance issues for some (shared) accelerators and aggregate I/O

bretthoerner wrote:

we can just hope they fix it soon.


Very soon, I hope. Seems our sites have become totally unusable.

Offline

 

#23 2007-11-29 23:02:53

reid
Pixel Wrangler
From: Atlanta, Georgia
Registered: 2004-06-01
Posts: 2115
Website  Expertise

Re: Performance issues for some (shared) accelerators and aggregate I/O

ichigo wrote:

reid i just stumbled upon your 500 error message. hilarious.


Thanks, I don't have a custom 503 message, but there is this one for an Error 500, Error 403, and, of course, Error 404

Might as well have fun with errors, if you're going to have them. In fact, a new Error 503 message could be a project for later tonight. It might have an illustration of an accordion.


"Welcome to Hell. Here's your accordion."

Offline

 

#24 2007-11-29 23:04:46

ngungo
a monpageur
Registered: 2004-06-01
Posts: 3465
Website  Expertise

Re: Performance issues for some (shared) accelerators and aggregate I/O

What is spindle? I guess it is some kind of hardware but what is it?

Offline

 

#25 2007-11-29 23:10:07

bretthoerner
...
Registered: 2006-12-24
Posts: 843
Expertise

Re: Performance issues for some (shared) accelerators and aggregate I/O

It's a way of saying they need more hard drives. Spindles are what the HD platters spin on (like a car axle) and the more physical spindles they have in their (I assume) RAID-Z the faster we go.

Offline

 

#26 2007-11-29 23:16:20

ngungo
a monpageur
Registered: 2004-06-01
Posts: 3465
Website  Expertise

Re: Performance issues for some (shared) accelerators and aggregate I/O

:) I took it too serious heh!

Offline

 

#27 2007-11-29 23:22:40

Jan
Joyeur Emeritus
Registered: 2004-06-01
Posts: 3081
Website  Expertise

Re: Performance issues for some (shared) accelerators and aggregate I/O

I predict a performance melt down as Reid's error pages get a mention on slashdot.


Ah, you seek meaning? Then listen to the music, not the song.

Offline

 

#28 2007-11-30 00:02:21

iolaire
Member
From: Arlington, VA USA
Registered: 2005-03-02
Posts: 808
Website  Expertise

Re: Performance issues for some (shared) accelerators and aggregate I/O

David's update is good also:
http://discuss.joyent.com/viewtopic.php … 57#p155657

As one of the people who is an IO user I appreciate that Joyent wants to fix the issue.


My refurbished Mac site -> While Supplies Last. The next site is Decreasing Prices with daily updates and minor tweaks now and then...

Offline

 

#29 2007-11-30 01:38:32

jdjustice
Perpetual Optimist
From: North Carolina
Registered: 2007-08-13
Posts: 215
Website  Expertise

Re: Performance issues for some (shared) accelerators and aggregate I/O

Jan wrote:

I predict a performance melt down as Reid's error pages get a mention on slashdot.


Ha! ;-)

Offline

 

#30 2007-11-30 02:06:28

fitzage
Chief Digression Officer
Registered: 2006-02-14
Posts: 4380
Expertise

Re: Performance issues for some (shared) accelerators and aggregate I/O

Thanks guys. My accelerator seems fine, but Litho, Tamalpais, and Mariposa have been having issues. I have faith.


Don't ask me. I only work here.

Offline

 

#31 2007-11-30 02:09:07

jdjustice
Perpetual Optimist
From: North Carolina
Registered: 2007-08-13
Posts: 215
Website  Expertise

Re: Performance issues for some (shared) accelerators and aggregate I/O

fitzage wrote:

I have faith.


Me too. I'm a believer.

Offline

 

#32 2007-11-30 02:34:24

timjcoulter
Lifer/Litho
From: Portland, Oregon
Registered: 2006-01-21
Posts: 223
Website  Expertise

Re: Performance issues for some (shared) accelerators and aggregate I/O

I found a silver lining to this: Litho's recent flakiness has allowed me to reproduce some previously un-reproducible Coda bugs. All I needed was a shitty server to test on. :)

That said, let's hope this doesn't continue much longer. I've got a musical advent calendar launching soon.

Offline

 

#33 2007-11-30 04:01:30

iguy
New member
From: Wisconsin
Registered: 2007-06-30
Posts: 7
Expertise

Re: Performance issues for some (shared) accelerators and aggregate I/O

Overall I'm just glad that Joyent has admitted to the problem and is keeping us informed. I started moving to Magnolia from a FreeBSD and wasn't exactly stoked on the performance as I was getting stuff configured and brought online. Then thankfully this came out and told me my experience is not what should be expected. So I'm actually really happy to get this update so I can reset my expectations correctly that they are working on it and know about it.


Its all fun and games until the flying monkeys attack.

Offline

 

#34 2007-11-30 07:06:31

jordanbrock
Member
From: Perth, Australia
Registered: 2005-01-31
Posts: 344
Website  Expertise

Re: Performance issues for some (shared) accelerators and aggregate I/O

Jan wrote:

I predict a performance melt down as Reid's error pages get a mention on slashdot.


Can we have a mirror? Unable to even get an error page at the moment :(

lol

Offline

 

#35 2007-11-30 07:16:10

jdjustice
Perpetual Optimist
From: North Carolina
Registered: 2007-08-13
Posts: 215
Website  Expertise

Re: Performance issues for some (shared) accelerators and aggregate I/O

I just got the 500 error page to load, but the home page won't load at all.
That really, really sucks. -- EDIT: I just got the 503 page waiting for the home page LOL. But that one's not even remotely funny, just sad :-(

jordanbrock wrote:

Can we have a mirror? Unable to even get an error page at the moment :(

lol

Offline

 

#36 2007-11-30 07:18:30

jdjustice
Perpetual Optimist
From: North Carolina
Registered: 2007-08-13
Posts: 215
Website  Expertise

Re: Performance issues for some (shared) accelerators and aggregate I/O

Spintech page loads for me but slowly. Is that on a Shared Accel?

EDIT: I see it is on a Premiere box.

Last edited by jdjustice (2007-11-30 07:23:01)

Offline

 

#37 2007-11-30 07:41:21

reid
Pixel Wrangler
From: Atlanta, Georgia
Registered: 2004-06-01
Posts: 2115
Website  Expertise

Re: Performance issues for some (shared) accelerators and aggregate I/O

jdjustice wrote:

EDIT: I just got the 503 page waiting for the home page LOL. But that one's not even remotely funny, just sad


Error 503, New & Improved: Less Sad, More Funny! But no accordion. No time for a proper illustration.


"Welcome to Hell. Here's your accordion."

Offline

 

#38 2007-11-30 08:03:43

jdjustice
Perpetual Optimist
From: North Carolina
Registered: 2007-08-13
Posts: 215
Website  Expertise

Re: Performance issues for some (shared) accelerators and aggregate I/O

Now your site visitors have something to look at when they visit your homepage! Wait... that is your homepage. :/

reid wrote:

jdjustice wrote:

EDIT: I just got the 503 page waiting for the home page LOL. But that one's not even remotely funny, just sad


Error 503, New & Improved: Less Sad, More Funny! But no accordion. No time for a proper illustration.

Offline

 

#39 2007-12-01 14:34:34

ngungo
a monpageur
Registered: 2004-06-01
Posts: 3465
Website  Expertise

Re: Performance issues for some (shared) accelerators and aggregate I/O

Is everything alright now. All my sites seem very responsive at this moment.

Offline

 

#40 2007-12-01 14:36:43

ngungo
a monpageur
Registered: 2004-06-01
Posts: 3465
Website  Expertise

Re: Performance issues for some (shared) accelerators and aggregate I/O

Reid's is also pretty decent!

Offline

 

Board footer

Powered by PunBB
© Copyright 2002–2005 Rickard Andersson