Scaling up from 1 Web Server + 1 DB Server

We are Web 2.0 company that built a hosted Content Management solution from the ground up using LAMP. In short, people log into our backend to manage their website content and then use our API to extract that content. This API gets plugged into templates that can be hosted anywhere on the interwebs.

Scaling for us has progressed as follows:

Shared hosting (1and1)
Dedicated single server hosting (Rackspace)
1 Web Server, 1 DB Server (Rackspace)
1 Backend Web Server, 1 API Web Server, 1 DB Server
The question is, what's next for us? Every time one of our sites are dugg or mentioned in a popular website, our API server gets crushed with too many connections. Or every time our DB server gets overrun with queries, our Web server requests back up.

This is obviously the 'next problem' for any company like ours and I was wondering if you could point me in some directions.

I am currently attracted to the virtualization solutions but need some pointers on what to consider.

Technology

asked Oct 14 '09 at 05:09

Etienne
26 points

We do actually deploy memcache and we also do a client side API cache. The problem is that no matter how much caching you do, you're still going to have to deal with spikes in traffic and that seems to be what we're most vulnerable on right now. 1. How to handle spikes in apache connections (httpd) 2. How to handle more SQL queries (mysqld) Thanks everyone! – Etienne 16 years ago
This is your best bet - http://www.slideshare.net/josephscott/wordpress-performance-scalability-presentation – Arpit Tambi 16 years ago

7 Answers

Great answers above. Also you'll find Rackspace tech support is really helpful, and could actually help you with this problem.

They've solved this problem and seen other solve it, so at least give that a try.

answered Oct 14 '09 at 05:44

Jason
16,231 points

Agree with Jason, also, ask them about their cloud offerings... this might be something that you could use to solve the scaling problem. – Ricardo 15 years ago

It's difficult to give you the best recommendation without knowing much more about your application and setup. I would suggest getting someone knowledgeable in scaling to take a look at your architecture and evaluate your options.

As others have mentioned the best solution might be a caching layer in your application or more hardware. Another option is setting up a reverse proxy cache if you content is mostly static. Or maybe you just need to put yout assets into a CDN. Or something else altogether. It really is hard to give concrete advice without knowing more about your setup and requirements.

answered Oct 14 '09 at 05:57

User709
11 points

-2

One of your main issues is hosting with rackspace. I'm guessing you're paying hundreds on a single server without getting awesome specifications.

Start using cheaper server hosting companies with more powerful servers, and start employing load balancing techniques. You'll want to stay away from shared architectures like clouds and virtual servers, as you will be sharing resources and not getting your money's worth.

answered Oct 14 '09 at 05:16

Blekkzor
39 points

I couldn't disagree more about cloud or virtual servers, it sounds like that is exactly what he needs to scale out for spikes in traffic. I personally think you get all of your moneys worth (and sometimes more) with cloud or virtual servers since you can get burstable CPU/IO above what you would get if you just had a physical box. – James Avery 16 years ago
agreed, this is weird. Rackspace is known for their excellent server, reasonable prices, and cloud systems. – Jason 16 years ago
My perception of Rackspace, rightly or wrongly is that they are crazy expensive but worth it in terms of support. You can't even get pricing on the main website for standard hosting. Cloud is a different matter - I'm a user - excellent service at the right price. – Benjamin Wootton 15 years ago

The web server should be easy to scale out, using a load balancer and as many workers as you need. For the DB, tweak the hell out of your queries first. Then, buy a beefier server and immediately start thinking about how to shard your database, i. e. split it into chunks that can be operated independently.

A clustering DB engine like MySQL Cluster might also be worth looking at.

Also look at serverfault.com and stackoverflow.com. There's also sites dedicated to MySQL and SQL Server performance, that have great advice on making your DB more performant.

answered Oct 15 '09 at 18:44

Hanno Fietz
280 points

Why don't you talk to your server provider to give you double/triple of burstable RAM ? For that they should charge a small amount than having a guaranteed RAM.
Burstable RAM would help in handling occasional high loads.

answered Oct 15 '09 at 06:32

Skillguru
344 points

There isn't one answer and you have a number of different options. The easiest route is going to be to scale up the servers you have to the largest you can manage, that means a huge DB, Web, and API server. Of course you can only go so far this route, and you still have a big single point of failure.

Reading about your problem it seems like what you would want to do is cache the content that is being pulled from your DB at the API server level using something like memcached, that way you could scale out the number of API servers and not put too much pressure on the DB. You could employ something like EC2 to scale the API servers out as you need them and prune them when you don't need them anymore. This let's you handle spikes without having to have all the servers up at the same time.

It also sounds like you should look into document databases like Couch or MongoDB as both of these can scale out with replication and would make perfect sense for your domain (although they would probably require much more re-architecting)

I would also make sure that the people consuming the API are doing caching on their end.

answered Oct 14 '09 at 05:26

James Avery
570 points

Hi James, yes we do memcache and client side caching. I am intrigued by the EC2 solution. Will do some research - are you deploying any of that right now? – Etienne 16 years ago
Etienne, yes I am currently building a solution that is running in EC2 and working towards dynamic scaling. Right now the instances we have are handling the load but in the near future we will be adding the load balancers in front of them to scale out. – James Avery 16 years ago

You may want to re-architect your app and split in into separate layers based on functionality. In your case, it sounds like you could have a load-balancer, a front-end webserver+memcached, an app-server, an API server, and a back-end database layer. Configure it so you can have multiple instances of each running simultaneously on one or more systems. Then throw each layer onto one or more slices.

There are a lot of VPS sites like Amazon EC2, Dreamhost, or Slicehost. You can scale horizontally if the app is too slow by cloning each slice and reconfiguring so it's incorporated into the running app. As you grow, just add more slices. At some point, inter-slice communication delays get in the way and you get a point of diminishing return.

Then you can move onto your own dedicated servers and grow that way, but the basic architecture stays the same.

answered Oct 14 '09 at 05:29

Raminf
401 points