Recent events (yes, I mean the performance and availability issues) made us reconsider our infrastructure — servers, routers, storage and how these things are set up. "Infrastructure" is not really something our users should be concerned about but we, the Wikidot Team, definitely should. The Wikidot promise is: we do the dirty job managing Wikidot platform and providing tools, you can concentrate on the cool stuff. Although the "dirty stuff" and whatever improvements we plan should not directly affect the way you use Wikidot, I believe it is good to share our plans in this matter too.
The key problems we are addressing are:
- How can we adapt to long-term growth in content and traffic?
- How can we adapt to short-term traffic spikes — like ones from popular sites like RoaringApps.com, Snow Leopard we had in the past?
- Are there any single points of failure in our infrastructure?
- What happens when key elements of our infrastructure fail?
-
- Are they redundant? If so, is the failover easy enough?
- How quickly can we repair or replace them?
- Is Wikidot platform still available and usable?
-
- Do we have a recovery plan?
In fact our current setup is quite resilient. Data durability has been always our top priority: we always keep multiple copies of database (redundant disks, redundant database servers, daily backup to a remote datacenter) and uploaded files (we use Amazon S3 to keep them safe and secure). We rely on quality hardware and quality datacenter services to keep Wikidot up to its growing tasks. It has been working quite well so far, with only a few issues (recently we discovered one of our servers has been up for 2 years without even a reboot!). But even "a few" issues is still too much for use.
In every scenario we considered we could fix a potential problem — replace a broken element, re-assign critical task to other servers, or even rebuild the whole cluster in a remote datacenter. But there are some potential problems we have identified:
- Provisioning time: it takes 2-4 hours to get a replacement server in our current datacenter
- Hardware failures: it takes from one hour up to several hours to fix a failed component
- No 100% setup automation: there are parts (servers) configured "by hand" — even if we have scripts that automate most tasks, they require manual interaction
- No automatic failover or healing: there are a few servers that play a critical role and any failover to a backup server must be done "by hand"
We are now dedicating a significant portion of our time to find a solution to the above problems. The goal is to create an automated, highly-available and scalable setup. We have the design almost ready with parts of the infrastructure already working. We are evaluating alternative datacenters too, including Amazon AWS.
It is not the first time we consider AWS as a home for Wikidot. But honestly, last time we did, AWS did not have half of useful features it has now. We already use AWS for a number of critical things, including email delivery, geo-aware content distribution, backups, file storage and even DNS.
Although Amazon AWS appears what could be a big win for Wikidot, but it is still too early for the final word. I would rather be careful praising AWS before we set-up a proof-of-concept stack and it proves to be as efficient as our current config.
I will keep you informed about our plans!
After so long time in data processing I can imagine how important this decision really is.
And - there is for a long term basis not the old sense: never change a running system.
A growing (server) world needs parallel a growing knowledge and control and (en)powering.
Good luck for this all!
wish your
Helmut
Service is my success. My webtips:www.blender.org (Open source), Wikidot-Handbook.
Sie können fragen und mitwirken in der deutschsprachigen » User-Gemeinschaft für WikidotNutzer oder
im deutschen » Wikidot Handbuch ?
I hope everything goes well. On the whole Wikidot does a very good job.
Do you feel this change in infrastructure may result in a cost change to users?
Actually we do not expect much cost increase. Running in AWS vs dedicated servers is like comparing apples to oranges. Both have benefits, but AWS allows much more flexibility, which is what we need right now. With a proper design I believe we might even save on some costs.
For our clients there will be no price increase due to the changes.
Michał Frąckowiak @ Wikidot Inc.
Visit my blog at michalf.me
Thanks Michal for sharing your thoughts and for giving us an update on your infrastructure upgrade plans. I'm sure that such gestures of openness and transparency creates more of a sense of trust amongst us users and members knowing where you are heading with your plans for the continual improvement and protection of Wikidot.
Regards
Simon @ Ye Olde
Ye Olde - Creator and Chief Admin of www.music-industrapedia.com (Global Music Industry Directory & Encyclopedia) hosted on Wikidot.
Is Wikidot platform still and usable and will be developed??
Best regards
My Website:
Sklep z herbatami świata - m.in. czerwoną herbata https://e-aromat.pl/26-sklep-herbaty-pu-erh (czerwona herbata - sklep internetowy z herbatami świata)