Skip to content

blog and wiki keep going down #163

Open
@domenic

Description

According to StatusCake these are constantly going down. Blog seems a bit worse than wiki.

According to the DigitalOcean logs everything is fine. They're getting a bit more traffic than normal, maybe 10-20 requests per minute, but all responses are 200s supposedly. Blog is at about 40% RAM usage and 20% CPU usage; wiki is at about 30% RAM and 60% CPU usage; and the shared database server is at at about 68% RAM and 12% CPU usage.

There was a major spike in incoming connections and CPU/RAM usage last night around 19:11 Eastern Time, but the outages started getting bad around 17:26 Eastern Time so I'm not sure if it's related.

My best hypothesis is that either DigitalOcean sucks, or something about our setup sucks, and can't handle this much traffic.

Potential ideas:

  • Bump up the server resources even more. Seems unlikely to help given that our RAM/CPU usage is not that high. Although maybe upgrading from the "basic" tier to "pro" tier gives us access to some less-flaky type of server. If we pay enough money we could even run two containers per service, load-balanced by DigitalOcean. This might be worth trying as a first attempt just to see if it makes a difference.
  • Bump up the database server resources.
  • Investigate more complicated in-container caching architectures to reduce the amount of times we hit the database. My understanding was that since DigitalOcean puts a CDN in front of us, sending the right caching expiry headers would cause the CDN to cache the appropriate resources and not hit our source server as much. It seems like this should be enough for relatively-low-traffic sites like ours. But maybe we need to go beyond that somehow and do WordPress/MediaWiki-specific caching stuff.
  • Try AWS instead of DigitalOcean.

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions