The recently implemented code update for the Sections API was successfully deployed to the stage cluster, marking the commencement of the testing phase. Automated tests were then initiated during the overnight period as a standard step in the code release procedure. Notably, it should be acknowledged that the stage cluster shares Memcached servers with the production cluster for this particular API. As a result, the automated regression test run led to the generation of cache records in a novel format, causing compatibility challenges for the production application's read and processing capabilities.
A dedicated DevOps team member identified the issue and promptly halted further execution of the tests. Subsequently, an immediate response was undertaken by implementing an application-level modification. This alteration facilitated the purging of existing cache records, ensuring that the production applications can recreate cache data in the anticipated format.
This incident, while unprecedented, has propelled us to take preventative measures to mitigate any recurrence. As part of this effort, a dedicated Memcached cluster will be established explicitly for our stage environment. This proactive step is aimed at fortifying our testing infrastructure to uphold the highest standards of quality and compatibility assurance moving forward.