During my internship, I helped build a Raspberry Pi cluster: a computing system made up of several interconnected Raspberry Pi computers. My teacher, who had already assembled a prototype, guided me through the task with detailed instructions. The same setup is also described in the official Raspberry Pi tutorial:
https://www.raspberrypi.com/tutorials/cluster-raspberry-pi-tutorial/
I followed these instructions to configure each node in the cluster, starting with the primary node, which used an external SSD for OS storage. I then configured the compute nodes to boot over the network and load their operating system from the primary node. This step involved installing operating systems, configuring network settings, and verifying communication between the different components of the cluster.
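As an illustration of the "verifying communication" step, here is a minimal sketch of a reachability check that could be run from the primary node. It is not the script used during the internship: the function names, the choice of port 22 (SSH) and the timeout are assumptions made for the example.

```python
import socket


def is_reachable(host: str, port: int = 22, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within `timeout` seconds."""
    try:
        # create_connection resolves the name and opens a TCP connection;
        # the context manager closes the socket again straight away.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and name-resolution failures.
        return False


def check_nodes(hosts, port: int = 22, timeout: float = 2.0) -> dict:
    """Map each host name to True (reachable) or False (unreachable)."""
    return {host: is_reachable(host, port, timeout) for host in hosts}
```

Calling check_nodes(["node1", "node2"]) with the cluster's real host names (the ones shown here are placeholders) returns a dictionary that shows at a glance which compute nodes answer on the network.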
However, setting up the cluster was not without challenges. We ran into a major issue: some Raspberry Pis were entering an unexpected reboot loop. I took an active part in the debugging process, focusing on the implementation of the keep-alive protocol, which helps maintain the connection between nodes. By analyzing the system logs and running tests, I was able to identify and fix an issue in the keep-alive code, which resolved the unexpected reboots and stabilized the cluster. This experience helped me develop my problem-solving and software debugging skills.
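To give an idea of what a keep-alive mechanism does, here is a generic sketch, not the cluster's actual code, which is not reproduced in this report. It assumes each node periodically sends a heartbeat and that a node is considered lost once no heartbeat has arrived within a timeout. The class name, method names and timeout value are hypothetical.

```python
import time


class KeepAliveMonitor:
    """Track the last heartbeat of each node and report the ones that went silent."""

    def __init__(self, timeout_s: float = 10.0, clock=time.monotonic):
        self.timeout_s = timeout_s
        # The clock is injectable so the logic can be tested without waiting.
        self._clock = clock
        self._last_seen = {}

    def heartbeat(self, node: str) -> None:
        """Record that `node` has just signalled it is alive."""
        self._last_seen[node] = self._clock()

    def stale_nodes(self) -> list:
        """Return, sorted by name, the nodes whose last heartbeat is older than the timeout."""
        now = self._clock()
        return sorted(
            node
            for node, seen in self._last_seen.items()
            if now - seen > self.timeout_s
        )
```

Using a monotonic clock rather than wall-clock time is a deliberate choice in this sketch: a clock adjustment on a node (for example after a time synchronization) cannot then make a healthy node look silent.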
This project gave me practical experience in system administration, network configuration and cluster management. I also learned to follow precise technical instructions and to work independently toward a defined goal.