
Deploy multi-server Lustre filesystem for large production
Upwork
Remoto
•12 hours ago
•No application
About
I need an experienced Lustre engineer to take my completely bare-metal infrastructure from zero to a stable, multi-server Lustre installation that can sustain large-scale data processing workloads. Here’s what I’m looking for: • Architecture planning – size and layout the metadata and object storage layers, network topology (InfiniBand/Eethernet), and recommended the design. • Full installation and configuration of the Lustre stack across multiple servers, including LNET tuning, failover configuration, and kernel/OFED alignment. • Benchmarking and optimisation so the file system reaches production-ready throughput and latency targets. • A concise run-book that documents every command, configuration file, and recovery procedure so my internal team can maintain the system long-term.