![]()
Modernizing Write-Heavy Scenarios in Distributed Systems: LSM Trees and Scylla DB
Bandi. Aruna1, P. Swapna2, G.Vinutha3, Ch. Mary Pushpa4
1Bandi. Aruna, Assistant Professor, Department of Computer Science, St. Pious X Degree and PG College for Women, Hyderabad (Telangana), India.
2P. Swapna, Department of Computer Science, St. Pious X Degree and PG College for Women, Hyderabad (Telangana), India.
3G. Vinutha, Department of Computer Science, St. Pious X Degree and PG College for Women, Hyderabad (Telangana), India.
4Ch. Mary Pushpa, Department of Computer Science, St. Pious X Degree and PG College for Women, Hyderabad (Telangana), India.
Manuscript received on 31 July 2025 | First Revised Manuscript received on 06 August 2025 | Second Revised Manuscript received on 12 August 2025 | Manuscript Accepted on 15 August 2025 | Manuscript published on 30 August 2025 | PP: 26-31 | Volume-14 Issue-9, August 2025 | Retrieval Number: 100.1/ijitee.I113014090825 | DOI: 10.35940/ijitee.I1130.14090825
Open Access | Editorial and Publishing Policies | Cite | Zenodo | OJS | Indexing and Abstracting
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Abstract: As modern applications become increasingly dataintensive, there is a growing demand for database systems that can efficiently manage high write-throughput workloads without compromising on performance or scalability. This research examines the internal mechanisms of ScyllaDB, a highperformance NoSQL database that utilises Log-Structured Merge (LSM) trees to address the challenges of write-intensive environments. ScyllaDB optimises data ingestion and persistence by utilising in-memory MemTables, commit logs for durability, and immutable SSTables for disk storage, which are periodically merged through compaction processes to enhance read efficiency and reduce disk usage. The study examines the architectural components of ScyllaDB, focusing on the roles of SS Tables and compaction strategies—Size-Tiered, Levelled, and Time-Window Compaction—in managing storage and retrieval. Experimental deployment within an educational institutional setting demonstrates ScyllaDB’s ability to maintain low-latency writes and scalable data distribution across multiple nodes, highlighting the effectiveness of schema design, partitioning, and resource management. Furthermore, the paper introduces a novel cluster auto-tuning algorithm for ScyllaDB that dynamically adjusts system parameters (e.g., memory allocation, compaction rate, and thread pools) based on real-time performance metrics. A mathematical model is proposed to optimise system latency and throughput under workload variations, utilising reinforcement learning-inspired logic and resource-constrained optimisation. The findings validate ScyllaDB’s effectiveness in high-volume scenarios such as real-time analytics, IoT data management, and time-series databases, positioning it as a robust alternative to traditional relational systems and legacy NoSQL platforms. The proposed auto-tuning framework holds promise for future selfadaptive database clusters in dynamic environments. This research provides practical insights into deploying and optimising write-intensive database systems for modern distributed computing needs.
Keywords: ScyllaDB, LSM Trees, SS Tables, Write-heavy Workloads, Compaction, Auto-tuning, Distributed Databases.
Scope of the Article: Smart Computing
