The Hadoop Distributed File System (HDFS) splits files into fixed-size blocks (128 MB by default in Hadoop 2 and later) and replicates each block across DataNodes (3 copies by default). The NameNode tracks block locations in memory.
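The block arithmetic can be sketched in a few lines of Python. This is only an illustration, not Hadoop code: the function names are hypothetical, and the 128 MB block size and replication factor of 3 are assumed values (the settings dfs.blocksize and dfs.replication control them in a real cluster).

```python
MB = 1024 * 1024
BLOCK_SIZE = 128 * MB   # assumed block size in bytes
REPLICATION = 3         # assumed replication factor


def split_into_blocks(file_size, block_size=BLOCK_SIZE):
    """Return the length in bytes of each block of a file (hypothetical helper)."""
    if file_size < 0:
        raise ValueError("file_size must be non-negative")
    full_blocks, remainder = divmod(file_size, block_size)
    # The last block holds only the leftover bytes; it is not padded.
    return [block_size] * full_blocks + ([remainder] if remainder else [])


def raw_storage(file_size, replication=REPLICATION):
    """Bytes consumed across the cluster once every block is replicated."""
    return file_size * replication
```

Under these assumptions a 300 MB file becomes two 128 MB blocks plus one 44 MB block, and occupies 900 MB of raw disk across the cluster.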
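Write path: the client contacts the NameNode, receives a list of target DataNodes for each block, and streams the block to the first DataNode, which forwards it down the replication pipeline. Read path: the NameNode returns the block locations and the client reads each block from the nearest DataNode. HDFS is optimized for large sequential reads, not random access.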
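The two paths can be imitated with a small in-memory toy model. Everything here is an assumption made for illustration: the class and function names are hypothetical, placement simply picks the first few nodes, and "distance" is a made-up integer standing in for network topology. Real HDFS uses rack-aware placement and a streaming packet protocol.

```python
from dataclasses import dataclass, field


@dataclass
class DataNode:
    name: str
    distance: int                      # hypothetical distance from the client
    blocks: dict = field(default_factory=dict)

    def receive(self, block_id, data, downstream):
        """Store the block, then forward it to the next node in the pipeline."""
        self.blocks[block_id] = data
        if downstream:
            downstream[0].receive(block_id, data, downstream[1:])


class NameNode:
    """Holds metadata only: which DataNodes store each block."""

    def __init__(self, datanodes, replication=3):
        self.datanodes = datanodes
        self.replication = replication
        self.block_locations = {}      # block_id -> list of DataNode

    def allocate(self, block_id):
        # Toy placement (assumption): take the first `replication` nodes.
        targets = self.datanodes[:self.replication]
        self.block_locations[block_id] = targets
        return targets

    def locate(self, block_id):
        # Return replica holders ordered nearest-first.
        return sorted(self.block_locations[block_id], key=lambda dn: dn.distance)


def write_block(namenode, block_id, data):
    """Write path: ask the NameNode for targets, send to the first one only."""
    pipeline = namenode.allocate(block_id)
    pipeline[0].receive(block_id, data, pipeline[1:])


def read_block(namenode, block_id):
    """Read path: ask the NameNode for locations, read from the nearest."""
    nearest = namenode.locate(block_id)[0]
    return nearest.name, nearest.blocks[block_id]


nodes = [DataNode("dn1", 4), DataNode("dn2", 0), DataNode("dn3", 2), DataNode("dn4", 1)]
namenode = NameNode(nodes)
write_block(namenode, "blk_1", b"hello")
reader, payload = read_block(namenode, "blk_1")
```

Note that the client sends the data once, to the head of the pipeline, and each DataNode passes it on; the NameNode never touches block contents. In this run the block lands on dn1, dn2, and dn3, and the read is served by dn2, the closest of the three holders.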