Features
The main feature of Bacula Enterprise HDFS Plugin is to offer backup and restore of any file contained in HDFS Clusters in an efficient way. The technology supports Full, Incremental, and Differential backups, and is able to perform backups with automatic snapshot management. Using the HDFS Plugin ensures protection of the information stored in Hadoop environments.
A unique characteristic of the Plugin is the ability to filter information based on date, which may be quite useful for very large systems, where old information may not be of somebody’s interest, and/or where having a backup of everything could be problematic.
In order to increase user comfort, a wide range of backup filters have been incorporated. Moreover, a very useful feature of the Plugin is the ability to restore inside the original or a different HDFS filesystem, as well as to any other non-HDFS filesystem.
Also, the Plugin is integrated with Bweb, which guarantees ease of use.
See the detailed list of HDFS Plugin features:
Backup Features
Full/Incremental/Differential backups
Automatic snapshot management
Backup filters:
Exclude directories with a specific name
Exclude files with a pattern
Include files with a pattern
Include files created/modified after a given time
The configuration for HDFS backups is done in a Bacula FileSet configuration file.
During a backup, the Bacula plugin will contact the Hadoop File System to generate a system Snapshot and retrieve Files one by one. During an Incremental or a Differential backup session, the Bacula File Daemon will need to read the difference between two Snapshots to determine which files should be backed up.
Restore Features
Restore to local disk
Restore to the same HDFS instance
Restore to a different HDFS instance
See also
Go to HDFS Architecture
Go to HDFS Installation
Go to HDFS Configuration
Go to HDFS Operations
Go to HDFS Limitations
Go back to the main HDFS Plugin page.
Go back to the main Dedicated Backup Solution page.