Features

The main feature of Bacula Enterprise HDFS Plugin is to offer backup and restore of any file contained in HDFS Clusters in an efficient way. The technology supports Full, Incremental, and Differential backups, and is able to perform backups with automatic snapshot management. Using the HDFS Plugin ensures protection of the information stored in Hadoop environments.

A unique characteristic of the Plugin is the ability to filter information based on date, which may be quite useful for very large systems, where old information may not be of somebody’s interest, and/or where having a backup of everything could be problematic.

In order to increase user comfort, a wide range of backup filters have been incorporated. Moreover, a very useful feature of the Plugin is the ability to restore inside the original or a different HDFS filesystem, as well as to any other non-HDFS filesystem.

Also, the Plugin is integrated with Bweb, which guarantees ease of use.

See the detailed list of HDFS Plugin features:

Backup Features

  • Full/Incremental/Differential backups

  • Automatic snapshot management

  • Backup filters:

    • Exclude directories with a specific name

    • Exclude files with a pattern

    • Include files with a pattern

    • Include files created/modified after a given time

The configuration for HDFS backups is done in a Bacula FileSet configuration file.

During a backup, the Bacula plugin will contact the Hadoop File System to generate a system Snapshot and retrieve Files one by one. During an Incremental or a Differential backup session, the Bacula File Daemon will need to read the difference between two Snapshots to determine which files should be backed up.

Restore Features

  • Restore to local disk

  • Restore to the same HDFS instance

  • Restore to a different HDFS instance

Go back to the main HDFS Plugin page.

Go back to the main Dedicated Backup Solution page.