SDFS XML Configuration Parameters
Tag: locations
Attribute | Description | Editable after 1st mount |
dedupe-db-store | The location of the map metadata. This should be located on local disk. Default is /opt/sdfs/volumes/<volume-name>/ddb | Yes. All files will need to be copied to new location |
io-log | ignore |
<locations dedup-db-store=”C:\Program Files\sdfs\volumes\b2\ddb” io-log=”C:\Program Files\sdfs\volumes\b2\ioperf.log”/>
Tag: io
Attribute | Description | Editable after 1st mount |
chunk-size | The filesystem buffer size used for deduping data in KB. The default is set to 245KB. When –backup-volume is selected 40960KB. | No |
claim-hash-schedule | The cron schedule for performing garbage collection. This is set to run every sunday at 12PM. Look at (http://www.quartz-scheduler.org/documentation/quartz-2.x/tutorials/crontrigger.html) for more detail. | Yes |
dedupe-files | ignore | |
hash-type | The hash type used for deduplication. This is set to VARIABLE_SIP by default. The other option is VARIABLE_MURMUR3 | No |
log-level | If set to 0 opendedupe will be put into debug mode. Default is 1. | Yes |
max-file-inactive | Time in seconds a file can be set inactive before it is close. 0 ignores this option. | Yes |
max-file-write-buffers | Maximum size of the writebuffer, in MB, for a global internal IO cache before deduplication. This is set to 2048MB for backup-volume and 512MB by default. This should be set to at least 512. | Yes |
max-open-files | The maximum number of open files allowed in the filesystem. This parameter determines the amount of RAM used by the filesystem. Each open file will use approximately 3x the max-file-write-buffers during active IO. Default for this parameter is 4096. When using backup-volume it is set to 16. | Yes |
max-variable-segment-size | For variable block deduplication this is the MAX segment size in KB. The default is 32KB and 128KB for backup-volume | No |
meta-file-cache | Determines the number of metadata files to cache in memory. Default is 1024 | Yes |
min-variable-segment-size | The minimum chunk size in bytes at which deduplication will start. This is set to 4096 by default | No |
read-ahead | For cloud storage, initiates readahead cache for files on open. Default is true. | Yes |
safe-close | Closes files when operating system initiates close request. Set this to false when exporting through NFS. Default is true | Yes |
safe-sync | Syncs files when operating system initiates sync request. Set this to false when exporting over NFS Default is true. | Yes |
variable-window-size | Sets the Rabin window size in bytes for variable block deduplication. This is set to the optimal parameter. The default is 48 bytes. | No |
volume-type | The type of volume. This option does nothing Options are standard and backup. Default is backup. | No |
write-threads | The number of simultanious deduplication IO tasks that are performed. Default is 2x the number of cores or 8, whichever is greater. | Yes |
Tag: permissions
Description: Posix file permissions on newly created files
Attribute | Description | Editable after 1st mount |
default-file | The default unix file permissions. Default is 0644 | Yes |
default-folder | The default folder permissions. Default is 0755 | Yes |
default-group | Posix group owner for new files. Default is 0 | Yes |
default-owner | Posix file owner. Default is 0. | Yes |
Tag: sdfscli
Description: REST API parameters
Attribute | Description | Tunable after 1st mount |
enabled | Enable listen port | Yes |
enable-auth | Authentication required for control port. Default is false | Yes |
listen-address | The host ip that the rest port listens on. Default is localhost | Yes |
password | SHA 256 value of password | No – must be set thought sdfscli |
salt | The salt used to configure password | No – determined when password set. |
use-ssl | Use TLS for all REST communication. Default is true | Yes |
Tag: local-chunkstore
Description: DSE parameters
allocation-size | The maximum size in bytes of the backend dedupe storage before compression. | Yes |
average-chunk-size | The Average chunk size in bytes of data after deduplication. This is used to determine the number of hashtable entries to expect on a completely full volume. The default is 8192. | Yes |
chunk-store | The location where unique blocks will be stored or cached. The default is in /opt/sdfs/volumes/<volume-name>/chunkstore/chunks | Yes. Data will need to be copied to new location |
chunkstore-class | The class that is used to store the chunks. This is superseded by extended-config sub key chunkstore-class parameter | No |
cluster-config | Ignore | |
cluster-dse-password | Ignore | |
compress | Data is compressed or not. Default is true | No |
disable-auto-gc | Disable automatic garbage collection as data fills up. Garbage collection will happen at 10% full intervals if set to false. Default is false. | Yes |
enabled | ignore | |
encrypt | Deduplicated data encryption is turned on | Yes |
encryption-iv | Default Initialization vector used for encryption. This value is static for some encryption activities. | No |
encryption-key | The encryption key used for AES 256 bit encryption | No |
fpp | ignore | No |
gc-class | Class used to manage garbage collection. Do not change the default. Default is org.opendedup.sdfs.filestore.gc.PFullGC | No |
hash-db-store | The local location of the hashtable. This should be on the fastest disk possible. Default is /opt/sdfs/volumes/<volume-name>/chunkstore/hdb-<randomid> | Yes. Data must be copied to new location |
hash-db-class | Hash db class used for the hashtable. Default is org.opendedup.collections.RocksDBMap | No |
low-memory | ignore | No |
io-threads | ignore | No |
max-repl-batch-sz | The maximum batch size a response will accept. Default is 128 | Yes |
Tag: network
Description: Deprecated network parameters. Ignore.
Tag: extended-config
Description: Extended configuration parameters for chunk storage.
Special note: This table includes sub tag parameters as well.
allow-sync | Allow file syncs to also force local data spool to flush to cloud. Default is false | Yes |
block-size | The data block size, in MB, used for Archive files this is set between 25MB-40MB by default | No |
delete-unclaimed | Delete data in the cloud when it is deleted locally. Default is true | Yes |
io-threads | Specifies the number of upload threads uses to send data to the cloud. The default is 16 but can be safely set to 64 for fast connections over 1.5GB/s | Yes |
local-cache-size | The size of the local cache when sending data to the cloud. When using with backup on Windows, this should be set to at least the size of the largest single backup. This is set to 10GB by default | Yes |
map-cache-size | The number of open archive file maps allowed at any time. This is set to 100 by default. | Yes |
read-speed | The throttled read speed to cloud storage in KB/s. The default is 0 (disabled) | Yes |
service-type | The object storage service used. This attribute is optional. This should only be set by the mkfs.sdfs command | No |
sync-check-schedule | The schedule that is used to make sure metadata in the cloud is the same as the local metadata.Default is 4 59 23 * * ?. Take a look at http://www.quartz-scheduler.org/documentation/quartz-2.x/tutorials/crontrigger.html for more detail on how this is set | Yes |
sync-files | Sets whether metadata is also synced to the cloud. Default is true | Yes |
upload-thread-sleep-time | The sleep time in MilliSeconds before archive files timeout and are uploaded to the cloud regardless of size. The default is 10000 (10 Seconds) | Yes |
write-speed | The throttled write speed to cloud or local storage. Default is set to 0 (disabled) | Yes |
access-key | The access key used to connect to cloud storage | Yes |
bucket-name | The bucket name used for cloud storage | No. Could be but you would want to make sure all the data is migrated. |
chunk-store-class | The class used to manage storage. This is set based on the storage type specified during the mkfs.sdfs command. | No |
enabled | ignore | |
secret-key | The secret key used to authenticated to the object storage | Yes |
Tag: volume
Description : FS Service configuration parameters
allow-external-links | Allow symlinks to external files within SDFS. Default is true | Yes |
capacity | The logical capacity shown to the operating system. This is ignored by default but can be activated by setting use-dse-size=”false”. | Yes |
closed-gracefully | Indicates whether the volume was shut down gracefully. Will be set to true when volume is shut down. Set to false to recover the entire hashtable from the cloud. | Yes |
cluster-block-copies | ignore | |
cluster-id | ignore | |
cluster-rack-aware | ignore | |
cluster-response-timeout | ignore | |
compress-metadata | Indicates whether metadata map files (DDB) will be compressed using LZ4 when closed. Default is false. Set to true if –backup-volume | Yes |
current-size | The current used capacity of the filesystem | No |
dse-comp-size | The size of the DSE compressed | No |
dse-size | The current size of the DSE | No |
duplicated-bytes | The number of deduplicated bytes found | No |
files | Number of files in the filesystem | No |
maximum-percentage-full | The maximum percentage full the volume can get before writes will be blocked. Default is 95% | Yes |
name | The name of the volume specified during the mkfs.sdfs command | Yes |
offline | Determines if the volume is currently offline. This is set by internal processes. | No |
path | The path to filesystem stubs on local disk. This is set to /opt/sdfs/volumes/<volume-name>/files by default | Yes. Files need to be copied to new location. |
perf-mon-file | ignore | |
read-bytes | The number of bytes read from the filesystem | No |
read-timeout-seconds | ignore | |
serial-number | The serial number of the filesystem determined at mkfs.sdfs. | No |
sync-files | If set to true will initialize a fill sync of the local filesystem with data from the cloud on next mount. Set this to true to recover all files from the cloud. Default is false. | Yes |
use-dse-capacity | Use the actual dse capacity specified by allocation-size to report maximum capacity for volume. Default is true | Yes |
use-perf-mon | ignore | |
volume-clustered | ignore | |
write-bytes | Current number of bytes written | No |
write-timeout-seconds | ignore |