A mirror of https://github.com/rustic-rs/rustic used to provide automatic docker images
-
Updated
Jul 10, 2024 - Rust
Entity resolution (also known as data matching, data linkage, record linkage, and many other terms) is the task of finding entities in a dataset that refer to the same entity across different data sources (e.g., data files, books, websites, and databases). Entity resolution is necessary when joining different data sets based on entities that may or may not share a common identifier (e.g., database key, URI, National identification number), which may be due to differences in record shape, storage location, or curator style or preference.
A mirror of https://github.com/rustic-rs/rustic used to provide automatic docker images
A file system that can be used to compare different deduplication algorithms.
Simple program that finds duplicate files, written in Rust.
A file deduplication app written in rust to be _blazingly fast_ with absolutely no unsafe code.
🗃️🐊 fdupes-xenon is crposs-platform Rust version of fdupes utility with many cool features! (just for practice in Rust)
Traveler: a file deduplication tool. 📂🔍🗂️
Remove local files that are duplicates of files in another path
A tool to deduplicate backups. It builds a hash tree of all files and folders in the target directory. Optionally also traversing into archives like zip or tar files. The hash tree is then used to find duplicate files and folders.
Duplicate file/folder finder, can also scan in archives, HDD optimized
Deduplicate data by creating reflinks between identical files.
A pure rust library for parsing and generating zchunk file
A collection of algorithms to generate a signature/fingerprint/hash in order to be used for detecting duplicate/near duplicate documents.
An Implementation of Generalized Deduplication, written in Rust
A command-line tool for deduplicating entries in a file or stream.
Created by Halbert L. Dunn
Released 1946