Given a monolithic blob of data that looks like a black box from the outside but is composed of well-defined blocks internally, how do we process it efficiently using all the compute power at our disposal?
-
Split the data into chunks: [0, N1], [N1, N2], ..., [Nn, Nn+1].
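
A minimal sketch of the splitting step, assuming all we know up front is the total length and a desired worker count (the function name and signature are illustrative):

/// Split `len` bytes into roughly equal half-open ranges [start, end);
/// the last chunk absorbs the remainder.
fn chunk_ranges(len: u64, chunks: u64) -> Vec<(u64, u64)> {
    let size = len / chunks.max(1);
    (0..chunks)
        .map(|i| {
            let start = i * size;
            let end = if i + 1 == chunks { len } else { start + size };
            (start, end)
        })
        .collect()
}

fn main() {
    // 100 bytes split across 3 workers: [(0, 33), (33, 66), (66, 100)]
    println!("{:?}", chunk_ranges(100, 3));
}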
-
When starting to process each chunk, first lock onto the boundary of the previous/next block, since the arbitrary chunk boundaries generally do not coincide with block boundaries.
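
What locking on could look like in practice, assuming each block starts with a known magic byte sequence; the marker value and helper name are hypothetical, and a real format might use a length prefix or sync word instead:

/// Hypothetical block marker.
const BLOCK_MAGIC: &[u8] = &[0x1f, 0x8b];

/// Scan forward from the chunk start and lock onto the first block
/// boundary at or after `chunk_start`; returns an absolute offset.
fn first_block_at_or_after(data: &[u8], chunk_start: usize) -> Option<usize> {
    data.get(chunk_start..)?
        .windows(BLOCK_MAGIC.len())
        .position(|window| window == BLOCK_MAGIC)
        .map(|i| chunk_start + i)
}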
-
Process the chunk and store enough information to later reconstruct the monolithic context.
struct Chunk {
    // Chunk boundaries as assigned by the split, as a half-open range [start, end).
    start: u64,
    end: u64,
    // Absolute offsets of the first and last block found within the chunk.
    first_block_offset: u64,
    last_block_offset: u64,
    // Blocks decoded from this chunk, in order.
    results: Vec<Block>,
}

struct Block {
    // Offset of this block relative to the chunk start.
    relative_offset: u64,
    data: Vec<u8>,
}
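
A sketch of the per-chunk step under the same magic-marker assumption, reusing first_block_at_or_after from above; decode_block is a hypothetical stand-in for format-specific parsing that returns the decoded payload plus the encoded length it consumed:

/// Decode every block that starts inside [start, end); a block that merely
/// ends past `end` still belongs to this chunk, which is how each boundary
/// block gets claimed by exactly one worker.
fn process_chunk(
    data: &[u8],
    start: u64,
    end: u64,
    decode_block: impl Fn(&[u8]) -> Option<(Vec<u8>, usize)>,
) -> Chunk {
    let mut results = Vec::new();
    let mut pos = first_block_at_or_after(data, start as usize).unwrap_or(end as usize);
    let first_block_offset = pos as u64;
    let mut last_block_offset = first_block_offset;

    while pos < end as usize {
        // Stop if the data at `pos` does not decode as a block.
        let Some((payload, encoded_len)) = decode_block(&data[pos..]) else { break };
        last_block_offset = pos as u64;
        results.push(Block {
            relative_offset: pos as u64 - start,
            data: payload,
        });
        pos += encoded_len;
    }

    Chunk { start, end, first_block_offset, last_block_offset, results }
}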
-
If our monolithic data allows efficient random access, we can traverse backwards to verify that each block covers the expected range. If not, post-processing will have to detect whether some small boundary blocks are missing.
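
For the no-random-access case, a sketch of such a post-processing check; covered_end is a hypothetical helper that reports the absolute offset just past the last block a chunk actually decoded:

/// Report gaps between consecutive chunks (assumed sorted by start).
/// A gap means a small block straddling the boundary was missed by both
/// neighbours and must be decoded separately.
fn find_boundary_gaps(
    chunks: &[Chunk],
    covered_end: impl Fn(&Chunk) -> u64,
) -> Vec<(u64, u64)> {
    chunks
        .windows(2)
        .filter_map(|pair| {
            let end = covered_end(&pair[0]);
            let next_start = pair[1].first_block_offset;
            (end < next_start).then_some((end, next_start))
        })
        .collect()
}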
-
Combine the results using the offset information.
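
Finally, a sketch of the merge, assuming the desired output is the decoded payloads concatenated back in their original order:

/// Stitch the per-chunk results into one output stream, using the chunk
/// start and each block's chunk-relative offset to restore global order.
fn combine(mut chunks: Vec<Chunk>) -> Vec<u8> {
    chunks.sort_by_key(|c| c.start);
    let mut out = Vec::new();
    for chunk in chunks {
        let mut blocks = chunk.results;
        blocks.sort_by_key(|b| b.relative_offset);
        for block in blocks {
            out.extend_from_slice(&block.data);
        }
    }
    out
}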