Use SQL functions in Rust for checkpointing in SQLite bagfile databases #9

ruffsl · 2019-10-08T10:42:06Z

Diesel, the Rust crate being migrated to for interfacing with recording in SQLite .db3, enables the use of custom sql_functions which can also be implemented in rust:
http://docs.diesel.rs/diesel/macro.sql_function.html#use-with-sqlite

I'm still unsure how to formulate a SQL query that could loop over all rows given a topic_id (A vs B) in the messages (1,2,...,N) table such that it could replicate a hash chain as follows in (9) and (10):

An example of specifying a .filter() query using the topic_id as a criteria is as follows:

bbr_ros2/bbr_cli/src/api/mod.rs

Lines 119 to 123 in a9f0780

    
           use crate::schema::messages::dsl::topic_id; 
        
           let message_results = messages::table 
        
               .filter(topic_id.eq(topic_result.id)) 
        
               .load::<message::Message>(&conn) 
        
               .expect("Error loading messages");

Would it be possible to replicate something like a LAG function (guessing here?) over filter query and feed that into a custom function in SQL sequentially? Is this even possible in SQLite?

https://www.sqlitetutorial.net/sqlite-window-functions/sqlite-lag/

bbr_ros2/bbr_cli/migrations/create_tabels/up.sql

Lines 8 to 13 in a9f0780

    
           CREATE TABLE messages( 
        
               id INTEGER PRIMARY KEY NOT NULL, 
        
               topic_id INTEGER NOT NULL, 
        
               timestamp INTEGER NOT NULL, 
        
               data BLOB NOT NULL, 
        
               bbr_digest BLOB);

Relates to: diesel-rs/diesel#1775 , cc @weiznich

The text was updated successfully, but these errors were encountered:

weiznich · 2019-10-08T15:13:20Z

So if I understand the calculations above correctly each step needs the result of the previous one, right? I should be possible to do that as custom aggregate function in sql, but diesel does currently not provide a interface for custom aggregate functions with the sqlite backend. (It is possible to implement that and it shouldn't be to hard to implement that, If you are interested in using that approach I can provide a few pointers what needs to be done exactly)

That said: The easiest solution here is to just load data batch wise, do the calculation and after that load and process the next batch.

ruffsl · 2019-10-08T18:26:48Z

each step needs the result of the previous one, right?

Correct, digest bytes from the previous message (of the same topic) is used in the current row's hmac.

should be possible to do that as custom aggregate function in sql

Ah, this is termed as a "custom aggregate function" in SQL... TDIL!
Is this what you are referring to with respect to SQLite?:

If you are interested in using that approach I can provide a few pointers what needs to be done

Certainly, I'm still a new rustacean, but I could look into it. Perhaps you'd like to write those pointers and todo items under a new ticket back over at diesel-rs/diesel to proceed further.

That said: The easiest solution here is to just load data batch wise, do the calculation and after that load and process the next batch.

Thats seem like a reasonable workaround for now. Would you suggest to add a partial struct with only the id and topic_id fields to make a filter query and load all the array of indexes to iterate/stride over, or are there other preferred methods for loading? From the FormStruct pattern for partial updates, that would seem to mimic this pattern.

bbr_ros2/bbr_cli/src/models/message.rs

Lines 3 to 10 in a9f0780

    
           #[derive(Debug, diesel::Queryable)] 
        
           pub struct Message { 
        
               pub id: i64, 
        
               pub topic_id: i64, 
        
               pub timestamp: i64, 
        
               pub data: Vec<u8>, 
        
               pub bbr_digest: Option<Vec<u8>>, 
        
           }

weiznich · 2019-10-09T19:07:20Z

Ah, this is termed as a "custom aggregate function" in SQL... TDIL!
Is this what you are referring to with respect to SQLite?:

I refer to this functionality.

Certainly, I'm still a new rustacean, but I could look into it. Perhaps you'd like to write those pointers and todo items under a new ticket back over at diesel-rs/diesel to proceed further.

I've opened a issue for this: diesel-rs/diesel#2191 If you try to implement this and hit a problem just ping me there or in our gitter channel.

Thats seem like a reasonable workaround for now. Would you suggest to add a partial struct with only the id and topic_id fields to make a filter query and load all the array of indexes to iterate/stride over, or are there other preferred methods for loading? From the FormStruct pattern for partial updates, that would seem to mimic this pattern.

In general, you don't need a struct to load values. You could just use tuples there. With big amounts of data it is also preferable to select only those fields that are really required.

ruffsl mentioned this issue Oct 8, 2019

Streaming queries? diesel-rs/diesel#1775

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use SQL functions in Rust for checkpointing in SQLite bagfile databases #9

Use SQL functions in Rust for checkpointing in SQLite bagfile databases #9

ruffsl commented Oct 8, 2019 •

edited

Loading

weiznich commented Oct 8, 2019

ruffsl commented Oct 8, 2019

weiznich commented Oct 9, 2019 •

edited

Loading

Use SQL functions in Rust for checkpointing in SQLite bagfile databases #9

Use SQL functions in Rust for checkpointing in SQLite bagfile databases #9

Comments

ruffsl commented Oct 8, 2019 • edited Loading

weiznich commented Oct 8, 2019

ruffsl commented Oct 8, 2019

weiznich commented Oct 9, 2019 • edited Loading

ruffsl commented Oct 8, 2019 •

edited

Loading

weiznich commented Oct 9, 2019 •

edited

Loading