Add neurosift-dependent streaming libraries #30

CodyCBakerPhD · 2024-03-03T05:25:12Z

Together with #27 and #28 we should try to see if we can add benchmark capability of the h5wasm and remote-h5-file on the same files and slice selections as the other methods (for rigorous comparison between the two)

@magland @garrettmflynn Do either of you know if it's possible to have a javascript service running on a background process that can be triggered by a Python call in a way that can track the amount of time it took for a given operation to occur

For example, I want to perform the analog of the following operation using remfile: https://github.com/NeurodataWithoutBorders/nwb_benchmarks/blob/refactor_functional_fully_explicit/src/nwb_benchmarks/benchmarks/time_remote_slicing.py#L45-L58 (where the setup is not a part of the timing/tracking but the actual data slice is)

but via the two neurosift-dependent libraries mentioned above

The text was updated successfully, but these errors were encountered:

oruebel · 2024-03-03T06:52:49Z

If this is running JavaScript, do you think that maybe PyExecJS or the js2py library would work to run the script via Python?

magland · 2024-03-04T11:15:25Z

@CodyCBakerPhD This is not so straightforward since neurosift uses WebAssembly in a WebWorker, and as far as I know you can only run web workers within a browser.

But there could be a way to refactor this library so that the code can be run in the main thread of nodejs.

Since it uses webassembly, js2py is not going to work. I think the best bet would be to run using nodejs as a system command.

Will require some refactoring.

CodyCBakerPhD · 2024-03-04T14:48:03Z

Would it be possible to write a really simple standalone .html (forgive my naivety regarding inner works of JS/browsers/WebAssembly) file like the one seen here: https://github.com/catalystneuro/tqdm_publisher/blob/main/demo/client.html

that incorporates in-line timing around the lines of interest and dumps the information to a results file (JSON) of the structure we expect from the rest of the suite? If so then we can have Python based CLI triggers to 'open' that HTML (not sure about closing, though. Can a page automatically close itself after a certain delay?)

magland · 2024-03-04T16:52:37Z

Would it be possible to write a really simple standalone .html (forgive my naivety regarding inner works of JS/browsers/WebAssembly) file like the one seen here: https://github.com/catalystneuro/tqdm_publisher/blob/main/demo/client.html

that incorporates in-line timing around the lines of interest and dumps the information to a results file (JSON) of the structure we expect from the rest of the suite? If so then we can have Python based CLI triggers to 'open' that HTML (not sure about closing, though. Can a page automatically close itself after a certain delay?)

I don't think this is feasible. For one thing, there's no way to get the web page to write to a .json file.

The better option would be to write a nodejs program, because that executes in a normal process and has access to the file system.

garrettmflynn · 2024-03-04T18:16:45Z

If we wanted to ensure that we were benchmarking in the browser (to avoid differences between that and Node.js), we could create a WebSocket system that uses a Python server to request actions on the HTML client and which receives the execution information back.

We could also use my commoners library to create an Electron application (run for development purposes only) that could be executed with npm run benchmark using a Python subprocess, which runs some tests on the browser and saves directly to the filesystem without the intermediary WS connection.

Though if we don't care about the fact that h5wasm and remote-h5-file is executed in the browser, then both of these solutions would almost definitely be overengineered.

garrettmflynn · 2024-03-04T18:21:44Z

Now that I'm thinking about it, you could achieve this without Electron or WS by using puppeteer to execute code in the browser (as long as the action is programmatically triggerable from the global scope). To get some understanding of performance, you'd simply insert a call to performance.now() before and after to get the execution time, which is returned from the execution call and saved as a part of the test.

CodyCBakerPhD · 2024-03-04T19:43:45Z

Though if we don't care about the fact that h5wasm and remote-h5-file is executed in the browser, then both of these solutions would almost definitely be overengineered.

Is there a technological difference that could produce different results between running the same h5wasm code in a browser as opposed to in a nodejs script through a system command?

magland · 2024-03-04T21:25:24Z

Though if we don't care about the fact that h5wasm and remote-h5-file is executed in the browser, then both of these solutions would almost definitely be overengineered.

Is there a technological difference that could produce different results between running the same h5wasm code in a browser as opposed to in a nodejs script through a system command?

I think the results should be comparable. However, I just took a crack at creating a node version of this, and it seems it's going to be difficult to get that to work. So maybe one of @garrettmflynn 's two suggestions would be the way to go.

garrettmflynn · 2024-04-17T21:55:55Z

Here's a basic sandbox for benchmarking on the web and getting timing results back in Python: https://github.com/garrettmflynn/webnwb-benchmark

Looks like we won't be able to time using Python directly here because of some variability inherent in using Puppeteer

CodyCBakerPhD mentioned this issue Apr 30, 2024

MatNWB timing #56

Closed

CodyCBakerPhD closed this as completed Sep 6, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add neurosift-dependent streaming libraries #30

Add neurosift-dependent streaming libraries #30

CodyCBakerPhD commented Mar 3, 2024

oruebel commented Mar 3, 2024

magland commented Mar 4, 2024

CodyCBakerPhD commented Mar 4, 2024

magland commented Mar 4, 2024

garrettmflynn commented Mar 4, 2024 •

edited

Loading

garrettmflynn commented Mar 4, 2024 •

edited

Loading

CodyCBakerPhD commented Mar 4, 2024

magland commented Mar 4, 2024

garrettmflynn commented Apr 17, 2024

Add neurosift-dependent streaming libraries #30

Add neurosift-dependent streaming libraries #30

Comments

CodyCBakerPhD commented Mar 3, 2024

oruebel commented Mar 3, 2024

magland commented Mar 4, 2024

CodyCBakerPhD commented Mar 4, 2024

magland commented Mar 4, 2024

garrettmflynn commented Mar 4, 2024 • edited Loading

garrettmflynn commented Mar 4, 2024 • edited Loading

CodyCBakerPhD commented Mar 4, 2024

magland commented Mar 4, 2024

garrettmflynn commented Apr 17, 2024

garrettmflynn commented Mar 4, 2024 •

edited

Loading

garrettmflynn commented Mar 4, 2024 •

edited

Loading