Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Debarcoded Sequences #10

Open
ozgegizlenci opened this issue Sep 1, 2019 · 1 comment
Open

Debarcoded Sequences #10

ozgegizlenci opened this issue Sep 1, 2019 · 1 comment

Comments

@ozgegizlenci
Copy link

Hello,

I am quite new to sarlacc package. I would like to subset the sequences which carry the same barcodes into seperate folders. However, I couldn't figure out in which step it can be possible.

According to you example;

After adapterAlign and barcodeAlign, I checked as.data.frame(debarcoded@listData) and
as.data.frame(out@listData). None of them had the corresponding barcodes per sequence. Do we have to merge the listData file at this point before going to filteredReads and later realizedReads to get each sequence with barcodes?

Thank you

@LTLA
Copy link
Collaborator

LTLA commented Sep 5, 2019

Some dredging of my memory suggests that realizeReads() is necessary to pull out the sequences. This is because the sequences are never fully read into memory at any point, we just stream from the file and process reads one at a time to reduce RAM usage (e.g., when processing large multiplexed libraries). So the read sequences are never stored in any of the objects. If you want them, you have to pull them out ex[;ocot;u with realizeReads() (which "realizes" them in memory).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants