You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I think this is a neat tool and was just trying it out today. I am curious about one potential issue. I've noticed if the first 3 fields are separated by tab but any subsequent fields are separated by another delimiter (e.g. space) then bedqc will give green checks all around but incorrectly identify the file as having 3 fields. Files where the first 4 fields are delimited by tabs but with space delimitation afterwards seem to run through bedtools intersect (and I assume other functions) fine so this may rarely be an issue but I assume inconsistent delimitation of this type may happen often (like appending extra columns to a bed file) and it may help for bedqc to explicitly report something like this. I apologize if I'm simply missing something though or if this is a trivial point.
The text was updated successfully, but these errors were encountered:
BED does allow the delimiter to change throughout the file as long as it's either space or tab, but right now my main concern is successfully grabbing the first 3 columns in order to continue the analysis of the intervals. The files in this case being exposed to the bedtools command have been standardized as tab-delimited and sorted. This particular issue will have to be addressed in the function where I allow the user to download a "fixed" version of their file. Of course, that depends on finding these more complex issues in the first place...
I'm attempting to account for most of the cases outlined by the Hoffman group (https://github.com/hoffmangroup/acidbio) and we're both working from the official specs at:
I think this is a neat tool and was just trying it out today. I am curious about one potential issue. I've noticed if the first 3 fields are separated by tab but any subsequent fields are separated by another delimiter (e.g. space) then bedqc will give green checks all around but incorrectly identify the file as having 3 fields. Files where the first 4 fields are delimited by tabs but with space delimitation afterwards seem to run through bedtools intersect (and I assume other functions) fine so this may rarely be an issue but I assume inconsistent delimitation of this type may happen often (like appending extra columns to a bed file) and it may help for bedqc to explicitly report something like this. I apologize if I'm simply missing something though or if this is a trivial point.
The text was updated successfully, but these errors were encountered: