Most Popular

1500 questions
3
votes
0 answers

Setting up MaxQuant parameters for processing proteomics data

This is a follow up to my previous question here. In relevance to my research, I've been looking for proteomics data (control vs. diabetic) and I found a dataset in the article "Diabetes causes marked inhibition of mitochondrial metabolism in…
Natasha
  • 125
  • 2
  • 9
3
votes
0 answers

GRanges coverage error with discordant reads

I mapped my fastq files from 4SU-seq experiment to human genome hg19 with bowtie 2 and got this results: 67716784 reads; of these: 67716784 (100.00%) were paired; of these: 29941143 (44.22%) aligned concordantly 0 times 33244334…
serbe204
  • 51
  • 1
3
votes
1 answer

Grouping x-axis in scatter plot

I am trying to plot one scatter plot for each group, where each group has continous values in x-axis. Reference Data Gives continuous x-axis but all groups are combined. ggplot(data_for_plotting, aes(x = log10(Count), y =…
user98059
  • 345
  • 2
  • 10
3
votes
0 answers

Is there a generally accepted method used to impute missing DNA methylation data (probe beta-values)?

I'm looking at DNA methylation (DNAm) data such as TCGA (e.g., BRCA, KIRP, KIRC, etc.). Currently trying to build use my model to predict DNAm age on test sets, but many of the data sets are missing key probe values used in my clock/model. I have…
sumthymes
  • 31
  • 1
3
votes
1 answer

Multiple file Python script

I am currently doing an analysis requiring a multiple-input script. I have a directory filled with 900+ samples. Each sample is comprised of two-files in a fastq format. This is an example of a sample: MIP_REV_BC_01_01A_S1_R1_001.fastq.gz…
3
votes
1 answer

Processing proteomics data

[this question was also asked on BioStars] In relevance to my research, I've been looking for proteomics data (control vs. diabetic) and I found a dataset in the article "Diabetes causes marked inhibition of mitochondrial metabolism in pancreatic…
Natasha
  • 125
  • 2
  • 9
3
votes
0 answers

LRT test in deseq2 on multiple condition

I have some doubts when i'm running LRT This is my condition data-frame small subset dput(head(metadata,18)) structure(list(FAB = structure(c(1L, 1L, 1L, 1L, 6L, 2L, 6L, 4L, 3L, 5L, 7L, 3L, 2L, 7L, 6L, 2L, 6L, 2L), .Label = c("C", "M0", "M1",…
kcm
  • 1,684
  • 9
  • 24
3
votes
1 answer

Suggestion for python graphics library to plot aligned protein sequences

I'm working towards developing some visualization method for multiple protein sequences and subsequences using PYTHON. I want to show the alignment of many fragment sequences to the original sequences as well as the position of fragments and the…
The August
  • 153
  • 4
3
votes
2 answers

What can be the reason of getting negative branches lengths after BEAST analysis?

BEAST2 is currently being used for tree reconstruction prior phylogeographic analysis. The sample size and loci are described below. I thought that BEAST/BEAST2 does not allow negative lengths of branches on prior distribution level so I was…
Vovin
  • 355
  • 10
3
votes
1 answer

Missing ',' in line when Biopython reads a nexus tree

I want to edit a tree that I got from BEAST2 treeannotator in nexus-format. Usually I use the module Phylo from Biopython for such work but Phylo.read(r"filename.tree", "nexus") gave me the next…
Vovin
  • 355
  • 10
3
votes
1 answer

Constructing Two Triangle Heatmaps in One Square

I'm looking to make a two, two range colour scheme heatmap where each range corresponds to a separate triangle in the heatmap square. I am looking for a relevant guide for this heatmap. Any recommended package/library or guides? Thanks!
raysteven
  • 121
  • 7
3
votes
1 answer

Funcotator reference file error in GATK4

I am using a function called Funcotator (here) in GATK4. The code I am trying to run is: gatk Funcotator \ --variant chr21.vcf \ --reference hg38.fa.fai \ --ref-version hg38 \ --data-sources-path funcotator_dataSources.v1.2.20180329 \ --output…
Scott XU
  • 135
  • 5
3
votes
0 answers

A question on Homer normalization

When using annotatePeaks.pl script from Homer software to create histograms, the output is normalized per bp per peak (on top of normalization to 10 million tags). What does it mean to normalize per bp per peak? Hypergeometric Optimization of Motif…
Sam
  • 167
  • 5
3
votes
2 answers

Wrapping R scripts in WDL and run in docker

I am trying to run an R script as a task in WDL in a docker image. Since this is my first time working with these, I am stuck in something very simple (and in the initial stage) and am hoping somebody here can point me in the right direction. Here…
binn
  • 73
  • 4
3
votes
2 answers

Detect mutation context in a read of a sam file

After sequencing (Illumina) of some DNA, I generated a sam file through alignment of a fastq file (using Bowtie2). Instead of using variant calling programs, I want to know the specific variances, compared to the reference genome, in each specific…
Annelisa
  • 33
  • 3
1 2 3
99
100