Most Popular

1500 questions
3
votes
2 answers

How should I interpret DGE results if only one HLA-A gene shows up as significant but not the others?

I have done a DGE recently and have been looking at the DGE list. One of the genes is HLA-A. However, when I dug deeper I realised there are hundreds of HLA-A genes with unique ENSEMBL number (of course it's HLA...). A few other HLA-A genes also…
Kento
  • 67
  • 4
3
votes
0 answers

Genome-guided transcriptome reconstruction: should I filter my reference annotation by transcript_support_level or other tags?

Is there really any advantage to filtering the GTF annotation for transcriptome reconstruction (or, for example, for pseudo-alignment quantification)? Are there any downsides (i.e. reasons why I should NOT do that)? I'm going to run a transcriptome…
bepoli
  • 165
  • 1
  • 8
3
votes
0 answers

Can multiplexing in Sequel II SMRTcells reduce the coverage?

I would like to have high depth of coverage of the transcriptome, but I need to analyze several tissues. I was suggested to pool all 5 tissue samples in same SMRT 8M cell. Will this result in a smaller coverage when compared to non-multiplex…
Caterina
  • 257
  • 1
  • 4
3
votes
0 answers

How are BLAST record scores calculated in biopython

Can someone explain how the score is calculated for Bio.Blast.Record.Description? For example, if Record.Description.num_alignments is >1, is Record.Description.score simply the sum of the alignment scores for the description's hit? Can alignments…
3
votes
2 answers

How can I find the maximum percent identity between two sets of (unaligned) sequences?

As the title states, I've got two sets of unaligned amino acid sequences (~25k sequences in one, ~3k in the other). I want to find the minimum distance between a sequence in the first set and a sequence in the second, expressed in terms of percent…
Empiromancer
  • 163
  • 4
3
votes
2 answers

bash script for interproscan REST

hey i have the problem with bash script. It should run interproscan REST Im green in bash scripting. I found this script on the web: #!/bin/bash i=1 waitevery=30 mkdir -p out for j in $(find `pwd` -type f -name "*.fa") do echo "Iteration: $i;…
MTG
  • 135
  • 7
3
votes
2 answers

STAR vs Bowtie2

What is the fundamental difference between STAR and Bowtie(2). Specifically, what is the difference in their final output (regardless of run-time differences, speed, memory usage etc.). Both seem to work on the same kind of input *.fastq files…
Zebra Fish
  • 189
  • 8
3
votes
1 answer

Find a genomic coordinates for a protein aminoacid position

Is there a function that can map genomic position (hg19) back from a protein position? I can have name of a particular transcript, and exon number. For example, I have KRAS gene for which I would like to know the exact genomic position of G12, and I…
lizaveta
  • 203
  • 1
  • 3
3
votes
1 answer

How To display bar graph with error bars?

I am using the below code to get the bar plot. I want to get the bar plot with error bars displayed. import seaborn as sns def plot_coefficients(target, coeffs_df, n=10): if coeffs_df.shape[0]==0: return None # nothing to plot here! …
Megha
  • 343
  • 2
  • 10
3
votes
0 answers

How to use hgvs to project a variant list on a given protein sequence string?

I have the following protein structural variance list: mutations = ['F594_D600dup', 'D586_E596dup', 'D593_F594insSPEDNEYFYVD', 'E598_F612dup', 'E598_Y599insSGSSDNEYFYVDFREYE', 'E598_Y599insVAYVDFREYE', 'E604_F605insSPRGGNEYFYVDFREYEYDLKWE',…
0x90
  • 1,417
  • 6
  • 17
3
votes
2 answers

R ggtree apply colors to branches with multiple colors

This question was also asked on StackOverflow I have a phylogenetic tree made with a Newick format ((a:1,b:1):2, (c:1, d:1):3):1; The output will be I have drawn this with http://etetoolkit.org/treeview/, but output with R will be same. What I…
SG Kwon
  • 45
  • 3
3
votes
2 answers

Example of a small but moderately complex PDB file?

I am developing a bioinformatics framework only for learning purposes. I want to calculate -- dihedral angles detect H, E, and C components detect Hydrogen bonds etc. Therefore, I want a PDB file that represents a protein with a small chain but is…
user366312
  • 624
  • 1
  • 12
3
votes
1 answer

How can I parse alternative atom information?

I am trying to parse PDB files. Say, a PDB file has the following data: ATOM 33 N ATHR A 2 4.935 -11.632 15.046 0.74 2.95 N ATOM 34 N BTHR A 2 5.078 -11.406 15.180 0.31 2.78 N ATOM 35 CA…
user366312
  • 624
  • 1
  • 12
3
votes
1 answer

What is the function of the heteroatoms

I am trying to make a prediction application using the sequence of proteins. If we think about the 6COU protein, I'm running the code below pdb = PDBParser().get_structure("6c0u", "6c0u.pdb") for residue in pdb.get_residues(): print(residue)…
drorhun
  • 133
  • 5
3
votes
1 answer

deseq2 makeContrasts function

This article talked about various deseq2 designs etc. One of the designs I would like to use is explained as this: Control versus treatment average makeContrasts((treatmentI+treatmentII+treatmentIII)/3-treatmentCTL, levels=colnames(design)) I'm…
kcm
  • 1,684
  • 9
  • 24