This document was retrieved from a search engine cache. Original document address: http://kodomo.fbb.msu.ru/hg/allpy/file/45da1d3a60c3/utils/freqs.py
allpy: 45da1d3a60c3 utils/freqs.py

view utils/freqs.py @ 1116:45da1d3a60c3

pair_cores/web: add note about support by Ministry of Education and Science
author Boris Nagaev <bnagaev@gmail.com>
date Wed, 28 Nov 2012 00:35:35 +0400
parents 8c7e5c16b2f4
#!/usr/bin/python
"""Read alignment on stdin. Print CSV table of letter frequencies on stdout.
"""
from allpy import protein
from allpy.data import codes
import sys

sys.stderr.write(__doc__)

def freq(monomer):
    # Percentage of sequences that have this monomer in the current column.
    # Relies on the module-level `freqs` (counts for the current column)
    # and `width` (number of sequences); returns "" for absent monomers.
    amount = freqs.get(monomer)
    if amount:
        return 100.0 * amount / width
    return ""

aln = protein.Alignment().append_file(sys.stdin)
# One-letter codes of unmodified residues, plus the gap symbol.
monomers = [code1 for code1, modified, _, _ in codes.protein if not modified]
monomers += ["-"]
width = len(aln.sequences)
# Header row of the CSV table.
print ", ".join(map(str, monomers))
for column in aln.columns_as_lists():
    freqs = {}
    for monomer in column:
        # A falsy cell is counted as a gap.
        if monomer:
            monomer = monomer.code1
        else:
            monomer = "-"
        freqs[monomer] = freqs.get(monomer, 0) + 1
    # One CSV row per alignment column.
    print ", ".join(map(str, map(freq, monomers)))
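
The per-column counting performed by the script can be tried without allpy. The following is a minimal Python 3 sketch under stated assumptions, not the script itself. It assumes the alignment is a list of equal-length strings with "-" as the gap symbol. The helper name `column_frequencies`, the toy alignment, and the five-letter alphabet are hypothetical and do not come from allpy.

```python
from collections import Counter

GAP = "-"


def column_frequencies(rows, alphabet):
    """Return one list per column: the percentage of rows holding each letter.

    Letters that do not occur in a column are reported as "" so that the
    CSV cell stays empty, mirroring the behaviour of freq() in the script.
    """
    width = len(rows)  # number of sequences, as in the script
    table = []
    for column in zip(*rows):  # iterate over alignment columns
        counts = Counter(column)
        table.append([
            100.0 * counts[letter] / width if counts[letter] else ""
            for letter in alphabet
        ])
    return table


# Hypothetical toy alignment: four sequences, three columns.
rows = ["AC-", "AG-", "TC-", "A-G"]
alphabet = ["A", "C", "G", "T", GAP]

table = column_frequencies(rows, alphabet)
print(", ".join(alphabet))
for line in table:
    print(", ".join(map(str, line)))
```

Each output row corresponds to one alignment column, and each cell holds the percentage of sequences with that letter. Passing the counts in as arguments, rather than reading module-level `freqs` and `width` as the script does, is a design choice made here only to keep the sketch self-contained.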