GENSCAN 1.0 Date run: 6-Nov-116 Time: 18:55:43 Sequence gi568815597r:150696802_150906805 : 210004 bp : 41.20% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 791 1019 229 2 1 53 42 174 0.670 4.42 1.02 PlyA + 1231 1236 6 1.05 2.40 PlyA - 1281 1276 6 1.05 2.39 Term - 2166 2101 66 0 0 115 38 27 0.342 -2.64 2.38 Intr - 6592 6509 84 1 0 94 54 71 0.439 3.40 2.37 Intr - 7393 7317 77 2 2 114 75 34 0.817 3.02 2.36 Intr - 7542 7476 67 0 1 38 116 67 0.963 2.06 2.35 Intr - 10008 9752 257 1 2 67 84 172 0.950 10.94 2.34 Intr - 11606 11455 152 1 2 67 74 97 0.438 5.09 2.33 Intr - 51078 50976 103 1 1 78 90 117 0.873 9.21 2.32 Intr - 53370 53205 166 0 1 103 43 170 0.978 12.61 2.31 Intr - 55207 54980 228 1 0 85 74 290 0.994 24.44 2.30 Intr - 58349 58200 150 2 0 87 89 128 0.991 12.24 2.29 Intr - 61179 61057 123 0 0 82 78 57 0.915 3.96 2.28 Intr - 67963 67837 127 0 1 80 71 48 0.441 2.06 2.27 Intr - 102472 102367 106 2 1 52 95 84 0.926 3.85 2.26 Intr - 102908 102743 166 2 1 88 116 165 0.992 17.91 2.25 Intr - 107438 107220 219 2 0 71 94 271 0.985 23.58 2.24 Intr - 109215 109060 156 0 0 107 78 154 0.957 15.59 2.23 Intr - 111524 111468 57 2 0 62 54 79 0.100 0.06 2.22 Intr - 116537 116371 167 0 2 76 110 199 0.996 19.66 2.21 Intr - 117435 117276 160 0 1 89 97 146 0.738 14.24 2.20 Intr - 119605 119458 148 0 1 95 98 150 0.999 16.02 2.19 Intr - 120089 119987 103 0 1 114 99 -15 0.998 0.51 2.18 Intr - 120401 120281 121 2 1 106 79 103 0.999 10.35 2.17 Intr - 120632 120560 73 1 1 26 109 73 0.993 1.69 2.16 Intr - 121229 121119 111 1 0 96 98 97 0.999 10.08 2.15 Intr - 126544 126393 152 1 2 53 47 106 0.813 1.04 2.14 Intr - 129816 129742 75 0 0 91 93 34 0.885 2.99 2.13 Intr - 132426 132292 135 0 0 109 94 114 0.731 13.94 2.12 Intr - 134313 134300 14 1 2 44 110 9 0.416 -7.72 2.11 Intr - 135102 135017 86 2 2 81 110 43 0.677 4.34 2.10 Intr - 135598 135533 66 0 0 92 94 24 0.434 0.50 2.09 Intr - 137839 137737 103 2 1 79 105 50 0.830 4.11 2.08 Intr - 139692 139479 214 0 1 110 94 169 0.807 17.07 2.07 Intr - 142853 142640 214 1 1 101 20 218 0.790 14.10 2.06 Intr - 145667 145623 45 1 0 69 91 87 0.927 3.71 2.05 Intr - 146091 145924 168 2 0 60 89 46 0.502 0.04 2.04 Intr - 149506 149462 45 0 0 118 90 82 0.995 8.01 2.03 Intr - 156005 155961 45 1 0 74 94 42 0.499 0.01 2.02 Intr - 161659 161548 112 2 1 85 96 181 0.256 17.12 2.01 Init - 179766 179742 25 0 1 117 93 35 0.059 6.66 2.00 Prom - 185479 185440 40 -3.35 3.00 Prom + 188644 188683 40 -5.75 3.01 Sngl + 207095 207412 318 1 0 48 38 339 0.970 20.62 3.02 PlyA + 209034 209039 6 -0.45 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 179808 179969 162 2 0 72 39 175 0.851 7.95 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:150696802_150906805|GENSCAN_predicted_peptide_1|76_aa XQFILDLTVPEKHDLGQFVYVYMVYLKQDEYISTIAYVPAFPIALITDKPMDVYVFGIWL LKLQAANRPSFKSSFF >gi568815597r:150696802_150906805|GENSCAN_predicted_CDS_1|231_bp nntcagtttatccttgacttgactgttccagaaaaacatgatctgggtcagtttgtgtac gtttatatggtttaccttaaacaggatgaatacatttctaccatcgcatatgtaccagca ttccctatagcgcttattactgacaaacccatggatgtgtatgtttttggaatatggctg ctgaaactacaagcggcaaacagaccaagttttaagtcttcctttttctaa >gi568815597r:150696802_150906805|GENSCAN_predicted_peptide_2|1561_aa MAATTANPEMTSDVPSLGPAIASGNSGPGIQGGGAIVQRAIKRRPGLDFDDDGEGNSKFL RCDDDQMSNDKERFARLGAVHENGIPEFPWLVLVQTLYIFGPYRDRLAKCFLCLDHVNLL YIGFAFMLTFILSDDEQSSADKERLARENHSEIERRRRNKMTAYITELSDMVPTCSALAR KPDKLTILRMAVSHMKSLRGTGNTSTDGSYKPSFLTDQELKHLILEAADGFLFIVSCETG RVVYVSDSVTPVLNQPQSEWFGSTLYDQVHPDDVDKLREQLSTSENALTGRILDLKTGTV KKEGQQSSMRMCMGSRRSFICRMRCGSSSVDPVSVNRLSFVRNRCRNGLGSVKDGEPHFV VVHCTGYIKAWPPAVASPRVTSSPNCTDMSNVCQPTEFISRHNIEGIFTFVDHRCVATVG YQPQELLGKNIVEFCHPEDQQLLRDSFQQVVKLKGQVLSVMFRFRSKNQEWLWMRTSSFT FQNPYSDEIEYIICTNTNVKNSSQEPRPTLSNTIQRPQLGPTANLPLEMGSGQLAPRQQQ QQTELDMVPGRDGLASYNHSQVVQPVTTTGPEHSKPLEKSDGLFAQDRDPRFSEIYHNIN ADQSKGISSSTVPATQQLFSQGNTFPPTPRPAENFRNSGLAPPVTIVQPSASAGQMLAQI SRHSNPTQGATPTWTPTTRSGFSAQVATQATAKTRTSQFGVGSFQTPSSFSSMSLPGAPT ASPGAAAYPSLTNRGSNFAPETGQTAGQFQTRTAEGVGVWPQWQGQQPHHRSSSSEQHVQ QPPAQQPGQPEVFQPITGADFRNPDGINLAPLMTSEEVVQKMTGLKVPLSHSRSNDTLYI PEWEGRAPDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLV DCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIP EGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKG NKHWIIKNRMKRLVCVLLVCSSAVAQLHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLI WEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQRNITYKSNP NRILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCS TEKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTELP YGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGDLNGK EYWLVKNSKNQSNESSMLSTDTKKASILLIRKIYILMQNLGPLPNDVCLTMKLFYYDEVT PPDYQPPGFKDGDCEGVIFEGEPMYLNVGEVSTPFHIFKVKVTTERERMENIDSTILSPK QIKTPFQKILRDKDVEDEQEHYTSDDLDIETKMEEQEKNPASSELEEPSLVCEEDEIMRS KESPDLSISHSQVEQLVNKTSELDMSESKTRSGKVFQNKMSSPTEDYTFCKYILSAAIGR C >gi568815597r:150696802_150906805|GENSCAN_predicted_CDS_2|4686_bp atggcggcgactactgccaaccccgaaatgacatcagatgtaccatcactgggtccagcc attgcctctggaaactctggacctggaattcaaggtggaggagccattgtccagagggct attaagcggcgaccagggctggattttgatgatgatggagaagggaacagtaaatttttg aggtgtgatgatgatcagatgtctaacgataaggagcggtttgccagattgggagcagtg catgaaaatggtattcctgaattcccttggttggttcttgttcagactctgtatatcttt ggtccctacagagatcgattggcaaaatgctttctgtgtttagatcatgttaatttacta tatattggctttgcttttatgttgacctttatcttgtcggatgatgagcagagctctgcg gataaagagagacttgccagggaaaatcacagtgaaattgaacggcggcgacggaacaag atgacagcctacatcacagaactgtcagatatggtacccacctgtagtgccctggctcga aaaccagacaagctaaccatcttacgcatggcagtttctcacatgaagtccttgcgggga actggcaacacatccactgatggctcctataagccgtctttcctcactgatcaggaactg aaacatttgatcttggaggcagcagatggctttctgtttattgtctcatgtgagacaggc agggtggtgtatgtgtctgactccgtgactcctgttttgaaccagccacagtctgaatgg tttggcagcacactctatgatcaggtgcacccagatgatgtggataaacttcgtgagcag ctttccacttcagaaaatgccctgacagggcgtatcctggatctaaagactggaacagtg aaaaaggaaggtcagcagtcttccatgagaatgtgtatgggctcaaggagatcgtttatt tgccgaatgaggtgtggcagtagctctgtggacccagtttctgtgaataggctgagcttt gtgaggaacagatgcaggaatggacttggctctgtaaaggatggggaacctcacttcgtg gtggtccactgcacaggctacatcaaggcctggcccccagcagtggcatcacctagggta actagttctcccaactgtacagacatgagtaatgtttgtcaaccaacagagttcatctcc cgacacaacattgagggtatcttcacttttgtggatcaccgctgtgtggctactgttggc taccagccacaggaactcttaggaaagaatattgtagaattctgtcatcctgaagaccag cagcttctaagagacagcttccaacaggtagtgaaattaaaaggccaagtgctgtctgtc atgttccggttccggtctaagaaccaagaatggctctggatgagaaccagctcctttact ttccagaacccttactcagatgaaattgagtacatcatctgtaccaacaccaatgtgaag aactctagccaagaaccacggcctacactctccaacacaatccagaggccacaactaggt cccacagctaatttacccctggagatgggctcaggacagctggcacccaggcagcagcaa cagcaaacagaattggacatggtaccaggaagagatggactggccagctacaatcattcc caggtggttcagcctgtgacaaccacaggaccagaacacagcaagccccttgagaagtca gatggtttatttgcccaggatagagatccaagattttcagaaatctatcacaacatcaat gcggatcagagtaaaggcatctcctccagcactgtccctgccacccaacagctattctcc cagggcaacacattccctcctaccccccggccggcagagaatttcaggaatagtggccta gcccctcctgtaaccattgtccagccatcagcttctgcaggacagatgttggcccagatt tcccgccactccaaccccacccaaggagcaaccccaacttggacccctactacccgctca ggcttttctgcccaggtggctacccaggctactgctaagactcgtacttcccagtttggt gtgggcagctttcagactccatcctccttcagctccatgtccctccctggtgccccaact gcatcgcctggtgctgctgcctaccctagtctcaccaatcgtggatctaactttgctcct gagactggacagactgcaggacaattccagacacggacagcagagggtgtgggtgtctgg ccacagtggcagggccagcagcctcatcatcgttcaagttctagtgagcaacatgttcaa caaccgccagcacagcaacctggccagcctgaggtcttccagccgatcactggagctgac ttccgcaatcccgatggaataaatctagcacccctgatgaccagtgaagaggtggttcag aagatgactggactcaaagtacccctgtctcattcccgcagtaatgacaccctttatatc ccagaatgggaaggtagagccccagactctgtcgactatcgaaagaaaggatatgttact cctgtcaaaaatcagggtcagtgtggttcctgttgggcttttagctctgtgggtgccctg gagggccaactcaagaagaaaactggcaaactcttaaatctgagtccccagaacctagtg gattgtgtgtctgagaatgatggctgtggagggggctacatgaccaatgccttccaatat gtgcagaagaaccggggtattgactctgaagatgcctacccatatgtgggacaggaagag agttgtatgtacaacccaacaggcaaggcagctaaatgcagagggtacagagagatcccc gaggggaatgagaaagccctgaagagggcagtggcccgagtgggacctgtctctgtggcc attgatgcaagcctgacctccttccagttttacagcaaaggtgtgtattatgatgaaagc tgcaatagcgataatctgaaccatgcagttttggcagtgggatatggaatccagaaggga aacaagcactggataattaaaaacagaatgaaacggctggtttgtgtgctcttggtgtgc tcctctgcagtggcacagttgcataaagatcctaccctggatcaccactggcatctctgg aagaaaacctatggcaaacaatacaaggaaaagaatgaagaagcagtacgacgtctcatc tgggaaaagaatctaaagtttgtgatgcttcacaacctggagcattcaatgggaatgcac tcatacgatctgggcatgaaccacctgggagacatgaccagtgaagaagtgatgtctttg atgagttccctgagagttcccagccagtggcagagaaatatcacatataagtcaaaccct aatcggatattgcctgattctgtggactggagagagaaagggtgtgttactgaagtgaaa tatcaaggttcttgtggtgcttgctgggctttcagtgctgtgggggccctggaagcacag ctgaagctgaaaacaggaaagctggtgtctctcagtgcccagaacctggtggattgctca actgaaaaatatggaaacaaaggctgcaatggtggcttcatgacaacggctttccagtac atcattgataacaagggcatcgactcagacgcttcctatccctacaaagccatggatcag aaatgtcaatatgactcaaaatatcgtgctgccacatgttcaaagtacactgaacttcct tatggcagagaagatgtcctgaaagaagctgtggccaataaaggcccagtgtctgttggt gtagatgcgcgtcatccttctttcttcctctacagaagtggtgtctactatgaaccatcc tgtactcagaatgtgaatcatggtgtacttgtggttggctatggtgatcttaatgggaaa gaatactggcttgtgaaaaacagtaaaaaccaaagcaacgaatctagcatgttgtctact gacaccaagaaagcaagcattctcctcattcgcaagatttatatcctaatgcaaaatctg gggcctttacctaatgatgtttgtttgaccatgaaacttttttactatgatgaagttaca cccccagattaccagcctcccggttttaaggatggtgattgtgaaggagttatatttgaa ggggaacctatgtatttaaatgtgggagaagtctcaacaccttttcacatcttcaaagta aaagtgaccactgagagagaacgaatggaaaatattgactcaactatactatcaccaaaa caaataaaaacaccatttcaaaaaatcctgagggacaaagatgtagaagatgaacaggag cattatacaagtgatgatttggacattgaaactaaaatggaagaacaggaaaaaaaccct gcatcttctgaacttgaagaaccaagtttagtttgtgaggaagatgaaattatgaggtct aaagaaagtccagatctttctatttctcattctcaggttgagcagttagtcaataaaaca tctgaacttgatatgtctgaaagcaaaacaagaagtggaaaagtctttcagaataaaatg agcagccctactgaagattatacattttgcaagtacatactttcagcagccattggaaga tgttaa >gi568815597r:150696802_150906805|GENSCAN_predicted_peptide_3|105_aa MGDVEKGKKIFVQKCAQCHTVEKGGKHKTGPNLHGLFSQKTGQAVGFSYTDANKNKGIIW GEDTLMEYLENPKKYIPGTKMIFAGIKKKAEKADLTAYLKKATNE >gi568815597r:150696802_150906805|GENSCAN_predicted_CDS_3|318_bp atgggtgatgttgagaaaggcaagaagatttttgttcagaagtgtgcccagtgtcacacc gtggaaaagggaggcaagcacaagactgggcctaatctccatggtctcttcagtcagaag acaggtcaggctgttggattctcttacacagatgccaataagaacaaaggcatcatctgg ggagaggatacgctgatggagtatttggaaaatcccaagaagtacatccctggaacaaaa atgatctttgccggcattaaaaagaaggcagaaaaggccgacttgacagcttatctcaaa aaagctactaatgagtaa