GENSCAN 1.0 Date run: 8-Nov-116 Time: 14:16:25 Sequence gi568815578r:37154209_37356889 : 202681 bp : 47.17% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.09 Intr - 959 855 105 2 0 98 105 83 0.594 11.41 1.08 Intr - 3747 3652 96 0 0 81 92 15 0.706 1.41 1.07 Intr - 4828 4684 145 0 1 52 78 83 0.954 3.98 1.06 Intr - 5968 5889 80 1 2 136 64 78 0.977 8.55 1.05 Intr - 7458 7421 38 1 2 83 100 29 0.698 1.58 1.04 Intr - 13332 13249 84 1 0 50 70 77 0.568 1.89 1.03 Intr - 17849 17715 135 0 0 28 89 124 0.843 7.04 1.02 Intr - 19908 19787 122 2 2 112 98 88 0.994 12.34 1.01 Init - 20078 20062 17 2 2 53 88 -26 0.076 -6.22 1.00 Prom - 23619 23580 40 -5.66 2.00 Prom + 23749 23788 40 -8.16 2.01 Init + 24030 24043 14 2 2 72 66 12 0.086 -4.19 2.02 Intr + 25044 25161 118 0 1 73 76 139 0.121 11.67 2.03 Intr + 25267 25357 91 0 1 44 -5 94 0.085 -4.73 2.04 Intr + 29972 30165 194 0 2 47 116 216 0.869 19.51 2.05 Intr + 44189 44284 96 1 0 53 105 71 0.957 5.51 2.06 Intr + 44842 45017 176 0 2 99 69 124 0.987 10.34 2.07 Intr + 49677 49752 76 0 1 90 48 76 0.996 3.32 2.08 Intr + 50559 50693 135 2 0 79 67 135 0.942 11.26 2.09 Intr + 53065 53241 177 0 0 53 94 261 0.885 23.32 2.10 Intr + 55839 55957 119 2 2 80 110 44 0.937 5.06 2.11 Intr + 59552 59657 106 2 1 77 92 131 0.989 12.62 2.12 Intr + 69670 69761 92 0 2 98 94 97 0.995 10.09 2.13 Intr + 71480 71594 115 2 1 81 71 145 0.986 12.55 2.14 Intr + 74342 74536 195 1 0 39 68 201 0.914 12.81 2.15 Intr + 75765 75851 87 2 0 98 76 75 0.954 7.47 2.16 Intr + 78088 78183 96 0 0 72 105 85 0.935 8.81 2.17 Intr + 79812 79887 76 2 1 67 119 60 0.999 6.09 2.18 Intr + 82372 82501 130 2 1 81 94 184 0.914 18.05 2.19 Term + 85405 85486 82 1 1 117 38 13 0.278 -3.63 2.20 PlyA + 87389 87394 6 1.05 3.12 PlyA - 88660 88655 6 1.05 3.11 Term - 88801 88772 30 1 0 103 39 37 0.264 -1.85 3.10 Intr - 89523 89385 139 2 1 97 19 96 0.227 4.07 3.09 Intr - 100121 100002 120 1 0 54 80 95 0.635 4.91 3.08 Intr - 102290 102186 105 1 0 106 86 218 0.990 22.73 3.07 Intr - 102461 102406 56 2 2 73 84 -27 0.421 -6.82 3.06 Intr - 102700 102599 102 1 0 129 115 8 0.513 7.97 3.05 Intr - 107464 107416 49 0 1 13 105 60 0.205 -1.12 3.04 Intr - 107647 107535 113 1 2 18 94 125 0.479 5.28 3.03 Intr - 122033 121950 84 2 0 7 111 66 0.086 0.82 3.02 Intr - 125905 125883 23 2 2 46 102 36 0.030 -1.94 3.01 Init - 132462 132309 154 0 1 101 45 113 0.437 8.44 3.00 Prom - 137575 137536 40 -4.26 4.00 Prom + 139284 139323 40 -3.26 4.01 Init + 147056 147205 150 1 0 92 101 233 0.996 25.04 4.02 Intr + 159628 159721 94 0 1 19 37 134 0.019 0.94 4.03 Intr + 162297 162462 166 1 1 77 58 45 0.048 -0.58 4.04 Intr + 174009 174145 137 0 2 85 94 38 0.369 4.31 4.05 Intr + 179601 179681 81 1 0 72 96 45 0.828 3.31 4.06 Term + 179799 179941 143 1 2 52 47 117 0.840 2.09 4.07 PlyA + 181025 181030 6 1.05 5.05 PlyA - 181114 181109 6 1.05 5.04 Term - 182115 182071 45 0 0 91 53 37 0.137 -2.29 5.03 Intr - 189120 189031 90 0 0 49 101 80 0.317 5.59 5.02 Intr - 199861 199757 105 1 0 125 14 35 0.194 0.21 5.01 Intr - 202251 201972 280 2 1 99 131 70 0.607 9.88 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578r:37154209_37356889|GENSCAN_predicted_peptide_1|274_aa MKKIRRICSQEEVVIPCAYDSDSESVDLELSNLEIIKKGSSSIELTDLDIPDIPGLHCEP LSHSPRHLTQQDPLSEAIVEKLIQSIQKVFNVPDSSRNCLGNLGYKDKEDKIPIYAAKQG KRNPLEAAETQKVLVQEERPHSLSSSMRQEVFVTIADLSYQDVHLLLGSEDRAELFSLTI KSIITLPSVRTLTQIQEIMPNGTCNTECLYRQTFQAFSEMLQSLVVKDPHLENLDTIIKH LVPWLQSVKDHERERATASMAQVLKCLSKHLNLK >gi568815578r:37154209_37356889|GENSCAN_predicted_CDS_1|822_bp atgaaaaaaataagaaggatctgtagtcaggaagaagtagtgatcccctgtgcctatgac agtgattcagaaagtgtggatttggagctgagcaacttagagattattaaaaaaggctca agtagcattgaactgacagacttggacatccctgacatccctggactccattgtgagccc ctgtcacatagccccagacacctgacccaacaggacccgctcagtgaggccattgttgag aaactgatccagtccatccagaaggttttcaatgtgcctgacagttccaggaactgtctt gggaatttgggctacaaagacaaagaagacaaaatccctatttatgcagccaagcaaggt aagagaaatcctctagaagcagctgaaacacaaaaggtactggtacaagaggaacgcccg cattctctgtccagttccatgcgccaggaggtctttgtcaccatcgctgatctcagttac caagatgtccatttgctgttgggctctgaagatcgagctgagttgttcagtcttaccatc aagagtataatcactctgccctctgtaaggacccttacccagatacaggaaatcatgccc aatgggacctgcaacacagagtgtctttacaggcagacgtttcaggcattctctgagatg ctccagagtttggtggtaaaagacccacatttggaaaatcttgacaccattattaagcac ttggtcccctggttacagtcagtcaaagaccatgagcgggaacgggccacggccagcatg gctcaagttctgaagtgcctatccaaacatctcaacttgaag >gi568815578r:37154209_37356889|GENSCAN_predicted_peptide_2|724_aa MSRLQACRESLASPVAGSWSHFPERKSARGSDSGGTCSEEWRRRGHGHKLWLGRSRIEGP KEGCELVGVPATWRGSSTVFLLALTIIASTWALTPTHYLTKHDVERLKASLDRPFTNLES AFYSIVGLSSLGAQVPDAKKACTYIRSNLDPSNVDSLFYAAQASQALSGCEISISNETKD LLLAAVSEDSSVTQIYHAVAALSGFGLPLASQEALSALTARLSKEETVLATVQALQTASH LSQQADLRSIVEEIEDLVARLDELGGVYLQFEEGLETTALFVAATYKLMDHVGTEPSIKE DQVIQLMNAIFSKKNFESLSEAFSVASAAAVLSHNRYHVPVVVVPEGSASDTHEQAILRL QVTNVLSQPLTQATVKLEHAKSVASRATVLQKTSFTPVGDVFELNFMNVKFSSGYYDFLV EVEGDNRYIANTVELRVKISTEVGITNVDLSTVDKDQSIAPKTTRVTYPAKAKGTFIADS HQNFALFFQLVDVNTGAELTPHQTFVRLHNQKTGQEVVFVAEPDNKNVYKFELDTSERKI EFDSASGTYTLYLIIGDATLKNPILWNVADVVIKFPEEEAPSTVLSQNLFTPKQEIQHLF REPEKRPPTVVSNTFTALILSPLLLLFALWIRIGANVSNFTFAPSTIIFHLGHAAMLGLM YVYWTQLNMFQTLKYLAILGSVTFLAGNRMLAQQAVKRFNGVVIYLQKNVPFLVYNSVNP EKPL >gi568815578r:37154209_37356889|GENSCAN_predicted_CDS_2|2175_bp atgtcgaggctgcaagcctgccgcgagtccctggcgtcccctgtggcgggctcttggagc cactttcccgagcggaagtcagcccgcggctcggactccggcgggacctgctcggaggaa tggcgccgccgggggcatgggcacaagctctggctggggcgctctcggatcgagggtccg aaggagggctgcgagctggtgggagtgcccgcgacctggcggggttcaagcactgtcttc ctgttggccctgacaatcatagccagcacctgggctctgacgcccactcactacctcacc aagcatgacgtggagagactaaaagcctcgctggatcgccctttcacaaatttggaatct gccttctactccatcgtgggactcagcagccttggtgctcaggtgccagatgcaaagaaa gcatgtacctacatcagatctaaccttgatcccagcaatgtggattccctcttctacgct gcccaggccagccaggccctctcaggatgtgagatctctatttcaaatgagaccaaagat ctgcttctggcagctgtcagtgaggactcatctgttacccagatctaccatgcagttgca gctctaagtggctttggccttcccttggcatcccaagaagcactcagtgcccttactgct cgtctcagcaaggaggagactgtgctggcaacagtccaggctctgcagacagcatcccac ctgtcccagcaggctgacctgaggagcatcgtggaggagattgaggaccttgttgctcgc ctggatgaactcgggggcgtgtatctccagtttgaagaaggactggaaacaacagcgtta tttgtggctgccacctacaagctcatggatcatgtggggactgagccatccattaaggag gatcaggtcatccagctgatgaacgcgatcttcagcaagaagaactttgagtccctctcc gaagccttcagcgtggcctctgcagctgctgtgctctcgcataatcgctaccacgtgcca gttgtggttgtgcctgagggctctgcttccgacactcatgaacaggctatcttgcggttg caagtcaccaatgttctgtctcagcctctgactcaggccactgttaaactagaacatgct aaatctgttgcttccagagccactgtcctccagaagacatccttcacccctgtaggggat gtttttgaactaaatttcatgaacgtcaaattttccagtggttattatgacttccttgtc gaagttgaaggtgacaaccggtatattgcaaataccgtagagctcagagtcaagatctcc actgaagttggcatcacaaatgttgatctttccaccgtggataaggatcagagcattgca cccaaaactacccgggtgacatacccagccaaagccaagggcacattcatcgcagacagc caccagaacttcgccttgttcttccagctggtagatgtgaacactggtgctgaactcact cctcaccagacatttgtccgactccataaccagaagactggccaggaagtggtgtttgtt gccgagccagacaacaagaacgtgtacaagtttgaactggatacctctgaaagaaagatt gaatttgactctgcctctggcacctacactctctacttaatcattggagatgccactttg aagaacccaatcctctggaatgtggctgatgtggtcatcaagttccctgaggaagaagct ccctcgactgtcttgtcccagaaccttttcactccaaaacaggaaattcagcacctgttc cgcgagcctgagaagaggccccccaccgtggtgtccaatacattcactgccctgatcctc tcgccgttgcttctgctcttcgctctgtggatccggattggtgccaatgtctccaacttc acttttgctcctagcacgattatatttcacctgggacatgctgctatgctgggactcatg tatgtctactggactcagctcaacatgttccagaccttgaagtacctggccatcctgggc agtgtgacgtttctggctggcaatcggatgctggcccagcaggcagtcaagaggtttaat ggagttgtaatttatctgcagaaaaatgtaccctttttagtgtacaattctgtgaatcct gaaaaacctctgtag >gi568815578r:37154209_37356889|GENSCAN_predicted_peptide_3|324_aa MDTDAHIERTSCEDEGRYRDDASTAKEHHRLPAGHQKLVKRHGTGSSQPSEGEHNVGGKV FTESENSTLKGNDEVLWSNTLTLQTAQENEEINDGNARRLPEQTPSPGPLDLSSASEQRD ICRIRNTESKRPTAILFAHGLVPPRVKDATLGVLLCDPHPQQQLPLLPTSPFDPQRRQTQ LHRGHSEGSESLLGMRRYADAIFTNSYRKVLGQLSARKLLQDIMSRQQGESNQERGARAR LGRQVDSMWAEQKQMELESILVALLQKHSHWLSFVPSDQDTQQPLYFHPQALLPLLHTTV HQPRTQSVFQARTGLDPIQNGIEF >gi568815578r:37154209_37356889|GENSCAN_predicted_CDS_3|975_bp atggacacagatgcacacatagagagaacgtcatgtgaagatgaaggcagatatcgggac gatgcctctacagccaaggaacaccacagattgccagcaggccaccagaagctagtcaag aggcatgggacaggctcctcacagccctcagaaggagagcacaacgtgggcggtaaggta ttcactgaatcagagaactcgactcttaaaggaaatgatgaggtcttatggtctaatacc ctcactttacagacagctcaggaaaatgaagagataaatgatgggaacgccaggcggctg ccagagcaaacacccagcccagggcccctggatttgagcagtgcctcggagcagagggat atctgccgcatcagaaacactgagtccaagaggcccaccgccatcctctttgcccatgga ctggtgccaccccgggtgaaggatgccactctgggtgttcttctttgtgatcctcaccct cagcaacagctcccactgctccccacctccccctttgaccctcagaggaggcagacacag cttcacagaggtcactcagaggggtctgagtctctcttggggatgcggcggtatgcagat gccatcttcaccaacagctaccggaaggtgctgggccagctgtccgcccgcaagctgctc caggacatcatgagcaggcagcagggagagagcaaccaagagcgaggagcaagggcacgg cttggtcgtcaggtagacagcatgtgggcagaacaaaagcaaatggaattggagagcatc ctggtggccctgctgcagaagcacagccactggctgtcctttgttcccagtgaccaggac acccagcagcctttgtacttccaccctcaggccctactacccctgctccacaccactgtc caccagccccgcacccagtctgtcttccaggcccgcacaggtctggatcccattcagaac ggcatagagttctag >gi568815578r:37154209_37356889|GENSCAN_predicted_peptide_4|256_aa MASDLDFSPPEVPEPTFLENLLRYGLFLGAIFQLICVLAIIVPIPKSHEAGQTMGAVCRV AKETGIPVDEGDQRKLWGLTQCSGSEEAVTPYDRRDLDSRNSPQAPAGQSTTSSSFCFCD GLESRGLKHTVSIDCIRDPESLLLCSHLVETPNLKCGTLLLKPEKDPGLWPLAKAQAVQF SSAQAQCKGASLPHYPLLPYPGNVCGRRKGAKCTTDVTEHPQQVPPEGPAQSGCGKAKCM TSRLEDFGPQEPWAVD >gi568815578r:37154209_37356889|GENSCAN_predicted_CDS_4|771_bp atggcctctgacctagacttctcacctccggaggtgcccgagcccactttcctggagaac ctgctacggtacggactcttcctgggagccatcttccagctcatctgtgtgctggccatc atcgtacccattcccaagtcccacgaggcgggccaaaccatgggagcagtttgccgggtg gccaaggaaacaggtattcctgttgatgaaggtgaccagcggaagctctggggcctgact cagtgctcagggtctgaggaggctgtgacgccctatgaccgcagagatctagacagtcgt aacagtccccaggctccagctgggcaatccaccacttcctcttccttctgcttctgtgac ggtttagagtcaagggggctgaaacacactgtgagcatagactgtattagggatcctgag tctttgctcctatgtagtcacttggtagaaacgccgaacctgaaatgtggcactttgctt ctcaagccagagaaggatccaggcttatggcccttagcaaaagcccaagcggttcagttc agctcagcccaggcccagtgcaaaggagcctcccttcctcattacccgcttctgccctac cccgggaacgtgtgtggacgtagaaagggtgcaaaatgcacaacggatgtcacagagcat ccccagcaggtgcccccagagggaccagcccagagcgggtgcggaaaggccaagtgcatg accagccgccttgaggactttggcccccaggaaccttgggctgtggactag >gi568815578r:37154209_37356889|GENSCAN_predicted_peptide_5|173_aa XPICSQCGHQGHLTEATLTLTQTLPRMWDTLLSSSSPSSLPEEQAPRPSTHWHQPTDPSQ DSMAPQTSKRSCSQGTFRPFVKLYLPSEVPPILPVNTNCNKITRTEHLLCAGIAEGVGHK TSSLQQAHGASLRTPTGTTSQPKSTQQMIDAIDVNIAEMACMSAPVLELGTDK >gi568815578r:37154209_37356889|GENSCAN_predicted_CDS_5|522_bp nngcccatctgttctcagtgcggccaccagggccacctcactgaagccacactgaccctg acccagacactacccagaatgtgggatacattactctcatccagtagcccttcctcactg cccgaggagcaggcccccaggccctctacccattggcaccagccaactgacccaagtcaa gactccatggctcctcagacctccaagaggagctgctcacagggcaccttcaggcccttt gtcaagctctatctcccctcagaggtccctcccatccttccggttaatacaaattgcaat aaaataacacgcaccgagcacttactgtgtgctggcattgctgagggcgttggacacaag acttcatccttgcaacaagcccatggagccagtcttagaacaccaacaggaaccacttcc caacccaaaagcacccagcagatgatagatgctattgacgtcaacattgctgagatggcc tgcatgtcagctcctgtgctggagctggggacagacaagtga