GENSCAN 1.0 Date run: 6-Nov-116 Time: 14:49:59 Sequence gi568815586r:50963749_51176561 : 212813 bp : 43.58% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 4310 4422 113 0 2 46 43 143 0.463 4.32 1.02 PlyA + 5249 5254 6 1.05 2.00 Prom + 12614 12653 40 -4.76 2.01 Sngl + 13126 14256 1131 0 0 60 44 336 0.992 22.98 2.02 PlyA + 14894 14899 6 1.05 3.18 PlyA - 15513 15508 6 1.05 3.17 Term - 15945 15880 66 0 0 114 39 29 0.032 -1.46 3.16 Intr - 24687 24634 54 0 0 96 115 19 0.172 4.58 3.15 Intr - 27200 27047 154 1 1 110 111 109 0.999 15.47 3.14 Intr - 27924 27851 74 0 2 107 74 63 0.999 5.00 3.13 Intr - 28621 28442 180 1 0 111 94 150 0.777 17.96 3.12 Intr - 29181 29062 120 0 0 89 94 -5 0.646 0.89 3.11 Intr - 30882 30796 87 0 0 58 86 112 0.991 8.17 3.10 Intr - 32039 31881 159 2 0 88 89 163 0.828 16.58 3.09 Intr - 33160 33069 92 2 2 45 105 68 0.797 3.91 3.08 Intr - 35493 35419 75 1 0 129 37 77 0.935 6.19 3.07 Intr - 35667 35597 71 2 2 84 91 13 0.999 0.03 3.06 Intr - 36671 36565 107 2 2 69 100 124 0.998 10.71 3.05 Intr - 41159 41040 120 2 0 48 105 31 0.718 1.49 3.04 Intr - 41688 41563 126 0 0 99 61 65 0.977 5.78 3.03 Intr - 44796 44728 69 0 0 86 88 126 0.357 11.78 3.02 Intr - 53738 53714 25 1 1 112 62 24 0.091 0.23 3.01 Init - 62278 62178 101 1 2 79 13 190 0.288 8.34 3.00 Prom - 73860 73821 40 -2.46 4.00 Prom + 79957 79996 40 -3.66 4.01 Init + 84609 84730 122 2 2 96 85 88 0.969 6.96 4.02 Intr + 85286 85437 152 2 2 93 93 95 0.944 10.31 4.03 Intr + 90064 90112 49 2 1 50 82 46 0.665 -2.06 4.04 Intr + 92087 92273 187 2 1 121 61 10 0.344 1.29 4.05 Intr + 92602 92754 153 0 0 69 103 56 0.621 5.47 4.06 Term + 96362 96460 99 1 0 80 48 62 0.635 -0.47 4.07 PlyA + 98551 98556 6 1.05 5.19 PlyA - 99429 99424 6 -3.24 5.18 Term - 100921 99998 924 1 0 106 48 768 0.996 66.78 5.17 Intr - 104221 103925 297 1 0 96 82 420 0.995 39.27 5.16 Intr - 104926 104801 126 1 0 65 88 37 0.742 2.28 5.15 Intr - 110334 110075 260 1 2 79 46 300 0.956 22.08 5.14 Intr - 112831 112663 169 1 1 20 94 233 0.138 16.62 5.13 Intr - 114236 114128 109 1 1 70 34 49 0.032 -2.01 5.12 Intr - 120037 119947 91 2 1 64 33 128 0.015 3.95 5.11 Intr - 132292 132241 52 1 1 97 85 18 0.498 0.88 5.10 Intr - 135170 135028 143 0 2 117 61 78 0.974 8.07 5.09 Intr - 136031 135907 125 1 2 115 86 57 0.940 8.43 5.08 Intr - 142865 142777 89 2 2 84 110 14 0.977 1.97 5.07 Intr - 143598 143488 111 0 0 59 83 53 0.844 2.48 5.06 Intr - 145525 145373 153 1 0 105 61 63 0.974 5.57 5.05 Intr - 147235 147129 107 2 2 95 110 23 0.949 5.13 5.04 Intr - 152672 152567 106 2 1 84 95 78 0.997 7.89 5.03 Intr - 153999 153923 77 1 2 98 93 99 0.904 10.63 5.02 Intr - 155187 155133 55 0 1 34 113 -6 0.195 -4.95 5.01 Init - 157919 157917 3 2 0 113 81 0 0.323 1.80 5.00 Prom - 159659 159620 40 -5.76 6.03 PlyA - 159806 159801 6 1.05 6.02 Term - 161420 160880 541 1 1 40 43 749 0.003 59.33 6.01 Init - 208674 208553 122 0 2 101 105 155 0.995 17.61 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 24687 24577 111 0 0 96 41 84 0.819 3.06 S.002 Init - 112813 112663 151 1 1 37 94 227 0.860 18.40 S.003 Term - 207534 207489 46 2 1 75 50 43 0.815 -4.12 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:50963749_51176561|GENSCAN_predicted_peptide_1|37_aa XLATERGIVSNNKKKKEEEEEEEENEEEEEKGFITEI >gi568815586r:50963749_51176561|GENSCAN_predicted_CDS_1|114_bp nccttggccacagagcgaggcattgtctcaaataataagaagaagaaggaggaggaggag gaggaggaggagaatgaggaggaggaggagaagggatttattacagagatatga >gi568815586r:50963749_51176561|GENSCAN_predicted_peptide_2|376_aa MSEIPFTIASKRIKYLGIQLTREVKDLSKENYKPLLNEIKEDTNKWKNIPCSWVGRINIV KTAILPKVIYRFNAIPIKLPMTFFTELGKTTLKFIWNQKRACIAKSILSQKNKAGSITLP DFKLYYQATVTKTAWYWCQNRDIDQWNRTEPSEIIPHIYNHLTFDKPDKNEKWGKDSLFN KWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGITIQDIGMGKD FMSKPPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTEWEKISAIYSSDKGLISRI YKELKQFHKKKNNPINKWVKDMNRHFSKEDIYAANRHMKKCSSSLAIREMQIKTTMRYHL TPVRMAIIKKSGNNRY >gi568815586r:50963749_51176561|GENSCAN_predicted_CDS_2|1131_bp atgagtgaaatcccattcacaattgcttcaaagagaataaaatacctaggaatccaactt acaagggaggtgaaggacctctccaaggagaactacaaaccactgctcaatgaaataaaa gaggatacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcgtg aaaacggccatactgcccaaggtaatttatagattcaatgccatccccatcaagctacca atgactttcttcacagaattgggaaaaactactttaaagttcatatggaaccaaaaaaga gcctgcattgccaagtcaatcctaagccaaaagaacaaagctggaagcatcacgctacct gacttcaaactatactaccaggctacagtaaccaaaacagcatggtactggtgccaaaac agagatatagaccaatggaacagaacagagccctcagaaataataccacacatctacaac catctgacctttgacaaacctgacaaaaacgagaaatggggaaaggattccctatttaac aaatggtgctgggaaaactggttagccatatgtagaaagctgaaactggatcccttcctt acaccttatacaaaaattaattcaagatggattaaagacttaaacgttagacctaaaacc ataaaaaccctagaagaaaacctaggcattaccattcaggacataggcatgggcaaggac ttcatgtctaaaccaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatcta attaaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacct acagaatgggagaaaatttctgcaatctactcatctgacaaagggctaatatccagaatc tacaaagaactcaaacaatttcacaagaaaaaaaacaaccccatcaacaagtgggtgaag gatatgaacagacacttctcaaaagaagacatttatgcagccaacagacacatgaaaaaa tgctcatcatcactggccatcagagaaatgcaaatcaaaaccacaatgagataccatctc acaccagttagaatggcgatcattaaaaagtcaggaaacaacaggtactag >gi568815586r:50963749_51176561|GENSCAN_predicted_peptide_3|559_aa MAQLPAAPDGAPGLCRGALTCFDASKEADGHRARDGLYYQFLSPGDSEEYFATYFNEKIS IPEEEYSCFSFRKLWAFTGPGFLMSIAYLDPGNIESDLQSGAVAGFKLLWILLLATLVGL LLQRLAARLGVVTGLHLAEVCHRQYPKVPRVILWLMVELAIIGSDMQEVIGSAIAINLLS VGRIPLWGGVLITIADTFVFLFLDKYGLRKLEAFFGFLITIMALTFGYEASGCRTPQIEQ AVGIVGAVIMPHNMYLHSALVKSRQVNRNNKQEVREANKYFFIESCIALFVSFIINVFVV SVFAEAFFGKTNEQVVEVCTNTSSPHAGLFPKDNSTLAVDIYKGGVVLGCYFGPAALYIW AVGILAAGQSSTMTGTYSGQFVMETVICSYVFFQGFLNLKWSRFARVVLTRSIAIIPTLL VAVFQDVEHLTGMNDFLNVLQSLQLPFALIPILTFTSLRPVMSDFANGLGWRIAGGILVL IICSINMYFVVVYVRDLGHVALYVVAAVVSVAYLGFVFYLGWQCLIALGMSFLDCGHTGL LKLQIMRAANPATQAIESF >gi568815586r:50963749_51176561|GENSCAN_predicted_CDS_3|1680_bp atggcccagctcccagctgcaccggatggcgcgcccggcctgtgtcggggtgcgctgacc tgcttcgacgcctcgaaagaggccgacgggcacagggcaagggatggcctttattaccag tttttgtcccctggggactcagaggagtacttcgccacttactttaatgagaagatctcc attcctgaggaggagtactcttgttttagctttcgtaaactctgggctttcaccggacca ggttttcttatgagcattgcctacctggatccaggaaatattgaatccgatttgcagtct ggagcagtggctggatttaagttgctctggatccttctgttggccacccttgtggggctg ctgctccagcggcttgcagctagactgggagtggttactgggctgcatcttgctgaagta tgtcaccgtcagtatcccaaggtcccacgagtcatcctgtggctgatggtggagttggct atcatcggctcagacatgcaagaagtcattggctcagccattgctatcaatcttctgtct gtaggaagaattcctctgtggggtggcgttctcatcaccattgcagatacttttgtattt ctcttcttggacaaatatggcttgcggaagctagaagcattttttggctttctcatcact attatggccctcacatttggatatgaggcaagtggctgtcgcactccacagattgaacag gctgtgggcatcgtgggagctgtcatcatgccacacaacatgtacctgcattctgcctta gtcaagtctagacaggtaaaccggaacaataagcaggaagttcgagaagccaataagtac tttttcattgaatcctgcattgcactctttgtttccttcatcatcaatgtctttgttgtc tcagtctttgctgaagcattttttgggaaaaccaacgagcaggtggttgaagtctgtaca aataccagcagtcctcatgctggcctctttcctaaagataactcgacactggctgtggac atctacaaagggggtgttgtgctgggatgttactttgggcctgctgcactctacatttgg gcagtggggatcctggctgcaggacagagctccaccatgacaggaacctattctggccag tttgtcatggagacagtcatctgctcctatgttttcttccagggattcctgaacctaaag tggtcacgctttgcccgagtggttctgactcgctctattgccatcatccccactctgctt gttgctgtcttccaagatgtagagcatctaacagggatgaatgactttctgaatgttcta cagagcttacagcttccctttgctctcatacccatcctcacatttacgagcttgcggcca gtaatgagtgactttgccaatggactaggctggcggattgcaggaggaatcttggtcctt atcatctgttccatcaatatgtactttgtagtggtttatgtccgggacctagggcatgtg gcattatatgtggtggctgctgtggtcagcgtggcttatctgggctttgtgttctacttg ggttggcaatgtttgattgcactgggcatgtccttcctggactgtgggcatacgggcctc ttgaaacttcagataatgagagcagccaaccctgcaactcaggctatagagtccttctga >gi568815586r:50963749_51176561|GENSCAN_predicted_peptide_4|253_aa MALSRVCWARSAVWGSAVTPGHFVTRRLQLGRSGLAWGAPRSSKLHLSPKADVKNLMSYV VTKTKAINGKYHRFLGRHFPRFYVLYTIFMKGIISIPPFANYLVFLLMYLFPRQLLIRHF WTPKQQTDFLDIYHAFRKQSHPEIISYLEKVIPLISDAGLRWRLTDLCTKKALSRAMLLT SYLPPPLLRHRLKTHTTVIHQLDKALAKLGIGQLTAQEVKSPSLSIVFSGLYYSPLWIFN SAASTVMQQSNCN >gi568815586r:50963749_51176561|GENSCAN_predicted_CDS_4|762_bp atggcgctctccagggtgtgctgggctcggtcggctgtgtggggctcggcagtcacccct ggacattttgtcacccggaggctgcaacttggtcgctctggcctggcttggggggcccct cggtcttcaaagcttcacctttctccaaaggcagatgtgaagaacttgatgtcttatgtg gtaaccaagacaaaagcgattaatgggaaataccatcgtttcttgggtcgtcatttcccc cgcttctatgtcctgtacacaatcttcatgaaaggtattatttccattccaccttttgcc aactacctggtcttcttgctaatgtacctgtttcccaggcaactactgatcaggcatttc tggaccccaaaacaacaaactgatttcttagatatctatcatgctttccggaagcagtcc cacccagaaattattagttatttagaaaaggtcatccctctcatttctgatgcaggactc cggtggcgtctgacagatctgtgcaccaagaaagccttgagccgggccatgcttctcaca tcttacctgcctcctcccttgttgagacatcgtttgaagactcatacaactgtgattcac caactggacaaggctttggcaaagctggggattggccagctgactgctcaggaagtaaaa tcgccttcactctccattgtcttttctgggctgtattacagccctctgtggatcttcaac tctgctgcctccactgtgatgcagcagtccaactgtaactga >gi568815586r:50963749_51176561|GENSCAN_predicted_peptide_5|998_aa MGDITAGNLNGILGMKLHQGQSYEIRMLDNRKLGELPEINGKLVKSIFRVVFHDRRLQYT EHQQLEGWRWNRPGDRILDIDIPMSVGIIDPRANPTQLNTVEFLWDPAKRTSVFIQVHCI STEFTMRKHGGEKGVPFRVQIDTFKENENGEYTEHLHSASCQIKVFKPKGADRKQKTDRE KMEKRTPHEKEKYQPSYETTILTECSPWPEITYVNNSPSPGFNSSHSSFSLGEGMVRPRL TIYVCQESLQLREQQQQQQQQQQKHEDGDSNGTFFVYHAIYLEELTAVELTEKIAQLFSI SPCQISQIYKQGPTGIHVLISDEMIQNFQEEACFILDTMKGCSRAAASGAGLRAGFGFAA VTAYNSRHAAGISAEQIRVLLELKSKDGGKRFVPREMDYLRLLGCMKETPLKPMDAFTGS GLKRKFDDVDVGSSVSNSDDEISSSDSADSCDSLNPPTTASFTPTSILKRQKQLRRKNVR FDQVTVYYFARRQGFTSVPSQGGSSLGMAQRHNSVRSYTLCEFAQEQEVNHREILREHLK EEKLHAKKMKKASECKLGDTCLEIDGMFVEIDFRVGGKMRRVFEKLGPRKVMLTKNGTVE SVEADGLTLDDVSDEDIDVENVEVDDYFFLQPLPTKRRRALLRASGVHRIDAEEKQELRA IRLSREECGCDCRLYCDPEACACSQAGIKCQVDRMSFPCGCSRDGCGNMAGRIEFNPIRV RTHYLHTIMKLELESKRQVSRPAAPDEEPSPTASCSLTGAQGSETQDFQEFIAENETAVM HLQSAEELERLKAEEDSSGSSASLDSSIESLGVCILEEPLAVPEELCPGLTAPILIQAQL PPGSSVLCFTENSDHPTASTVNSPSYLNSGPLVYYQVEQRPVLGVKGEPGTEEGSASFPK EKDLNVFSLPVTSLVACSSTDPAALCKSEVGKTPTLEALLPEDCNPEEPENEDFHPSWSP SSLPFRTDNEEGCGMVKTSQQNEDRPPEDSSLELPLAV >gi568815586r:50963749_51176561|GENSCAN_predicted_CDS_5|2997_bp atgggggatatcacagcaggtaatctaaatggaattcttgggatgaaactccaccaagga cagtcttatgaaattcgaatgctagacaataggaaacttggagaacttccagaaattaat ggcaaattggtgaagagtatattccgtgtggtgttccatgacagaaggcttcagtacact gagcatcagcagctagagggctggaggtggaaccgacctggagacagaattcttgacata gatatcccgatgtctgtgggtataatcgatcctagggctaatccaactcaactaaataca gtggagttcctgtgggaccctgcaaagaggacatctgtgtttattcaggtgcactgtatt agcacagagttcactatgaggaaacatggtggagaaaagggggtgccattccgagtacaa atagataccttcaaggagaatgaaaacggggaatatactgagcacttacactcggccagc tgccagatcaaagttttcaagcccaaaggtgcagacagaaagcaaaaaacggatagggaa aaaatggagaaacgaacacctcatgaaaaggagaaatatcagccttcctatgagacaacc atactcacagagtgttctccatggcccgagatcacgtatgtcaataactccccatcacct ggcttcaacagttcccatagcagtttttctcttggggaagggatggtgcgtccaaggtta accatttatgtttgtcaggaatcactgcagttgagggagcagcaacaacagcagcagcaa cagcagcagaagcatgaggatggagactcaaatggtactttcttcgtttaccatgctatc tatctagaagaactaacagctgttgaattgacagaaaaaattgctcagcttttcagcatt tccccttgccagatcagccagatttacaagcaggggccaacaggaattcatgtgctcatc agtgatgagatgatacagaactttcaggaagaagcatgttttattctggacacaatgaaa ggttgcagcagagctgccgcctcgggagccggtttgcgcgccggcttcggctttgcagca gttaccgcctacaactcccggcatgctgctggcatttctgctgagcagattcgtgtcctt ctggaacttaaatccaaagatggaggaaaaagatttgtacctagggaaatggactatctc agactccttggttgtatgaaagaaacccctttgaaaccaatggatgcattcacgggctcg ggtctcaagaggaagtttgatgatgtggatgtgggctcatcagtttccaactcagatgat gagatctccagcagtgatagtgctgacagctgcgacagcctcaatcctcctaccactgcc agcttcacacccacatccatcctgaagcggcagaagcagctgcggaggaagaatgtacgc tttgaccaggtgactgtatactactttgcccggcgccaaggttttaccagtgtgcccagc cagggtggtagctctctgggcatggcccagcgccataactctgtacggagctatacactc tgtgagtttgcccaggaacaggaggtgaaccatcgagagattctgcgtgagcacctgaag gaagagaaactccatgccaagaaaatgaagaaggcaagtgagtgtaaattaggggataca tgtctggaaatagacggtatgtttgtggaaatagattttcgtgtaggagggaagatgaga cgggtttttgagaagcttggacccagaaaggtgatgctgaccaagaatgggacagtggag tcggtggaggctgatggcctgacgctggatgatgtgtcagatgaagatattgatgtggaa aatgtggaggtggatgattacttcttcctgcagcctctgcccaccaaacggcgacgggcc ctgctgagggcttctggggtccaccgtattgatgctgaagagaagcaagaacttcgagcc atccgcctgtcacgggaagaatgtggttgtgactgccgactgtattgtgacccagaagcg tgtgcctgcagccaggctgggattaaatgccaggtggatcgcatgtcctttccatgtggc tgctcccgggatggctgtgggaacatggcaggacgcattgaatttaatccaatccgggtc cggactcattacctccacaccattatgaagctggagctggagagcaagcggcaggtgagc cgcccagcagccccagatgaggagccctccccgactgccagttgcagcctgacaggagca cagggctctgagacccaggacttccaggagttcattgctgagaatgagacagcagtgatg cacctgcagagtgcagaggaactggagcggctcaaggcagaagaagattccagcggctct agtgccagcctggactcgagcatcgagagcctgggtgtgtgcatcctagaggagcctctg gctgtccccgaagagctgtgcccaggccttacagcccccattctcatccaggctcagctg cccccaggctcctctgtcctgtgttttaccgagaactcagaccacccaactgcctcaacg gtgaacagcccatcctacttgaacagtgggcccctggtctattatcaagtggagcagagg ccagtcttgggagtgaaaggagagcctggtacggaagaaggctcagcctctttcccaaag gagaaggatctgaatgtcttctctctccctgttacctcactcgtggcttgtagctccaca gacccagctgccctctgtaaatcagaggtggggaaaacacccaccctagaagctctattg cccgaagattgtaaccctgaggagcctgaaaatgaagacttccacccttcctggtccccc tcaagcctccccttccgcacggacaatgaagagggctgtgggatggtgaagacctcccag cagaatgaggatcggccccctgaagattcttccttagaactccctctggcagtgtga >gi568815586r:50963749_51176561|GENSCAN_predicted_peptide_6|220_aa MAWALKLPLADEVIESGLVQDFDASLSGIGQELGAGAYSMSKDLQNVNITLRIIFQPVAS QLPRIFTSIGEDYDEPVLTYITTEILKSVVARFDAGEVITQRELVSRQVSNDLTEQAATF GLILDDVSLTYLTFGKEFTEAVEAKQVAQQEAERARFVKEKAEQQKKAEQQKKVEQQKKA AVISAEGDSKATELIANSLATAGDGLMELCKLEAAEALGT >gi568815586r:50963749_51176561|GENSCAN_predicted_CDS_6|663_bp atggcctgggctctgaagctgcctctggccgacgaagtgattgaatccgggttggtgcag gactttgatgctagcctgtccgggatcggccaggaactgggtgctggtgcctatagcatg agcaaagatttacagaatgtcaatatcacactgcgcatcatcttccagcctgttgctagc cagcttcctcgcatcttcaccagcatcggagaggactatgatgagcctgtgctgacgtac atcacgaccgagatcctcaagtcagtggtggctcgctttgatgctggagaagttatcact cagagagagctggtctccaggcaggtgagcaacgaccttacggagcaagcagccacattt gggctcatcctggacgacgtgtccttgacatatctgacctttggaaaggagttcacagaa gcagtggaagccaaacaggtggctcagcaggaagcagagagggccagatttgtgaaggaa aaggctgagcagcagaaaaaggctgagcagcagaaaaaggttgagcagcagaaaaaggca gccgtgatctctgctgagggcgactccaaggcaaccgagctgattgccaactcactggcc accgcgggggacggcctgatggagctgtgcaagttggaagccgcggaggctctcggaaca tga