GENSCAN 1.0 Date run: 6-Nov-116 Time: 14:50:32

Sequence gi568815586f:50948357_51159428 : 211072 bp : 43.46% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   4344   4359   16  2  1   81   99     0 0.129   0.75
 1.02 Intr +   5635   5736  102  2  0   60   97    57 0.377   3.95
 1.03 Intr +  12612  12746  135  1  0   84   59    82 0.833   5.44
 1.04 Term +  19702  19814  113  2  2   46   43   143 0.496   4.32
 1.05 PlyA +  20641  20646    6                               1.05

 2.00 Prom +  28006  28045   40                              -4.76
 2.01 Sngl +  28518  29648 1131  2  0   60   44   336 0.992  22.98
 2.02 PlyA +  30286  30291    6                               1.05

 3.18 PlyA -  30905  30900    6                               1.05
 3.17 Term -  31337  31272   66  2  0  114   39    29 0.032  -1.46
 3.16 Intr -  40079  40026   54  2  0   96  115    19 0.172   4.58
 3.15 Intr -  42592  42439  154  0  1  110  111   109 0.999  15.47
 3.14 Intr -  43316  43243   74  2  2  107   74    63 0.999   5.00
 3.13 Intr -  44013  43834  180  0  0  111   94   150 0.777  17.96
 3.12 Intr -  44573  44454  120  2  0   89   94    -5 0.646   0.89
 3.11 Intr -  46274  46188   87  2  0   58   86   112 0.991   8.17
 3.10 Intr -  47431  47273  159  1  0   88   89   163 0.828  16.58
 3.09 Intr -  48552  48461   92  1  2   45  105    68 0.797   3.91
 3.08 Intr -  50885  50811   75  0  0  129   37    77 0.935   6.19
 3.07 Intr -  51059  50989   71  1  2   84   91    13 0.999   0.03
 3.06 Intr -  52063  51957  107  1  2   69  100   124 0.998  10.71
 3.05 Intr -  56551  56432  120  1  0   48  105    31 0.718   1.49
 3.04 Intr -  57080  56955  126  2  0   99   61    65 0.977   5.78
 3.03 Intr -  60188  60120   69  2  0   86   88   126 0.357  11.78
 3.02 Intr -  69130  69106   25  0  1  112   62    24 0.091   0.23
 3.01 Init -  77670  77570  101  0  2   79   13   190 0.288   8.34
 3.00 Prom -  89252  89213   40                              -2.46

 4.00 Prom +  95349  95388   40                              -3.66
 4.01 Init + 100001 100122  122  1  2   96   85    88 0.969   6.96
 4.02 Intr + 100678 100829  152  1  2   93   93    95 0.944  10.31
 4.03 Intr + 105456 105504   49  1  1   50   82    46 0.665  -2.06
 4.04 Intr + 107479 107665  187  1  1  121   61    10 0.344   1.29
 4.05 Intr + 107994 108146  153  2  0   69  103    56 0.621   5.47
 4.06 Term + 111754 111852   99  0  0   80   48    62 0.635  -0.47
 4.07 PlyA + 113943 113948    6                               1.05

 5.19 PlyA - 114821 114816    6                              -3.24
 5.18 Term - 116313 115390  924  0  0  106   48   768 0.996  66.78
 5.17 Intr - 119613 119317  297  0  0   96   82   420 0.995  39.27
 5.16 Intr - 120318 120193  126  0  0   65   88    37 0.742   2.28
 5.15 Intr - 125726 125467  260  0  2   79   46   300 0.956  22.08
 5.14 Intr - 128223 128055  169  0  1   20   94   233 0.138  16.62
 5.13 Intr - 129628 129520  109  0  1   70   34    49 0.032  -2.01
 5.12 Intr - 135429 135339   91  1  1   64   33   128 0.015   3.95
 5.11 Intr - 147684 147633   52  0  1   97   85    18 0.498   0.88
 5.10 Intr - 150562 150420  143  2  2  117   61    78 0.974   8.07
 5.09 Intr - 151423 151299  125  0  2  115   86    57 0.940   8.43
 5.08 Intr - 158257 158169   89  1  2   84  110    14 0.977   1.97
 5.07 Intr - 158990 158880  111  2  0   59   83    53 0.844   2.48
 5.06 Intr - 160917 160765  153  0  0  105   61    63 0.974   5.57
 5.05 Intr - 162627 162521  107  1  2   95  110    23 0.949   5.13
 5.04 Intr - 168064 167959  106  1  1   84   95    78 0.997   7.89
 5.03 Intr - 169391 169315   77  0  2   98   93    99 0.904  10.63
 5.02 Intr - 170579 170525   55  2  1   34  113    -6 0.195  -4.95
 5.01 Init - 173311 173309    3  1  0  113   81     0 0.323   1.80
 5.00 Prom - 175051 175012   40                              -5.76

 6.03 PlyA - 175198 175193    6                               1.05
 6.02 Term - 176910 176272  639  0  0   -6   43   758 0.362  56.11
 6.01 Init - 177051 177025   27  0  0   88  127    27 0.707   6.25
 6.00 Prom - 187412 187373   40                              -2.46

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  40079  39969  111  2  0   96   41    84 0.819   3.06
S.002 Init - 128205 128055  151  0  1   37   94   227 0.860  18.40

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586f:50948357_51159428|GENSCAN_predicted_peptide_1|121_aa
MALPPEKKMSSDNQWSADEDEGQLSRLIRKSRDSPFVPIGIAGFVTVVSCGLYKLKYRRD
QKMSIHLIHMRVAAQGFVVGAVTLALATERGIVSNNKKKKEEEEEEEENEEEEEKGFITE
I

>gi568815586f:50948357_51159428|GENSCAN_predicted_CDS_1|366_bp
atggccttaccgccagagaaaaaaatgtcttcagataaccagtggtcagcagatgaggat
gaaggccaattatcccgactaatcaggaaatctagagactccccctttgtccctataggt
atagcaggctttgtgactgtggtgtcctgtggtctttacaagctaaagtacagaagagat
cagaaaatgtcaattcatcttattcacatgagagttgctgcccaaggatttgttgttgga
gctgtgactctagccttggccacagagcgaggcattgtctcaaataataagaagaagaag
gaggaggaggaggaggaggaggagaatgaggaggaggaggagaagggatttattacagag
atatga

>gi568815586f:50948357_51159428|GENSCAN_predicted_peptide_2|376_aa
MSEIPFTIASKRIKYLGIQLTREVKDLSKENYKPLLNEIKEDTNKWKNIPCSWVGRINIV
KTAILPKVIYRFNAIPIKLPMTFFTELGKTTLKFIWNQKRACIAKSILSQKNKAGSITLP
DFKLYYQATVTKTAWYWCQNRDIDQWNRTEPSEIIPHIYNHLTFDKPDKNEKWGKDSLFN
KWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGITIQDIGMGKD
FMSKPPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTEWEKISAIYSSDKGLISRI
YKELKQFHKKKNNPINKWVKDMNRHFSKEDIYAANRHMKKCSSSLAIREMQIKTTMRYHL
TPVRMAIIKKSGNNRY

>gi568815586f:50948357_51159428|GENSCAN_predicted_CDS_2|1131_bp
atgagtgaaatcccattcacaattgcttcaaagagaataaaatacctaggaatccaactt
acaagggaggtgaaggacctctccaaggagaactacaaaccactgctcaatgaaataaaa
gaggatacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcgtg
aaaacggccatactgcccaaggtaatttatagattcaatgccatccccatcaagctacca
atgactttcttcacagaattgggaaaaactactttaaagttcatatggaaccaaaaaaga
gcctgcattgccaagtcaatcctaagccaaaagaacaaagctggaagcatcacgctacct
gacttcaaactatactaccaggctacagtaaccaaaacagcatggtactggtgccaaaac
agagatatagaccaatggaacagaacagagccctcagaaataataccacacatctacaac
catctgacctttgacaaacctgacaaaaacgagaaatggggaaaggattccctatttaac
aaatggtgctgggaaaactggttagccatatgtagaaagctgaaactggatcccttcctt
acaccttatacaaaaattaattcaagatggattaaagacttaaacgttagacctaaaacc
ataaaaaccctagaagaaaacctaggcattaccattcaggacataggcatgggcaaggac
ttcatgtctaaaccaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatcta
attaaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacct
acagaatgggagaaaatttctgcaatctactcatctgacaaagggctaatatccagaatc
tacaaagaactcaaacaatttcacaagaaaaaaaacaaccccatcaacaagtgggtgaag
gatatgaacagacacttctcaaaagaagacatttatgcagccaacagacacatgaaaaaa
tgctcatcatcactggccatcagagaaatgcaaatcaaaaccacaatgagataccatctc
acaccagttagaatggcgatcattaaaaagtcaggaaacaacaggtactag

>gi568815586f:50948357_51159428|GENSCAN_predicted_peptide_3|559_aa
MAQLPAAPDGAPGLCRGALTCFDASKEADGHRARDGLYYQFLSPGDSEEYFATYFNEKIS
IPEEEYSCFSFRKLWAFTGPGFLMSIAYLDPGNIESDLQSGAVAGFKLLWILLLATLVGL
LLQRLAARLGVVTGLHLAEVCHRQYPKVPRVILWLMVELAIIGSDMQEVIGSAIAINLLS
VGRIPLWGGVLITIADTFVFLFLDKYGLRKLEAFFGFLITIMALTFGYEASGCRTPQIEQ
AVGIVGAVIMPHNMYLHSALVKSRQVNRNNKQEVREANKYFFIESCIALFVSFIINVFVV
SVFAEAFFGKTNEQVVEVCTNTSSPHAGLFPKDNSTLAVDIYKGGVVLGCYFGPAALYIW
AVGILAAGQSSTMTGTYSGQFVMETVICSYVFFQGFLNLKWSRFARVVLTRSIAIIPTLL
VAVFQDVEHLTGMNDFLNVLQSLQLPFALIPILTFTSLRPVMSDFANGLGWRIAGGILVL
IICSINMYFVVVYVRDLGHVALYVVAAVVSVAYLGFVFYLGWQCLIALGMSFLDCGHTGL
LKLQIMRAANPATQAIESF

>gi568815586f:50948357_51159428|GENSCAN_predicted_CDS_3|1680_bp
atggcccagctcccagctgcaccggatggcgcgcccggcctgtgtcggggtgcgctgacc
tgcttcgacgcctcgaaagaggccgacgggcacagggcaagggatggcctttattaccag
tttttgtcccctggggactcagaggagtacttcgccacttactttaatgagaagatctcc
attcctgaggaggagtactcttgttttagctttcgtaaactctgggctttcaccggacca
ggttttcttatgagcattgcctacctggatccaggaaatattgaatccgatttgcagtct
ggagcagtggctggatttaagttgctctggatccttctgttggccacccttgtggggctg
ctgctccagcggcttgcagctagactgggagtggttactgggctgcatcttgctgaagta
tgtcaccgtcagtatcccaaggtcccacgagtcatcctgtggctgatggtggagttggct
atcatcggctcagacatgcaagaagtcattggctcagccattgctatcaatcttctgtct
gtaggaagaattcctctgtggggtggcgttctcatcaccattgcagatacttttgtattt
ctcttcttggacaaatatggcttgcggaagctagaagcattttttggctttctcatcact
attatggccctcacatttggatatgaggcaagtggctgtcgcactccacagattgaacag
gctgtgggcatcgtgggagctgtcatcatgccacacaacatgtacctgcattctgcctta
gtcaagtctagacaggtaaaccggaacaataagcaggaagttcgagaagccaataagtac
tttttcattgaatcctgcattgcactctttgtttccttcatcatcaatgtctttgttgtc
tcagtctttgctgaagcattttttgggaaaaccaacgagcaggtggttgaagtctgtaca
aataccagcagtcctcatgctggcctctttcctaaagataactcgacactggctgtggac
atctacaaagggggtgttgtgctgggatgttactttgggcctgctgcactctacatttgg
gcagtggggatcctggctgcaggacagagctccaccatgacaggaacctattctggccag
tttgtcatggagacagtcatctgctcctatgttttcttccagggattcctgaacctaaag
tggtcacgctttgcccgagtggttctgactcgctctattgccatcatccccactctgctt
gttgctgtcttccaagatgtagagcatctaacagggatgaatgactttctgaatgttcta
cagagcttacagcttccctttgctctcatacccatcctcacatttacgagcttgcggcca
gtaatgagtgactttgccaatggactaggctggcggattgcaggaggaatcttggtcctt
atcatctgttccatcaatatgtactttgtagtggtttatgtccgggacctagggcatgtg
gcattatatgtggtggctgctgtggtcagcgtggcttatctgggctttgtgttctacttg
ggttggcaatgtttgattgcactgggcatgtccttcctggactgtgggcatacgggcctc
ttgaaacttcagataatgagagcagccaaccctgcaactcaggctatagagtccttctga

>gi568815586f:50948357_51159428|GENSCAN_predicted_peptide_4|253_aa
MALSRVCWARSAVWGSAVTPGHFVTRRLQLGRSGLAWGAPRSSKLHLSPKADVKNLMSYV
VTKTKAINGKYHRFLGRHFPRFYVLYTIFMKGIISIPPFANYLVFLLMYLFPRQLLIRHF
WTPKQQTDFLDIYHAFRKQSHPEIISYLEKVIPLISDAGLRWRLTDLCTKKALSRAMLLT
SYLPPPLLRHRLKTHTTVIHQLDKALAKLGIGQLTAQEVKSPSLSIVFSGLYYSPLWIFN
SAASTVMQQSNCN

>gi568815586f:50948357_51159428|GENSCAN_predicted_CDS_4|762_bp
atggcgctctccagggtgtgctgggctcggtcggctgtgtggggctcggcagtcacccct
ggacattttgtcacccggaggctgcaacttggtcgctctggcctggcttggggggcccct
cggtcttcaaagcttcacctttctccaaaggcagatgtgaagaacttgatgtcttatgtg
gtaaccaagacaaaagcgattaatgggaaataccatcgtttcttgggtcgtcatttcccc
cgcttctatgtcctgtacacaatcttcatgaaaggtattatttccattccaccttttgcc
aactacctggtcttcttgctaatgtacctgtttcccaggcaactactgatcaggcatttc
tggaccccaaaacaacaaactgatttcttagatatctatcatgctttccggaagcagtcc
cacccagaaattattagttatttagaaaaggtcatccctctcatttctgatgcaggactc
cggtggcgtctgacagatctgtgcaccaagaaagccttgagccgggccatgcttctcaca
tcttacctgcctcctcccttgttgagacatcgtttgaagactcatacaactgtgattcac
caactggacaaggctttggcaaagctggggattggccagctgactgctcaggaagtaaaa
tcgccttcactctccattgtcttttctgggctgtattacagccctctgtggatcttcaac
tctgctgcctccactgtgatgcagcagtccaactgtaactga

>gi568815586f:50948357_51159428|GENSCAN_predicted_peptide_5|998_aa
MGDITAGNLNGILGMKLHQGQSYEIRMLDNRKLGELPEINGKLVKSIFRVVFHDRRLQYT
EHQQLEGWRWNRPGDRILDIDIPMSVGIIDPRANPTQLNTVEFLWDPAKRTSVFIQVHCI
STEFTMRKHGGEKGVPFRVQIDTFKENENGEYTEHLHSASCQIKVFKPKGADRKQKTDRE
KMEKRTPHEKEKYQPSYETTILTECSPWPEITYVNNSPSPGFNSSHSSFSLGEGMVRPRL
TIYVCQESLQLREQQQQQQQQQQKHEDGDSNGTFFVYHAIYLEELTAVELTEKIAQLFSI
SPCQISQIYKQGPTGIHVLISDEMIQNFQEEACFILDTMKGCSRAAASGAGLRAGFGFAA
VTAYNSRHAAGISAEQIRVLLELKSKDGGKRFVPREMDYLRLLGCMKETPLKPMDAFTGS
GLKRKFDDVDVGSSVSNSDDEISSSDSADSCDSLNPPTTASFTPTSILKRQKQLRRKNVR
FDQVTVYYFARRQGFTSVPSQGGSSLGMAQRHNSVRSYTLCEFAQEQEVNHREILREHLK
EEKLHAKKMKKASECKLGDTCLEIDGMFVEIDFRVGGKMRRVFEKLGPRKVMLTKNGTVE
SVEADGLTLDDVSDEDIDVENVEVDDYFFLQPLPTKRRRALLRASGVHRIDAEEKQELRA
IRLSREECGCDCRLYCDPEACACSQAGIKCQVDRMSFPCGCSRDGCGNMAGRIEFNPIRV
RTHYLHTIMKLELESKRQVSRPAAPDEEPSPTASCSLTGAQGSETQDFQEFIAENETAVM
HLQSAEELERLKAEEDSSGSSASLDSSIESLGVCILEEPLAVPEELCPGLTAPILIQAQL
PPGSSVLCFTENSDHPTASTVNSPSYLNSGPLVYYQVEQRPVLGVKGEPGTEEGSASFPK
EKDLNVFSLPVTSLVACSSTDPAALCKSEVGKTPTLEALLPEDCNPEEPENEDFHPSWSP
SSLPFRTDNEEGCGMVKTSQQNEDRPPEDSSLELPLAV

>gi568815586f:50948357_51159428|GENSCAN_predicted_CDS_5|2997_bp
atgggggatatcacagcaggtaatctaaatggaattcttgggatgaaactccaccaagga
cagtcttatgaaattcgaatgctagacaataggaaacttggagaacttccagaaattaat
ggcaaattggtgaagagtatattccgtgtggtgttccatgacagaaggcttcagtacact
gagcatcagcagctagagggctggaggtggaaccgacctggagacagaattcttgacata
gatatcccgatgtctgtgggtataatcgatcctagggctaatccaactcaactaaataca
gtggagttcctgtgggaccctgcaaagaggacatctgtgtttattcaggtgcactgtatt
agcacagagttcactatgaggaaacatggtggagaaaagggggtgccattccgagtacaa
atagataccttcaaggagaatgaaaacggggaatatactgagcacttacactcggccagc
tgccagatcaaagttttcaagcccaaaggtgcagacagaaagcaaaaaacggatagggaa
aaaatggagaaacgaacacctcatgaaaaggagaaatatcagccttcctatgagacaacc
atactcacagagtgttctccatggcccgagatcacgtatgtcaataactccccatcacct
ggcttcaacagttcccatagcagtttttctcttggggaagggatggtgcgtccaaggtta
accatttatgtttgtcaggaatcactgcagttgagggagcagcaacaacagcagcagcaa
cagcagcagaagcatgaggatggagactcaaatggtactttcttcgtttaccatgctatc
tatctagaagaactaacagctgttgaattgacagaaaaaattgctcagcttttcagcatt
tccccttgccagatcagccagatttacaagcaggggccaacaggaattcatgtgctcatc
agtgatgagatgatacagaactttcaggaagaagcatgttttattctggacacaatgaaa
ggttgcagcagagctgccgcctcgggagccggtttgcgcgccggcttcggctttgcagca
gttaccgcctacaactcccggcatgctgctggcatttctgctgagcagattcgtgtcctt
ctggaacttaaatccaaagatggaggaaaaagatttgtacctagggaaatggactatctc
agactccttggttgtatgaaagaaacccctttgaaaccaatggatgcattcacgggctcg
ggtctcaagaggaagtttgatgatgtggatgtgggctcatcagtttccaactcagatgat
gagatctccagcagtgatagtgctgacagctgcgacagcctcaatcctcctaccactgcc
agcttcacacccacatccatcctgaagcggcagaagcagctgcggaggaagaatgtacgc
tttgaccaggtgactgtatactactttgcccggcgccaaggttttaccagtgtgcccagc
cagggtggtagctctctgggcatggcccagcgccataactctgtacggagctatacactc
tgtgagtttgcccaggaacaggaggtgaaccatcgagagattctgcgtgagcacctgaag
gaagagaaactccatgccaagaaaatgaagaaggcaagtgagtgtaaattaggggataca
tgtctggaaatagacggtatgtttgtggaaatagattttcgtgtaggagggaagatgaga
cgggtttttgagaagcttggacccagaaaggtgatgctgaccaagaatgggacagtggag
tcggtggaggctgatggcctgacgctggatgatgtgtcagatgaagatattgatgtggaa
aatgtggaggtggatgattacttcttcctgcagcctctgcccaccaaacggcgacgggcc
ctgctgagggcttctggggtccaccgtattgatgctgaagagaagcaagaacttcgagcc
atccgcctgtcacgggaagaatgtggttgtgactgccgactgtattgtgacccagaagcg
tgtgcctgcagccaggctgggattaaatgccaggtggatcgcatgtcctttccatgtggc
tgctcccgggatggctgtgggaacatggcaggacgcattgaatttaatccaatccgggtc
cggactcattacctccacaccattatgaagctggagctggagagcaagcggcaggtgagc
cgcccagcagccccagatgaggagccctccccgactgccagttgcagcctgacaggagca
cagggctctgagacccaggacttccaggagttcattgctgagaatgagacagcagtgatg
cacctgcagagtgcagaggaactggagcggctcaaggcagaagaagattccagcggctct
agtgccagcctggactcgagcatcgagagcctgggtgtgtgcatcctagaggagcctctg
gctgtccccgaagagctgtgcccaggccttacagcccccattctcatccaggctcagctg
cccccaggctcctctgtcctgtgttttaccgagaactcagaccacccaactgcctcaacg
gtgaacagcccatcctacttgaacagtgggcccctggtctattatcaagtggagcagagg
ccagtcttgggagtgaaaggagagcctggtacggaagaaggctcagcctctttcccaaag
gagaaggatctgaatgtcttctctctccctgttacctcactcgtggcttgtagctccaca
gacccagctgccctctgtaaatcagaggtggggaaaacacccaccctagaagctctattg
cccgaagattgtaaccctgaggagcctgaaaatgaagacttccacccttcctggtccccc
tcaagcctccccttccgcacggacaatgaagagggctgtgggatggtgaagacctcccag
cagaatgaggatcggccccctgaagattcttccttagaactccctctggcagtgtga

>gi568815586f:50948357_51159428|GENSCAN_predicted_peptide_6|221_aa
MAAKVFESTDIVVGDSLSHPMGTETNYLCLSPPRNVPIITGSKDLQNVNITLRIIFQPVA
SQLPRIFTSIGEDYDEPVLTYITTEILKSVVARFDAGEVITQRELVSRQVSNDLTEQAAT
FGLILDDVSLTYLTFGKEFTEAVEAKQVAQQEAERARFVKEKAEQQKKAEQQKKVEQQKK
AAVISAEGDSKATELIANSLATAGDGLMELCKLEAAEALGT

>gi568815586f:50948357_51159428|GENSCAN_predicted_CDS_6|666_bp
atggcagccaaagtgtttgagtccacggacattgtggtaggggactcactttctcatccc
atgggtacagaaaccaattatctttgcctttctccaccacgtaatgtaccaatcatcact
ggtagcaaagatttacagaatgtcaatatcacactgcgcatcatcttccagcctgttgct
agccagcttcctcgcatcttcaccagcatcggagaggactatgatgagcctgtgctgacg
tacatcacgaccgagatcctcaagtcagtggtggctcgctttgatgctggagaagttatc
actcagagagagctggtctccaggcaggtgagcaacgaccttacggagcaagcagccaca
tttgggctcatcctggacgacgtgtccttgacatatctgacctttggaaaggagttcaca
gaagcagtggaagccaaacaggtggctcagcaggaagcagagagggccagatttgtgaag
gaaaaggctgagcagcagaaaaaggctgagcagcagaaaaaggttgagcagcagaaaaag
gcagccgtgatctctgctgagggcgactccaaggcaaccgagctgattgccaactcactg
gccaccgcgggggacggcctgatggagctgtgcaagttggaagccgcggaggctctcgga
acatga