GENSCAN 1.0 Date run: 3-Nov-116 Time: 15:42:38 Sequence gi568815581r:76636164_76837160 : 200997 bp : 47.59% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 12663 12780 118 2 1 42 45 98 0.138 1.26 1.02 Intr + 15473 15784 312 0 0 82 15 293 0.003 17.56 1.03 Intr + 34890 35009 120 1 0 18 103 97 0.369 4.67 1.04 Intr + 37656 37723 68 1 2 66 77 53 0.326 0.62 1.05 Term + 38153 38260 108 1 0 89 37 66 0.358 0.11 1.06 PlyA + 38787 38792 6 1.05 2.09 PlyA - 39782 39777 6 1.05 2.08 Term - 41539 41425 115 0 1 91 38 312 0.986 24.54 2.07 Intr - 49002 48909 94 1 1 90 80 161 0.993 14.52 2.06 Intr - 52051 51950 102 2 0 55 113 127 0.683 12.05 2.05 Intr - 55926 55748 179 2 2 60 66 13 0.331 -4.14 2.04 Intr - 60705 60572 134 0 2 98 51 154 0.274 12.14 2.03 Intr - 70433 69809 625 1 1 93 -26 854 0.005 66.75 2.02 Intr - 73799 73745 55 0 1 71 55 72 0.682 0.24 2.01 Init - 74783 74294 490 2 1 105 36 509 0.340 40.77 2.00 Prom - 75255 75216 40 -0.36 3.07 PlyA - 76693 76688 6 1.05 3.06 Term - 82697 82566 132 2 0 83 46 154 0.537 8.79 3.05 Intr - 84335 84197 139 1 1 70 78 109 0.748 8.57 3.04 Intr - 85193 85100 94 0 1 93 78 22 0.274 0.72 3.03 Intr - 87895 87609 287 0 2 76 80 145 0.800 9.49 3.02 Intr - 89692 89304 389 1 2 86 103 532 0.999 48.09 3.01 Init - 90312 90184 129 0 0 65 89 291 0.999 27.05 3.00 Prom - 90369 90330 40 -16.86 4.00 Prom + 90434 90473 40 -8.56 4.01 Init + 90596 90716 121 1 1 75 67 105 0.975 5.45 4.02 Intr + 90759 91015 257 1 2 103 96 227 0.665 22.16 4.03 Intr + 91135 91243 109 0 1 83 33 63 0.472 0.16 4.04 Term + 100027 100235 209 2 2 67 44 406 0.791 31.60 4.05 PlyA + 100319 100324 6 -10.49 5.02 PlyA - 100337 100332 6 -1.75 5.01 Sngl - 100997 100632 366 2 0 54 41 831 0.773 71.10 5.00 Prom - 101207 101168 40 -4.16 6.00 Prom + 102851 102890 40 -7.96 6.01 Init + 104795 104901 107 1 2 86 111 -15 0.815 0.43 6.02 Intr + 105806 105885 80 2 2 69 107 50 0.874 4.19 6.03 Intr + 108159 108303 145 1 1 107 54 122 0.069 10.04 6.04 Intr + 109230 109276 47 0 2 101 91 -1 0.039 -0.45 6.05 Intr + 131223 131288 66 1 0 92 95 36 0.860 3.58 6.06 Intr + 133583 133708 126 0 0 74 63 114 0.991 8.15 6.07 Intr + 138834 139008 175 1 1 79 80 171 0.884 14.40 6.08 Intr + 140243 140378 136 2 1 80 91 77 0.961 7.77 6.09 Term + 142025 142189 165 1 0 76 39 119 0.996 3.72 6.10 PlyA + 142636 142641 6 1.05 7.03 PlyA - 143318 143313 6 1.05 7.02 Term - 145188 145114 75 0 0 112 45 70 0.931 2.94 7.01 Init - 170343 170206 138 0 0 88 57 121 0.024 7.14 7.00 Prom - 182193 182154 40 -5.86 8.02 PlyA - 184433 184428 6 1.05 8.01 Sngl - 190567 190082 486 1 0 43 37 185 0.245 4.97 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 108159 108307 149 1 2 107 44 159 0.930 11.46 S.002 Term - 176954 176872 83 0 2 110 45 58 0.806 1.56 S.003 Init - 182000 181965 36 2 0 76 81 49 0.866 2.06 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:76636164_76837160|GENSCAN_predicted_peptide_1|241_aa MSTKEGHVACCSMKEPYPQSNLQMLKEPTNQRRRQTNPVVVLINLLLLGMDIIILIIFSY SLAHSKLSFLVFISILFLTITFFLFFIYLIVNLLLPIFLISVPISRGISSISASSTTSFF SFKSLVVISELVSMAASDMMGHAEDSGRHQARRVQVGGATPRGVEPAGGHDWVQKRNVII QPSGSLEKVVTVSPGSGEPANCIQMELRFGSFTEMNLCELMRTISDEVAEMVSTEASTYG E >gi568815581r:76636164_76837160|GENSCAN_predicted_CDS_1|726_bp atgagcacaaaagaaggccacgtggcctgttgctccatgaaagagccgtatccacaaagc aacctccaaatgctgaaggagccaaccaaccagaggaggaggcagacaaatccagttgtt gtcctcatcaatcttctgcttcttggtatggacatcatcatccttatcatcttcagctac tcacttgcccatagcaaactcagcttcctcgtcttcatctccatcctcttcctcaccatc accttcttcctcttcttcatctacctcattgtcaacctcctgctcccaattttcctcatt agtgttcccattagcagaggcatctcttccatttccgcctcctccacaacttccttcttc tcctttaagtccttggtggtgatctcggagctggtgtctatggctgcatctgacatgatg gggcatgccgaggactcaggccgccatcaggcccgaagagtccaggttggtggggccact ccccgcggtgtggagcctgcagggggccacgactgggtgcagaaaaggaatgtgatcatt cagccgagtgggtcattggaaaaggtggttaccgtcagtccaggaagtggggagccagct aattgtatccagatggagcttagatttggttcattcactgaaatgaatctctgtgagctg atgagaacaatctcagatgaggtagcagaaatggtgtcaacagaggccagcacttacggg gaatag >gi568815581r:76636164_76837160|GENSCAN_predicted_peptide_2|597_aa MEAPAELLAALPALATALALLLAWLLVRRGAAASPEPARAPPEPAPPAEATGAPAPSRPC APEPAASPAGPEEPGEPAGLGELGEPAGPGEPEGPGDPAAAPAEAEEQAVEARQVRTGAL AAPSPPPGLRLPLPFPAEPGSGAAASSPGCPQEAQNPRPAGNAGKLGLDGVKMQDKALLE HQCGLWKLQRPQPSQNDPGAGELPHHLSCLCDGSVVLFDGSVVLFDGSVVLFDGSVVLCD GSVGLCDGSVGLCDGSVGLCDGRVGLCDGSVVLCDGSVGLCDGSVGLCDGSVGLCDGRVG LCDGRVGLCDGSVGLCDGSVGLCDGSVGLCDGSVVLCDGSVGLCDGSVGLCDGSVVLCDG SVVLCDSRVGLRDGSVVLCDGSVGLCDGSVEMILVAPTMAQLPDGGEDSDHETLERLPLA GRTFTDSFNAELKARCEPLCLAQNLNVYQPLCAQEGLGTSADTGSFGDTDLPWLMDSSPW SAIICLSFLLDHELLCAKGTDPVLVLQEEEQDLDGEKGPSSEGPEEEDGEGFSFKYSPGK LRGNQYKKMMTKEELEEEQRVQKEQLAAIFKLMKDNKETFGEMSDGDVQEQLRLYDM >gi568815581r:76636164_76837160|GENSCAN_predicted_CDS_2|1794_bp atggaggcgccggccgagctactggccgcgctgcctgcgctggccaccgcgctggccctt ctgctcgcctggctactggtgcggcgtggggcggccgcgagcccggagcctgcccgcgcg cccccggaacccgcgcccccggccgaggccaccggggccccggcgccgtcccgcccctgc gcccccgagccggcggcctcgcccgcggggccggaggagcctggagagcccgcggggctg ggggagctcggggagcctgcgggaccgggggagcccgaagggccaggggatcccgcggcg gcgccagcggaggcggaggagcaggcggtggaggcgaggcaggtacgcaccggggccctc gcggccccctccccacccccggggctccggctgccgctgccgttccccgccgagccaggg agcggggccgcggcctccagcccaggctgcccccaggaagcgcagaacccgcggcccgca ggaaatgcaggtaagctgggcctggatggagtgaagatgcaggataaagccctcctggag caccagtgtgggctctggaagctacagagaccccagccttcccagaatgacccaggagca ggtgaacttccccatcatctctcctgcctttgtgatggcagtgtggtcctttttgatggc agtgtggtcctttttgatggcagtgtggtcctttttgatggcagtgtggtcctttgtgat ggcagtgtgggcctctgtgacggcagcgtgggcctctgtgacggcagcgtgggcctctgt gatggcagagtgggcctctgtgatggcagcgtggtcctttgtgatggcagcgtgggcctc tgtgacggcagcgtgggcctctgtgacggcagcgtgggcctctgtgatggcagagtgggc ctctgtgacggcagagtgggcctttgtgatggcagcgtgggcctctgtgatggcagcgtg ggcctctgtgacggcagcgtgggcctctgtgacggcagtgtggtcctttgtgatggcagc gtgggcctttgtgatggcagcgtgggcctttgtgatggcagtgtggtcctctgtgatggc agtgtggtcctttgtgatagcagagtgggccttcgtgatggcagtgtggtcctctgtgat ggcagtgtgggcctttgtgatggcagtgtggaaatgatcctggtcgcacccaccatggca cagctgcctgatggtggggaggactcagaccatgagaccctggagagattgcctttggct ggccgcaccttcacggattcattcaacgctgagctgaaggccaggtgtgagccactgtgc ctggcccagaatctaaatgtgtaccagcctctctgtgctcaggagggcttgggcacgtca gctgatactgggtcctttggggatacggatctgccctggctcatggactcctcgccctgg agtgcaattatttgtttgtctttcctactggaccacgagctcctctgtgccaagggcact gaccctgtccttgtcttgcaggaagaggagcaggacttggatggtgagaaggggccatca tcggaagggcctgaggaggaggacggagaaggcttctccttcaaatacagccccgggaag ctgaggggaaaccagtacaagaagatgatgaccaaagaggagctggaggaggagcagaga gttcagaaggaacagctggctgccatcttcaagctcatgaaagacaacaaggagacgttt ggcgagatgtccgacggcgacgtgcaggagcagctccggctctacgacatgtag >gi568815581r:76636164_76837160|GENSCAN_predicted_peptide_3|389_aa MNHKSKKRIREAKRSARPELKDSLDWTRHNYYESFSLSPAAVADNVERADALQLSVEEFV ERYERPYKPVVLLNAQEGWSAQEKWTLERLKRKYRNQKFKCGEDNDGYSVKMKMKYYIEY MESTRDDSPLYIFDSSYGEHPKRRKLLEDYKVPKFFTDDLFQYAGEKRRPPYRWFVMGPP RSGTGIHIDPLGTSAWNALVQGHKRWCLFPTSTPRELIKVTRDEGGNQQDEAITWFNVIY PRTQLPTWPPEFKPLEILQKPGETVFVPEQIQHGQRQCKPPTATPCHCVPTLTWRDQLSG ILKQEHPELAVLADSVDLQESTGIASDSSSDSSSSSSSSSSDSDSECESGSEGDGTVHRR KKRRTCSMVGNGDTTSQDDCVSKERSSSR >gi568815581r:76636164_76837160|GENSCAN_predicted_CDS_3|1170_bp atgaaccacaagagcaagaagcgcatccgcgaggccaagcggagtgcgcggccggagctc aaggactcgctggattggacccggcacaactactacgagagcttctcgctgagcccggcg gccgtggcggataacgtggaaagggcagatgctttacagctgtctgtggaagaatttgtg gagcggtatgaaagaccttacaagcccgtggttttgttgaatgcgcaagagggctggtct gcgcaggagaaatggactctggagcgcctaaaaaggaaatatcggaaccagaagttcaag tgtggtgaggataacgatggctactcagtgaagatgaagatgaaatactacatcgagtac atggagagcactcgagatgatagtcccctttacatctttgacagcagctatggtgaacac cctaaaagaaggaaacttttggaagactacaaggtgccaaagtttttcactgatgacctt ttccagtatgctggggagaagcgcaggcccccttacaggtggtttgtgatggggccacca cgctccggaactgggattcacatcgaccctctgggaaccagtgcctggaatgccttagtt cagggccacaagcgctggtgcctgtttcctaccagcactcccagggaactcatcaaagtg acccgagacgaaggagggaaccagcaagacgaagctattacctggtttaatgttatttat ccccggacacagcttccaacctggccacctgaattcaaacccctggaaatcttacaaaaa ccaggagagactgtctttgtaccagaacagatacagcacgggcagcggcagtgcaagcca cccacagccaccccgtgccactgtgtcccaaccctgacctggagggaccagctctcgggg attttgaagcaagagcaccccgagttggcagtcctcgcagactcggttgaccttcaggag tccacagggatagcttccgacagctccagcgactcttccagctcctccagctccagttcg tcagactccgactcagagtgcgagtctggatccgagggcgatgggacagtgcaccgcagg aagaagaggaggacgtgcagcatggtgggaaacggggacaccacctcccaggacgactgt gtcagcaaagagcgcagctcctccaggtga >gi568815581r:76636164_76837160|GENSCAN_predicted_peptide_4|231_aa MLGRSRRLRHRPAPPRPSPPALPLCPSLPVAPAARCQFCASRQGYHQIWAFPFLPSGATA TWPAASRSRSLAARSLPRSPARPGPNDALLGEHDFRGQGVRAQRFRFSEEPGPGADGAVL EVHVPQVDPRSKNVGYEALRCIKCLSVRILETLPRSDPGHYVGDLGGLFDRDLDLDSLLD TGGGLLDRDRDLDRERDLETDEDLDLDLRADLDLEVDRDRERVRDRDFERL >gi568815581r:76636164_76837160|GENSCAN_predicted_CDS_4|696_bp atgttgggaaggtcacgtcggctgcgtcaccgccccgccccgccccgcccctcccctccc gcgctgccgctgtgcccatcacttccggtcgcgccagccgcccgttgccagttctgcgcg tcgcggcagggttatcaccagatctgggctttccccttcttgccgtcaggtgctacggcc acgtggcccgcggcttcccgctcgcgcagtctggcagcccggagccttccgcggtccccc gcccgcccggggcccaacgacgccctactgggcgagcacgatttccgaggacagggggtc cgggcccagcgctttcgattctcggaggagccgggtccgggggccgacggggctgtcctg gaggtccacgtcccgcaggttgacccccggagcaaaaacgtcggatatgaagcccttcgg tgcattaaatgtctctcagtaagaattttggagaccctgccaagaagtgaccctggtcac tatgttggagacttggggggactcttcgatcgcgacctggatttggattccctcttggac actgggggaggactcctggaccgagaccgggacctggaccgcgaacgagatctggagacc gacgaggacttggacttggaccttcgtgcggatctggacttggaggtcgaccgagatcga gaacgagtgcgggaccgagacttcgagcggctgtag >gi568815581r:76636164_76837160|GENSCAN_predicted_peptide_5|121_aa MSYGRPPPDVEGMTSLKVDNLTYRTSPDTLRRVFEKYGRVGDVYIPRDRYTKESRGFAFV RFHDKRDAEDAMDAMDGAVLDGRELRVQMARYGRPPDSHHSRRGPPPRRYGGGGYGRRSR R >gi568815581r:76636164_76837160|GENSCAN_predicted_CDS_5|366_bp atgagctacggccgcccccctcccgatgtggagggtatgacctccctcaaggtggacaac ctgacctaccgcacctcgcccgacacgctgaggcgcgtcttcgagaagtacgggcgcgtc ggcgacgtgtacatcccgcgggaccgctacaccaaggagtcccgcggcttcgccttcgtt cgctttcacgacaagcgcgacgctgaggacgctatggatgccatggacggggccgtgctg gacggccgcgagctgcgggtgcaaatggcgcgctacggccgccccccggactcacaccac agccgccggggaccgccaccccgcaggtacgggggcggtggctacggacgccggagccgc aggtaa >gi568815581r:76636164_76837160|GENSCAN_predicted_peptide_6|348_aa MAIIYGVFSASNLITPSVVAIVGPQLSMFASGLFYSMYIAVFIQPFPWSFYTASVFIGIA AAESDRRTVFIALTVISLVGTVLFFLIRKPDSENVLGEDESSDDQDMEVNEHWVFYKLKA CGNPASKKSFKLCVTKEMLLLSITTAYTGLELTFFSGVYGTCIGATNKFGAEEKSLIGLS GIFIGIGEILGGSLFGLLSKNNRFGRNPVVLLGILVHFIAFYLIFLNMPGDAPIAPVKGT DSSAYIKSSKEVAILCSFLLGLGDSCFNTQLLSILGFLYSEDSAPAFAIFKFVQSICAAV AFFYSNYLLLHWQLLVMVIFGFFGTISFFTVEWEAAAFVARGSDYRSI >gi568815581r:76636164_76837160|GENSCAN_predicted_CDS_6|1047_bp atggctattatctatggagtgttctctgcttcaaatttgattacaccgtcagtggttgcc attgtaggacctcaactctctatgtttgccagtggtttattttacagcatgtacattgcc gttttcatccagcctttcccgtggtccttctacacagcctctgttttcattggaattgct gctgctgagagtgaccgaagaacagtgtttattgccctaacggtgattagccttgtgggg acagttctattctttctcattcggaaaccagattctgaaaatgtcctaggagaagatgag tcttctgatgaccaggacatggaagtcaacgaacactgggttttttacaaattgaaggct tgtggcaaccctgcatcgaaaaagtcttttaagttatgtgtcaccaaggagatgctcctt cttagtattacaactgcttatacaggtctggaattaactttcttctctggtgtatatgga acctgtattggtgctacaaataaatttggagcagaagagaaaagccttattggactttct ggcattttcatcggcattggagaaattttaggtggaagcctcttcggcctgctgagcaag aacaatcgttttggtagaaatccagttgtgctgttgggcatcctggtgcacttcatagct ttttatctaatatttctcaacatgcctggagatgccccgattgctcctgttaaaggaact gacagcagtgcttacatcaaatccagcaaagaagttgccattctctgcagttttctgttg ggccttggagacagctgctttaatacccagctgcttagtatcttgggctttctgtattct gaagacagcgccccagcatttgccatcttcaagtttgttcagtctatttgcgcagccgtg gcatttttctacagcaactaccttctccttcactggcaactcctggtcatggtgatattt gggttttttggaacaatttctttcttcactgtggaatgggaagctgccgcctttgtagcc cgcggctctgactaccgaagtatctga >gi568815581r:76636164_76837160|GENSCAN_predicted_peptide_7|70_aa MRLHSSALGWSMGLGALEQVAALIGEARAGQEPTAGAGRLRHSRLQLSLLVTFDGDSCHV VGCPEDYSVA >gi568815581r:76636164_76837160|GENSCAN_predicted_CDS_7|213_bp atgcgcctgcactcctcagcccttgggtggtcgatgggactgggtgccctggagcaggtg gcggcgctcattggggaggctcgggcagggcaggagcccacggcgggggcggggaggctc agacatagcaggctgcagctgtccttgcttgtgacctttgatggagacagctgccatgtc gtaggctgccctgaggactactccgtagcatga >gi568815581r:76636164_76837160|GENSCAN_predicted_peptide_8|161_aa MQFQQQKCTITPARKQQVQASYYSAFDSPALTWLLTAFTRRNLNLHRNDETGAEGDIPEP GRLQASFPPQPILSAEWGREVLLASCWTAAALACGSYTHSLTGTSAGRNRIGYLHLNAPL PLTGTCHCSVQSCPPRCLEGLCGTAEYTRTQRRGASLNTVT >gi568815581r:76636164_76837160|GENSCAN_predicted_CDS_8|486_bp atgcaatttcagcagcagaaatgcaccatcactccggccaggaagcagcaggtacaggca tcttattactcggcatttgatagcccggctctcacttggttgttaacagccttcacaaga agaaacctcaatctgcaccgaaatgatgagaccggtgcagagggagacatcccggagcct ggacgccttcaggcctcgtttcctccccagcccatcctcagcgcagagtgggggagagaa gtcctgcttgcctcatgttggacagcagccgcccttgcatgtggaagttacacacactca ctcacagggaccagcgcagggagaaacagaattggctacctccacctcaatgctcccctt ccgctcacgggcacctgccactgctctgtccagtcctgccctccacggtgccttgagggc ctgtgtggtacagccgagtacaccaggacccagcgaagaggtgcctccctcaacactgtc acatag