GENSCAN 1.0 Date run: 4-Nov-116 Time: 19:25:14 Sequence gi568815590r:76883264_77084178 : 200915 bp : 36.97% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 2834 2851 18 1 0 85 115 -1 0.256 1.70 1.02 Intr + 8462 8518 57 1 0 63 83 69 0.506 2.06 1.03 Term + 10590 10763 174 2 0 75 48 114 0.528 2.88 1.04 PlyA + 11094 11099 6 1.05 2.05 PlyA - 12005 12000 6 1.05 2.04 Term - 15644 15019 626 0 2 34 42 303 0.656 13.86 2.03 Intr - 18431 18302 130 2 1 51 89 70 0.287 2.75 2.02 Intr - 28799 28645 155 0 2 12 92 96 0.099 1.27 2.01 Init - 61435 61312 124 1 1 51 55 134 0.834 6.80 2.00 Prom - 62483 62444 40 -5.95 3.00 Prom + 67618 67657 40 -1.35 3.01 Sngl + 90804 91382 579 2 0 74 42 174 0.811 7.32 3.02 PlyA + 91384 91389 6 -0.45 4.00 Prom + 91754 91793 40 -3.65 4.01 Init + 93577 93605 29 0 2 90 78 23 0.515 0.64 4.02 Term + 96588 96678 91 0 1 125 39 96 0.450 4.61 4.03 PlyA + 96691 96696 6 1.05 5.02 PlyA - 97022 97017 6 1.05 5.01 Sngl - 100915 99998 918 1 0 92 43 322 0.975 24.18 5.00 Prom - 102327 102288 40 -6.75 6.00 Prom + 102883 102922 40 -4.05 6.01 Init + 104009 104097 89 1 2 77 63 65 0.855 3.06 6.02 Intr + 109195 109340 146 1 2 122 52 116 0.521 10.41 6.03 Intr + 116573 116757 185 0 2 25 45 109 0.288 -1.11 6.04 Intr + 116803 117011 209 0 2 56 2 418 0.024 26.95 6.05 Intr + 133621 133751 131 1 2 17 47 105 0.151 -1.28 6.06 Intr + 136526 136608 83 0 2 150 83 15 0.480 5.64 6.07 Intr + 137531 137609 79 1 1 85 91 47 0.496 2.91 6.08 Term + 139276 139799 524 2 2 48 36 260 0.898 10.25 6.09 PlyA + 140110 140115 6 1.05 7.00 Prom + 141612 141651 40 -3.65 7.01 Init + 144324 144537 214 2 1 76 88 203 0.897 18.05 7.02 Term + 145610 145686 77 0 2 104 49 30 0.496 -2.28 7.03 PlyA + 145698 145703 6 1.05 8.00 Prom + 148345 148384 40 -3.55 8.01 Sngl + 159982 160638 657 0 0 59 41 245 0.902 12.82 8.02 PlyA + 160866 160871 6 1.05 9.00 Prom + 161037 161076 40 -6.15 9.01 Sngl + 162577 163809 1233 0 0 37 42 313 0.647 17.15 9.02 PlyA + 165025 165030 6 1.05 10.03 PlyA - 165363 165358 6 1.05 10.02 Term - 194211 194025 187 2 1 63 38 146 0.703 3.08 10.01 Init - 200402 200188 215 2 2 57 89 167 0.433 11.12 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 116803 117015 213 0 0 56 38 418 0.901 29.85 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590r:76883264_77084178|GENSCAN_predicted_peptide_1|82_aa MPHPPKLQCGEELKEDKDWRQRDDLKNPLTVLTFPSRFGIWRNKSGLVLPLGFRCGVFKT KSKTITGSFSSRDKNCIANLEQ >gi568815590r:76883264_77084178|GENSCAN_predicted_CDS_1|249_bp atgccccatccccccaagctgcagtgtggagaagaactaaaggaggacaaagactggagg cagagagacgatttaaagaacccactgaccgttttgacctttccttccaggtttggcatt tggaggaataaatcaggattagtgctgccattaggcttcagatgtggtgtcttcaagaca aaaagtaagaccattactggcagttttagctctcgggacaagaactgcattgctaatttg gaacagtaa >gi568815590r:76883264_77084178|GENSCAN_predicted_peptide_2|344_aa MVYFAQVLHTVQELVLASASDKDLRRLPIMAEGEGGAVVSHIFHEEEERWRGKGFNWTSI TTIIQDPHGAQVFHVAAQMKRSFSSKKGRYQSYGIAEQKAADSFCRLKRPCLTALKRAVV IPAQHLSSENGQTASSKLQTTIREYYKHLYANKLENLEEMDKFLDTYPLQILNQEEVESL NRPITGSEIETIINSLPTKKSTGPDGFTAEFYQRYKEELVLFLLKQFQSIEKEGVFPNSF YEASIILIPKPGRDTTKKENFRPISLMNINAKMLNNILANQIQQHIKKLIHHDQVSFIPG MQGWFNIGKSINVIHYRNRTNDKNHIIISIDAEKAFDKIEQPSC >gi568815590r:76883264_77084178|GENSCAN_predicted_CDS_2|1035_bp atggtttactttgctcaggttctgcatactgtgcaagaactggtactggcatctgcttct gataaggacctcagaaggcttccaatcatggcagaaggtgaaggaggagcagttgtgtca catatctttcatgaagaagaagaaagatggcgtgggaagggctttaactggacatcaatc acaactattatccaagatccccatggggcccaagtttttcacgtggcagcccaaatgaag agaagtttctctagcaaaaaagggagataccaatcatatggcatagcagaacaaaaggca gcagacagcttctgcagacttaaacgtccttgtctgacagctctgaagagagcagtggtt atcccagcacaacatttgagctctgagaatggacagactgcctcctcaaaattacaaact accatcagagaatactataaacacctttatgcaaataaactagaaaatctagaagaaatg gataaatttctggacacataccctctccaaatactaaaccaggaagaagttgaatctctg aatagaccaataacaggctctgaaattgagacaataattaatagtctaccaaccaaaaaa agtacaggaccagacggattcacagccgaattctaccagaggtacaaggaggagctggta ctattccttctgaaacaattccaatcaatagaaaaagagggagtcttccctaattcattt tatgaggccagcatcatcctgataccaaagcctggcagggacacaacaaaaaaagagaat tttagaccaatatccctgatgaacatcaatgcgaaaatgctcaataacatactggcaaac caaatccagcagcacatcaaaaagcttatccaccacgatcaagtcagcttcatccctggc atgcaaggctggttcaacataggcaaatcaataaatgtaattcattatagaaacagaacc aacgacaaaaaccatatcattatctcaatagatgcagaaaaggccttcgacaaaattgaa caaccttcatgctaa >gi568815590r:76883264_77084178|GENSCAN_predicted_peptide_3|192_aa MLPDFKLYYKASVTKTAWYWYQNRDIDLWNRTEPSEIIPHIYNHLIFDKPEKNKKWGKDS LFNKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKNIKALEENLGNNIQDIGM GKDFITKTPKAMATKAKIDKCDLIKLKSFCTAKETTIKANRQPTEWEKIFAIYSSDKGLI SRIYKDLKQIYK >gi568815590r:76883264_77084178|GENSCAN_predicted_CDS_3|579_bp atgctacctgacttcaaactatactacaaggcttcagtaaccaaaacagcatggtactgg taccaaaacagagatatagacctatggaacagaacagagccctcagaaataataccacac atctacaaccatctgatctttgacaaacctgagaaaaacaagaaatggggaaaggattcc ctatttaataaatggtgctgggaaaactggctagccatatgtagaaagctgaaactggat cccttcctaacaccttacacaaaaatcaattcacgatggattaaagacttaaatgttaga cctaaaaacataaaagccctagaggaaaacctaggcaataacattcaggacataggcatg ggcaaggacttcattactaaaacaccaaaagcaatggcaacaaaagccaaaatagacaaa tgtgatctaattaaactaaagagcttctgcacagcaaaagaaactaccatcaaagcgaac aggcaacctacagaatgggagaaaatttttgcaatctactcatctgacaaagggctaata tccagaatctacaaagacctcaaacaaatttacaagtaa >gi568815590r:76883264_77084178|GENSCAN_predicted_peptide_4|39_aa MDAAVGCYPKHGSLRVIHKNLLYFKPLSTPASAEPYLRP >gi568815590r:76883264_77084178|GENSCAN_predicted_CDS_4|120_bp atggatgcagctgtaggctgttatcctaagcatggatccctgagagttatacataaaaat ctcttgtacttcaaacccctttcaacgccagcttctgcagaaccctatctgcgaccttag >gi568815590r:76883264_77084178|GENSCAN_predicted_peptide_5|305_aa MASRKENAKSANRVLRISQLDALELNKALEQLVWSQFTQCFHGFKPGLLARFEPEVKACL WVFLWRFTIYSKNATVGQSVLNIKYKNDFSPNLRYQPPSKNQKIWYAVCTIGGRWLEERC YDLFRNHHLASFGKVKQCVNFVIGLLKLGGLINFLIFLQRGKFATLTERLLGIHSVFCKP QNICEVGFEYMNRELLWHGFAEFLIFLLPLINVQKLKAKLSSWCIPLTGAPNSDNTLATS GKECALCGEWPTMPHTIGCEHIFCYFCAKSSFLFDVYFTCPKCGTEVHSLQPLKSGIEMS EVNAL >gi568815590r:76883264_77084178|GENSCAN_predicted_CDS_5|918_bp atggcttccagaaaagagaatgcgaagagtgcaaacagagtgctaagaataagccagttg gatgcacttgaactaaacaaggccctggagcagctagtttggtcccagtttactcagtgc tttcatggatttaaacctgggctgttagctcgctttgagccagaggtgaaagcgtgctta tgggttttcttgtggagattcaccatctactccaaaaatgccacagtgggacagtcagtt ttgaatattaagtacaaaaatgatttttcccctaacctgagatatcagccacccagtaaa aatcaaaaaatctggtatgctgtttgtacaattggtggcaggtggttagaagaacgatgc tatgatttgtttcgaaaccatcatttagcatcatttgggaaagtcaagcagtgtgtgaat tttgtgattggacttttgaaattaggtgggctgattaattttttgattttccttcagagg ggaaagtttgcaactttgacagaacgtctcctaggtattcattctgtattttgcaagcct caaaacatatgtgaagttggctttgaatacatgaatagggaacttctctggcatggtttt gctgaatttctgatttttctcttaccacttatcaatgtccagaagttgaaagccaagctg tcttcatggtgtattcctcttactggtgcacctaatagtgacaatacattagccaccagt ggcaaagaatgcgctctatgtggagagtggcccaccatgcctcacaccataggatgtgag catattttctgttatttctgtgctaagagtagtttcttatttgacgtgtactttacttgt cctaagtgtggcacagaagtacacagtctgcagccactgaaatcaggaatcgagatgtca gaagtaaatgctctttag >gi568815590r:76883264_77084178|GENSCAN_predicted_peptide_6|481_aa MVTTILQKVREGFLGEAIPKQPPDISMSAWYSYSRILSLEKGSKGVEENDEIKITRNSIN PIMESSSLYSSTTRFVIIAALICLFQGDRKPINREKKANGDNYKVVKLHDLPAENKTSGL HSGSLGQTLTLGVCETPTLHPNLDYQGFRNSQASGATQEPEVAEKKKKKLHQRELRFSGA AEGTGWPNNRERRQAEEPKGRKAEADTRARRGYSSPYANSGNSYSTLLTRSFPYHQNEFH FRPNLAIIVSPNPGVCSTKILTCLILPRHLLFGGSKLKEVGKLPPFPSVERSTTSTMLED ASEPLHTEIQTTIREYYKHLYTNKLENLEEIDKLLDTYTLPRLNQEEVESLNRPITGSEI VAIINSLPTKKSPGPDGFTAKFYQRYKEEMVPLLLKLFQSIEKEGILPNSFYEASIILIQ KPGRDTTKKENFRPISLMNIDAKILNKILANGIQQHIKNLSTMIKWASSLECKAGSIYSN Q >gi568815590r:76883264_77084178|GENSCAN_predicted_CDS_6|1446_bp atggttacaactatacttcagaaggtcagagaaggcttcctgggagaggcaatacctaag cagccacctgatattagcatgtcagcctggtacagctattctagaatactctctttagaa aagggcagcaaaggagtagaagaaaatgatgaaatcaaaataactagaaattcaattaac ccaattatggaatcttcttccctctacagcagtacaacaagatttgttataatagcagct ttgatttgccttttccagggagacaggaagccaataaacagggaaaaaaaagcaaacggc gacaattacaaggtcgtgaaactccacgacctcccagctgaaaacaagacctcgggtttg cactccggctctctagggcagacactgaccttaggagtctgcgaaacgcccacgctccac ccgaatctggattaccaaggcttccggaactcgcaggcttccggagcgacgcaggagccg gaagtggctgaaaaaaaaaagaaaaagttacaccaacgagaactgcgctttagcggcgct gctgaaggcactggatggccaaacaaccgcgaacgccgacaagccgaagagccgaagggc cgaaaagccgaagcggacacccgcgcgcgacgaggttacagcagcccctatgccaactct gggaactcctattcaacgttactgacccgttcattcccataccatcagaatgaatttcac ttcaggcctaaccttgccatcattgtttctcccaatcctggtgtttgctctactaaaatc cttacatgcttaattttgcctaggcatttgctttttggaggatctaaactaaaagaagtg ggaaagttgccacctttcccttctgtagaaagatctaccacttcaactatgctagaagat gcttcagaacctctccacacagaaatacaaactaccatcagagaatactacaaacacctc tacacaaataaactagaaaatctagaagaaatagataaattgctcgacacatacaccctc ccaagactaaaccaggaagaagttgaatctctgaataggccaataacaggctctgaaatt gtggcaataatcaatagcttaccaaccaaaaagagtccaggaccagatggattcacagcc aaattctatcagaggtacaaggaggaaatggtaccactccttctgaaactattccaatca atagaaaaagagggaatcctccctaactcattttatgaggccagcatcatcctgatacaa aagcctggcagagacacaacaaaaaaagagaattttagaccaatatccttgatgaacatt gatgcaaaaatcctcaataaaatactggcaaatggaatccagcagcacatcaaaaactta tccaccatgatcaagtgggcttcatccctggaatgcaaggctggttcaatatactcaaat caataa >gi568815590r:76883264_77084178|GENSCAN_predicted_peptide_7|96_aa MVELSGTIKELDDTGVMISVTYQFNATLAPAAARLSQENGCRLMQAKQVVAMIAAAAPDL ALLLKQINKTSVRDRLVSLSDRLSEPYPAPFQFPST >gi568815590r:76883264_77084178|GENSCAN_predicted_CDS_7|291_bp atggtagagttaagtggcactattaaagagctagatgatacaggggtaatgatctctgtc acataccaatttaatgccactctggcccctgcagcagccagattgagccaggaaaacggc tgtagactaatgcaggctaaacaagtagtagccatgattgcagctgctgcaccagatttg gcactgttgctaaaacagatcaacaagacctcagtcagggacagacttgtatcattgtct gacagactctcagagccttatccggctcccttccaattcccttccacatag >gi568815590r:76883264_77084178|GENSCAN_predicted_peptide_8|218_aa MEDQMNEIKQEEKFREKRIKRNEQSLQEIWEYVKRPNLCLIGVPESDGENGTKLENTLQD IIQENFPNLARQASIQIQEIQRTPQRYSLRRATPRHIIVRFTKVEMKEKMLRATREKGQV IHKGKPIRLTADLLVETLQARREWGPIFNILKEKNFQPRISYPAKLNFISEGEIKYFTDK QMLRDFVTTRPALEELLKEALNMERNNQYQPLQKHAKL >gi568815590r:76883264_77084178|GENSCAN_predicted_CDS_8|657_bp atggaagatcaaatgaatgaaatcaagcaagaagagaagtttagagaaaaaagaataaaa agaaatgaacaaagcctccaagaaatatgggaatatgtgaaaagaccaaatctatgtctg attggtgtacctgaaagtgacggggagaatggaaccaagttggaaaacactctgcaggat atcatccaggagaacttccccaacctagcaaggcaggccagcattcaaattcaggaaata cagagaacgccacaaagatactccttgagaagagcaactccaagacacataattgtcaga ttcaccaaagttgaaatgaaggaaaaaatgttaagggcaaccagagagaaaggtcaggtt atccacaaagggaagcccatcagactaacagctgatctcttagtagaaactctacaagca agaagagagtgggggccaatattcaacattcttaaagaaaagaattttcaacccagaatc tcatatccagccaaattaaacttcataagtgaaggagaaataaaatactttacagacaaa caaatgctgagagattttgtcaccaccaggcctgccctagaagagctcctgaaggaagca ctaaacatggaaaggaacaaccagtaccagccactgcaaaaacatgccaaattgtaa >gi568815590r:76883264_77084178|GENSCAN_predicted_peptide_9|410_aa MNPDAKILNKILANQIQQHIKKLIHHEEVGFIPGMQGWFNIRKSINVIQHINRTNDKNHM IISIDAEKAFDKIQQHFMLKTLNKLDIDGAYLEIIRAIYDKPTANIKLNGQKLEAFPLTT GTRQGCPLSPLLFNIVLEVLARAIKQEKERKGIQLGKEEVKLSLFAGDMIIYLENPIISA QNILKLISNFSKVSGYKINVQKSQAFLYSNNRQTESQIMSELLFSSASRRIKYLGIQLTR NVKDLFKENYKPLLNKIKEGTNKWKNIPCCWIGRINIMKMAILSKVIYRFNAIPIKLPMT FFTELEKTTLKFIWNQNTARIAKSILSKKNKAGGIMLPDFKLYYKATVTKTAWYWYQNRD IDQWNRTESSEIMPHIYNHLIFDKPEKNKKWGKDSLFIIGNIINGAGKTG >gi568815590r:76883264_77084178|GENSCAN_predicted_CDS_9|1233_bp atgaaccctgatgcaaaaatcctcaataaaatactggcaaaccaaatccagcaacacatc aaaaagcttatccaccatgaagaagttggcttcatccctgggatgcaaggctggttcaac atacgcaaatcaataaacgtaatccagcatataaacagaaccaatgacaaaaaccacatg attatctcaatagatgcagaaaaggcctttgacaaaattcaacagcacttcatgctaaaa actctcaataaattagatattgatggggcatatctcgaaataataagagctatttatgac aaacccacagccaatatcaaactgaatgggcaaaaactggaagcattccctttgacaact ggcacaagacagggatgccctctctcaccactcctattcaacatagtgttggaagttctg gccagggcaatcaagcaggagaaggaaagaaagggtattcaattaggaaaagaggaagtc aaattgtccctgtttgcaggtgacatgatcatatatctagaaaaccccatcatctcagcc caaaatatccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgtg caaaaatcacaagcattcttatacagcaataacagacaaacagagagccaaatcatgagt gaactcctattctcaagtgcttcaaggagaataaaatacctaggaatccaacttacaagg aatgtgaaggacctcttcaaggagaactacaaaccactgctcaacaaaataaaagagggt acaaacaaatggaagaacattccatgctgctggataggaagaatcaatatcatgaaaatg gccatactgtccaaggtaatttatagattcaatgccatccccatcaagctaccaatgact ttcttcacagaattggaaaaaactactttgaagttcatatggaaccaaaatacagcccgc attgccaagtcaatcctaagcaaaaagaacaaagctggaggcatcatgctacctgacttc aaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacagagat atagatcaatggaacagaacagagtcctcagaaataatgccacatatctacaaccatctg atctttgacaaacctgagaaaaacaagaaatggggaaaggattccctatttataataggg aatataataaatggtgctgggaaaactggctag >gi568815590r:76883264_77084178|GENSCAN_predicted_peptide_10|133_aa MELAGWEPMLLDAAAAPQLWFQIQASLCSQGFRKPLPLQASKCLLLLPDLSLIPVPPEQS CGRAQAVSQTARDINGAGGHYPQQTNAGTENQTSHVLTYKWELNDENTWTHGGEKLTLGP VGVGGQEEKEHQD >gi568815590r:76883264_77084178|GENSCAN_predicted_CDS_10|402_bp atggagttggcagggtgggagcccatgcttctggatgcagctgcagcccctcagctgtgg ttccagatccaggcatccctgtgttctcagggtttcaggaagcctctgcccctgcaagct tcgaagtgcctgctcctgctacctgacctctccctgatcccagtgcccccagagcaaagc tgtggccgagctcaagcagtgtcgcaaacagccagggacataaatggagctggaggccat tatcctcagcaaactaatgcaggaacagaaaaccaaacatcgcatgttctcacttacaag tgggagctgaatgatgagaacacatggacacatggaggagaaaaactcacactggggccg gttggggttggggggcaggaagagaaggagcatcaggactaa