GENSCAN 1.0 Date run: 3-Nov-116 Time: 15:11:34

Sequence gi568815587r:68057691_68290076 : 232386 bp : 45.92% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.10 Intr -   8204   8096  109  1  1   94   91   143 0.976  15.49
 1.09 Intr -   8826   8739   88  1  1   87   84    82 0.899   6.73
 1.08 Intr -  11247  11189   59  2  2   73  116   -31 0.835  -3.27
 1.07 Intr -  12603  12499  105  2  0   55  105    45 0.788   2.23
 1.06 Intr -  13167  13034  134  0  2  120  106   119 0.963  16.34
 1.05 Intr -  17140  17027  114  1  0   92  111    75 0.920  10.84
 1.04 Intr -  18893  18738  156  2  0   81   46    58 0.356   1.11
 1.03 Intr -  23767  23714   54  1  0   67  108    18 0.239   0.88
 1.02 Intr -  39440  39329  112  1  1   64  101   101 0.155   9.38
 1.01 Init -  63487  63138  350  1  2   57   67   567 0.447  46.16
 1.00 Prom -  64738  64699   40                              -4.06

 2.13 PlyA -  80216  80211    6                               1.05
 2.12 Term -  84616  84526   91  0  1   87   54   108 0.689   4.49
 2.11 Intr -  90737  90605  133  0  1  102   83    52 0.933   5.80
 2.10 Intr -  95692  95547  146  0  2  100   72    82 0.978   7.73
 2.09 Intr - 101481 100080 1402  1  1   89   23   668 0.005  48.70
 2.08 Intr - 109488 109292  197  2  2  127  108   102 0.945  15.16
 2.07 Intr - 113461 113325  137  1  2   43  116   118 0.908   9.47
 2.06 Intr - 113561 113542   20  0  2   65  121   -11 0.776  -3.77
 2.05 Intr - 114019 113853  167  0  2   39  103   118 0.865   8.00
 2.04 Intr - 116223 116114  110  0  2  107   38    24 0.862  -1.62
 2.03 Intr - 117493 117328  166  0  1   98  103    96 0.981  12.06
 2.02 Intr - 128238 128091  148  1  1  122   59    67 0.943   6.49
 2.01 Init - 132359 132227  133  2  1   79   42    66 0.678   1.40
 2.00 Prom - 147715 147676   40                              -4.26

 3.00 Prom + 149656 149695   40                              -3.86
 3.01 Init + 155654 155947  294  1  0   71  105   190 0.536  14.19
 3.02 Term + 199926 200129  204  2  0   40   52   211 0.775  10.07
 3.03 PlyA + 201529 201534    6                              -0.45

 4.04 PlyA - 203671 203666    6                              -1.75
 4.03 Term - 205228 203955 1274  2  2   73   55   769 0.518  63.86
 4.02 Intr - 214370 214167  204  0  0  104   90   134 0.804  14.37
 4.01 Init - 225501 225495    7  0  1   84   99     0 0.147   1.73
 4.00 Prom - 227057 227018   40                              -2.06

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  72489  72597  109  0  1   79   52   132 0.894   6.68
S.002 Init + 175671 175719   49  2  1   89   58    47 0.801   0.92
S.003 Term + 180100 180230  131  2  2   71   55   111 0.903   4.44

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815587r:68057691_68290076|GENSCAN_predicted_peptide_1|427_aa
MKTKFCTGGEAEPSPLGLLLSCGSGSAAPAPGVGQQRDAASDLESKQLGGQQPPLALPPP
PPLPLPLPLPQPPPPQPPADEQPEPRTRRRAYLWCKEFLPGAWRGLREDEFHISVIRGGL
SNMLFQCSLPDTTATLGDEPRKVLLRLYGAILQMRSCNKEGSEQAQKENEFQHEGGGGDH
SLQSTRTVLPPGEQGSIMSKQRLGGGFADPCLNAPDTFCDVGLGGAEAMVLESVMFAILA
ERSLGPKLYGIFPQGRLEQFIPSRRLDTEELSLPDISAEIAEKMATFHGMKMPFNKEPKW
LFGTMEKYLKEVLRIKFTEESRIKKLHKLLSYNLPLELENLRSLLESTPSPVVFCHNDCQ
EGNILLLEGRENSEKQKLMLIDFEYSSYNYRGFDIGNHFCEWMYDYSYEKYPFFRANIRK
YPTKKQQ

>gi568815587r:68057691_68290076|GENSCAN_predicted_CDS_1|1281_bp
atgaaaaccaaattctgcaccgggggcgaggcggagccctcgccgctcgggctgctgctg
agctgcggtagcggcagcgcggccccggcgcccggcgtggggcagcagcgcgacgccgcc
agcgacctcgagtccaagcagctgggcggccaacagccgccgctcgcgctgccccctccg
ccgccgctgccgctgccgctgccgctgccccagcccccgccgccgcagccgcccgcagac
gagcagccggagccccggacgcggcgcagggcctatctgtggtgcaaggagttcctgccc
ggcgcctggcggggcctccgcgaggacgagttccacatcagtgtcatcagaggcggcctt
agcaacatgctgttccagtgctccctacctgacaccacagccacccttggtgatgagcct
cggaaagtgctcctgcggctgtatggagcgattttgcagatgaggtcctgtaataaagag
ggatccgaacaagctcagaaagaaaatgaatttcaacatgagggtggtggtggggaccac
agtcttcagagcaccaggaccgtgctgccccctggggagcaggggagcataatgtctaag
cagagacttggaggaggctttgcagacccttgcttaaatgccccggatactttctgtgat
gttgggcttggaggggctgaggccatggttctggagagcgttatgtttgccattctcgca
gagaggtcacttgggccaaaactctatggcatctttccccaaggccgactggagcagttc
atcccgagccggcgattagatactgaagaattaagtttgccagatatttctgcagaaatc
gccgagaaaatggctacatttcatggtatgaaaatgccattcaataaggaaccaaaatgg
ctttttggcacaatggaaaagtatctaaaggaagtgctgagaattaaatttactgaggaa
tccagaattaaaaagctccacaaattgctcagttacaatctgcccttggaactggaaaac
ctgagatcattgcttgaatctactccatctccagttgtattttgtcataatgactgtcaa
gaaggtaatatcttgttgctggaaggccgagagaattctgaaaaacagaaactgatgctc
attgatttcgaatacagcagttacaattacaggggattcgacattggaaatcacttctgt
gagtggatgtatgattatagctatgaaaaatacccttttttcagagcaaacatccggaag
tatcccaccaagaaacaacag

>gi568815587r:68057691_68290076|GENSCAN_predicted_peptide_2|949_aa
MVVNGRRNGGKLSNDHQQNQSKLQHTGKDTLKAGKNAVERRSNRCNGNSGFEGQSRYVPS
SGMSAKELCENDDLATSLVLDPYLGFQTHKMNTRFRPIKGRQEELKEVIERFKKDEHLEK
AFKCLTSGEWARHYFLNKNKMQEKLFKEHVFIYLRMFATDSGFEILPCNRYSSEQNGAKI
VATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMYSTRKNCAQLWLGPAAFIN
HDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGENNEFCECYTCERRGTGAF
KSRVGLPAPAPVINSKYGLRETDKRLNRLKKLGDSSKNSDSQSVSSNTDADTTQEKNNAT
SNRKSSVGVKKNSKSRTLTRQSMSRIPASSNSTSSKLTHINNSRVPKKLKKPAKPLLSKI
KLRNHCKRLEQKNASRKLEMGNLVLKEPKVVLYKNLPIKKDKEPEGPAQAAVASGCLTRH
AAREHRQNPVRGAHSQGESSPCTYITRRSVRTRTNLKEASDIKLEPNTLNGYKSSVTEPC
PDSGEQLQPAPVLQEEELAHETAQKGEAKCHKSDTGMSKKKSRQGKLVKQFAKIEESTPV
HDSPGKDDAVPDLMGPHSDQGEHSGTVGVPVSYTDCAPSPVGCSVVTSDSFKTKDSFRTA
KSKKKRRITRYDAQLILENNSGIPKLTLRRRHDSSSKTNDQENDGMNSSKISIKLSKDHD
NDNNLYVAKLNNGFNSGSGSSSTKLKIQLKRDEENRGSYTEGLHENGVCCSDPLSLLESR
MEVDDYSQYEEESTDDSSSSEGDEEEDDYDDDFEDDFIPLPPAKRLRPLSLTHIHGNCCK
AANLEDRRQTQARRDCFGNVLKKPDLLPDPASSLPGKHTRTSVKCGNCQDACVEEAWQED
QGECPLFLLGNQAKQASWRRKWKERLIGHRRRHWACVSPMSPWFISTSP

>gi568815587r:68057691_68290076|GENSCAN_predicted_CDS_2|2850_bp
atggtggtgaatggcaggagaaatggaggcaagttgtctaatgaccatcagcagaatcaa
tcaaaattacagcacacggggaaggacaccctgaaggctggcaaaaatgcagtcgagagg
aggtcgaacagatgtaatggtaactcgggatttgaaggacagagtcgctatgtaccatcc
tctggaatgtccgccaaggaactctgtgaaaatgatgacctagcaaccagtttggttctt
gatccctatttaggttttcaaacacacaaaatgaatactagatttaggcctattaaagga
aggcaggaagaactaaaggaagtaattgaacgttttaagaaagatgaacacttggagaaa
gccttcaaatgtttgacttcaggcgaatgggcacggcactattttctcaacaagaataaa
atgcaggagaaattattcaaagaacatgtatttatttatttgcgaatgtttgcaactgac
agtggatttgaaatattgccatgtaatagatactcatcagaacaaaatggagccaaaata
gttgcaacaaaagagtggaaacgaaatgacaaaatagaattactggtgggttgtattgcc
gaactttcagaaattgaggagaacatgctacttagacatggagaaaacgacttcagtgtc
atgtactccacaaggaaaaactgtgctcaactctggctgggtcctgctgcgtttataaac
catgattgcagacctaattgtaagtttgtgtcaactggtcgagatacagcatgtgtgaag
gctctaagagacattgaacctggagaagaaatttcttgttattatggagatgggttcttt
ggagaaaataatgagttctgcgagtgttacacttgcgaaagacggggcactggtgctttt
aaatccagagtgggactgcctgcgcctgctcctgttatcaatagcaaatatggactcaga
gaaacagataaacgtttaaataggcttaaaaagttaggtgacagcagcaaaaattcagac
agtcaatctgtcagctctaacactgatgcagataccactcaggaaaaaaacaatgcaact
tctaaccgaaaatcttcagttggcgtaaaaaagaatagcaagagcagaacgttaacgagg
caatctatgtcaagaattccagcttcttccaactctacctcatctaagctaactcatata
aataattccagggtaccaaagaaactgaagaagcctgcaaagcctttactttcaaagata
aaattgagaaatcattgcaagcggctggagcaaaagaatgcttcaagaaaactcgaaatg
ggaaacttagtactgaaagagcctaaagtagttctgtataaaaatttgcccattaaaaaa
gataaggagccagagggaccagcccaagccgcagttgccagcgggtgcttgactagacac
gcggcgagagaacacagacagaatcctgtgagaggtgctcattcgcagggggagagctcg
ccctgcacctacataactcggcggtcagtgaggacaagaacaaatctgaaggaggcctct
gacatcaagcttgaaccaaatacgttgaatggctataaaagcagtgtgacggaaccttgc
cccgacagtggtgaacagctgcagccagctcctgtgctgcaggaggaagaactggctcat
gagactgcacaaaaaggggaggcaaagtgtcataagagtgacacaggcatgtccaaaaag
aagtcacgacaaggaaaacttgtgaaacagtttgcaaaaatagaggaatctactccagtg
cacgattctcctggaaaagacgacgcggtaccagatttgatgggtccccattctgaccag
ggtgagcacagtggcactgtgggcgtgcctgtgagctacacagactgtgctccttcaccc
gtcggttgttcagttgtgacatcagatagcttcaaaacaaaagacagctttagaactgca
aaaagtaaaaagaagaggcgaatcacaaggtatgatgcacagttaatcctagaaaataac
tctgggattcccaaattgactcttcgtaggcgtcatgatagcagcagcaaaacaaatgac
caagagaatgatggaatgaactcttccaaaataagcatcaagttaagcaaagaccatgac
aacgataacaatctctatgtagcaaagcttaataatggatttaactcaggatcaggcagt
agttctacaaaattaaaaatccagctaaaacgagatgaggaaaatagggggtcttataca
gaggggcttcatgaaaatggggtgtgctgcagtgatcctctttctctcttggagtctcga
atggaggtggatgactatagtcagtatgaggaagaaagtacagatgattcctcctcttct
gagggcgatgaagaggaggatgactatgatgatgactttgaagacgattttattcctctt
cctccagctaagcgcttgaggccgttaagcttaacacatattcatggaaactgttgtaaa
gctgctaatctggaggacagaagacagacgcaggcccgtagagattgctttggtaatgtt
ctgaaaaagccagatctgttgccagatcccgcgtcctccttgccagggaaacacacacga
acatcagtaaaatgtggtaattgccaggatgcatgtgtggaagaagcgtggcaggaggac
caaggagagtgccccctgtttctgctggggaaccaagcaaagcaggcttcatggaggagg
aaatggaaagagcgcctcatcggccatcgccgcaggcactgggcttgtgtgtccccgatg
tcgccctggtttatctccacgtccccatga

>gi568815587r:68057691_68290076|GENSCAN_predicted_peptide_3|165_aa
MSPRSVRAPRPQPAAAAAVAPPAPPASARCAPGARPRAPPPGASPRLAHGRPYENRPPAS
GLRVPGPCAARCRLRFPGVLTSSPRCPAARTAAASPPPRNWLARGHALSSSRPCPAGALK
ALEEPLLPTAVVCSATPNICRQIRCECSALTALEEPLLPVAIVCS

>gi568815587r:68057691_68290076|GENSCAN_predicted_CDS_3|498_bp
atgagtccccgcagcgtccgagccccccgcccccaaccggccgccgccgccgccgtcgcg
ccccccgcgcccccggcctcggctcgctgcgcccccggcgcccgcccccgcgccccgcca
cccggggcctcacctcgcctcgcgcacggccgcccgtacgaaaaccgtccgccggcttca
ggactccgcgtcccaggtccgtgcgccgcgcggtgccgcctccggttcccgggcgtcctc
acctcatcgccccgctgccctgccgcccgcaccgccgccgcgtcgccgcccccgagaaac
tggctggctcgaggccatgccctgtcgagctcgaggccatgccctgccggggctctcaag
gctttggaggagcctctgctccctacggccgtggtgtgctcagccacgcccaacatttgc
agacagatccggtgtgaatgctcagctctcacggctttggaggagcctctgctccctgtg
gccatcgtgtgctcctga

>gi568815587r:68057691_68290076|GENSCAN_predicted_peptide_4|494_aa
MPGPAAAALRPGQQARPALRDVISGVWAGPGQGALPGAAWVALPATSPRRASGSFLLSEG
HQAAGGLLRPGNFVPNKMWKGLVKRNASVETVDNKTSEDVTMAAASPVTLTKGTSAAHLN
SMEVTTEDTSRTDVSEPATSGGAADGVTSIAPTAVASSTTAASITTAASSMTVASSAPTT
AASSTTVASIAPTTAASSMTAASSTPMTLALPAPTSTSTGRTPSTTATGHPSLSTALAQV
PKSSALPRTATLATLATRAQTVATTANTSSPMSTRPSPSKHMPSDTAASPVPPMRPQAQG
PISQVSVDQPVVNTTNKSTPMPSNTTPEPAPTPTVVTTTKAQAREPTASPVPVPHTSPIP
EMEAMSPTTQPSPMPYTQRAAGPGTSQAPEQVETEATPGTDSTGPTPRSSGGTKMPATDS
CQPSTQGQYMVVTTEPLTQAVVDKTLLLVVLLLGVTLFITVLVLFALQAYESYKKKDYTQ
VDYLINGMYADSEM

>gi568815587r:68057691_68290076|GENSCAN_predicted_CDS_4|1485_bp
atgccaggccccgccgcagccgctcttcgccccggccagcaggcccgccccgccctccgt
gacgtcatttccggcgtttgggcggggcccgggcagggcgcgctgcccggagctgcctgg
gttgcgctgccggccacgtccccgcgccgggcctcaggctccttcctactgtccgagggc
caccaggccgccgggggcctgctgcgcccgggcaactttgtccctaacaaaatgtggaag
ggattagtcaagaggaatgcatctgtggaaacagttgataataaaacgtctgaggatgta
accatggcagcagcttctcctgtcacattgaccaaagggacttcggcagcccacctcaac
tctatggaagtcacaacagaggacacaagcaggacagatgtgagtgaaccagcaacttca
ggaggtgcagctgatggtgtgacctccattgctcccacggctgtggcctccagtacgact
gcggcctccattacgactgcggcctccagtatgactgtggcctccagtgctcccacgact
gcagcctccagtacaactgtggcctccattgctcccacgactgcagcctccagtatgact
gcggcctccagcactcccatgacacttgcactccccgcgcccacgtccacttccacaggg
cggaccccgtccactaccgccactgggcatccatctctcagcacagccctcgcacaagtg
ccaaagagcagcgcgttgccaagaacagcaaccctggccacattggccacacgtgctcag
actgtagcgaccacagcaaacacaagcagccccatgagcactcgtccaagtccttccaag
cacatgcccagtgacaccgcggcaagccctgtaccccctatgcgtccccaagcacaaggt
cccattagccaggtgtcagtggaccagcctgtggttaacacaacaaataaatccacaccc
atgccctcaaacacaaccccagagcccgcccccacccccacagtggtgaccaccaccaag
gcacaagccagggagccaactgccagcccagtgccagtacctcacaccagcccaatccct
gagatggaggccatgtcccccacgacacagccaagccccatgccatatacccagagggcc
gctgggccaggcacatcccaggcaccggagcaggtagagactgaagccacaccaggtact
gattccactgggccaacacccaggagctcagggggcactaagatgccagccacggactcg
tgccagcccagcacccaaggccagtacatggtggtcaccactgagcccctcacccaggcc
gtggtagacaaaactctccttctggtggtgctgttactcggggtgacccttttcatcaca
gtcttggttttgtttgccctgcaggcctatgagagctacaagaagaaggactacacccag
gtggactacttaatcaacgggatgtatgcggactcagaaatgtga