GENSCAN 1.0 Date run: 3-Nov-116 Time: 22:36:36 Sequence gi568815591f:90312677_90513408 : 200732 bp : 38.91% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 Intr - 1519 1414 106 0 1 105 76 73 0.351 7.10 1.03 Intr - 8037 7905 133 1 1 50 89 96 0.874 4.68 1.02 Intr - 8881 8578 304 1 1 37 89 197 0.804 9.94 1.01 Init - 22096 22028 69 1 0 25 75 99 0.346 1.41 1.00 Prom - 23622 23583 40 -5.75 2.04 PlyA - 23811 23806 6 1.05 2.03 Term - 34327 33930 398 2 2 55 44 359 0.429 22.45 2.02 Intr - 36428 36271 158 1 2 65 -77 211 0.298 1.03 2.01 Init - 36620 36487 134 2 2 62 98 109 0.778 8.96 2.00 Prom - 37711 37672 40 -3.65 3.00 Prom + 37997 38036 40 -9.45 3.01 Init + 38196 38312 117 2 0 60 37 90 0.718 1.35 3.02 Intr + 40140 40333 194 2 2 57 108 219 0.717 18.17 3.03 Intr + 41782 41873 92 1 2 58 95 99 0.961 6.32 3.04 Intr + 42410 42554 145 0 1 93 110 89 0.976 10.02 3.05 Term + 43695 43767 73 0 1 73 41 69 0.601 -2.90 3.06 PlyA + 45026 45031 6 1.05 4.02 PlyA - 45218 45213 6 1.05 4.01 Sngl - 47540 47100 441 2 0 71 33 223 0.877 11.10 4.00 Prom - 47594 47555 40 -5.65 5.02 PlyA - 47711 47706 6 1.05 5.01 Sngl - 51198 50581 618 0 0 45 42 328 0.717 19.84 5.00 Prom - 52010 51971 40 -9.85 6.05 PlyA - 53080 53075 6 1.05 6.04 Term - 53586 53172 415 2 1 1 48 291 0.894 10.15 6.03 Intr - 53927 53696 232 0 1 40 72 111 0.490 0.61 6.02 Intr - 55728 55514 215 2 2 -9 102 110 0.543 0.14 6.01 Init - 56243 56098 146 2 2 88 81 117 0.963 10.64 6.00 Prom - 57700 57661 40 -6.45 7.00 Prom + 58027 58066 40 -9.05 7.01 Init + 59303 59325 23 1 2 76 92 -19 0.448 -3.49 7.02 Intr + 59479 59552 74 1 2 73 103 49 0.530 3.03 7.03 Intr + 61626 61678 53 1 2 104 96 20 0.565 2.11 7.04 Intr + 65458 65535 78 0 0 128 56 28 0.829 2.43 7.05 Intr + 70280 70403 124 1 1 30 92 136 0.982 7.44 7.06 Term + 72216 72478 263 1 2 90 36 163 0.986 6.00 7.07 PlyA + 73855 73860 6 1.05 8.00 Prom + 84198 84237 40 -5.05 8.01 Init + 90661 90873 213 0 0 55 96 187 0.296 15.02 8.02 Intr + 99980 100430 451 1 1 20 84 347 0.003 19.45 8.03 Term + 106029 106153 125 1 2 77 48 43 0.181 -3.23 8.04 PlyA + 106450 106455 6 1.05 9.06 PlyA - 108195 108190 6 1.05 9.05 Term - 109443 109309 135 0 0 86 48 59 0.478 -1.26 9.04 Intr - 109860 109793 68 1 2 32 101 67 0.437 0.11 9.03 Intr - 110217 110118 100 0 1 89 100 27 0.818 2.76 9.02 Intr - 111851 111795 57 2 0 85 75 94 0.891 5.96 9.01 Init - 128467 128369 99 1 0 51 48 173 0.128 9.91 9.00 Prom - 133721 133682 40 -4.95 10.00 Prom + 135659 135698 40 -3.65 10.01 Init + 145512 145554 43 2 1 80 60 56 0.019 2.63 10.02 Intr + 153620 153837 218 0 2 118 99 72 0.044 8.60 10.03 Term + 162357 162464 108 2 0 30 43 126 0.385 -0.17 10.04 PlyA + 163378 163383 6 1.05 11.06 PlyA - 163556 163551 6 1.05 11.05 Term - 165747 165659 89 1 2 69 49 68 0.072 -2.16 11.04 Intr - 168738 168605 134 2 2 20 -1 156 0.235 -0.73 11.03 Intr - 172805 172662 144 1 0 49 106 102 0.313 6.58 11.02 Intr - 181365 181302 64 1 1 77 94 40 0.238 0.36 11.01 Init - 182036 181988 49 2 1 83 58 65 0.389 2.20 11.00 Prom - 190130 190091 40 -3.55 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 100001 100430 430 1 1 83 84 340 0.812 27.96 S.002 Init - 113133 113125 9 0 0 72 106 0 0.837 0.78 S.003 Init + 157111 157255 145 0 1 53 96 137 0.933 9.33 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591f:90312677_90513408|GENSCAN_predicted_peptide_1|204_aa MGLGTVEQGAALVGRLGLRRSPQLRKKKEQSKKERAKKTLSLWHSKKVGGGQERREQRPK QGRRAQRPPRSGGWEVGRGQGGASRARPPTICPLTQPLEKDRGPSGGSVSGTPRGRSRTG SSTRGRWQTAVYGGKGESCVLDLFSSPYPSPLAVNVVKGTFPTAALNSRIPRYTELSCLL RFLWVVTMSQTMFLMTLTILSTGQ >gi568815591f:90312677_90513408|GENSCAN_predicted_CDS_1|612_bp atgggactgggcaccgtggagcagggggcggcgctggtcgggaggctcggtctgcgcagg agcccacagctaagaaaaaagaaggaacaaagtaagaaggagagagcgaagaagaccctg agcctttggcatagcaagaaagtcggaggagggcaggagaggcgagagcagcggccaaag cagggccgcagggcgcagaggcccccgaggtctggaggctgggaagttgggcgaggtcag ggaggggcctcgagagcgcggccacccacgatctgccccctcactcagcctctagaaaag gatcgcggtcccagcggcgggtctgtctcgggaaccccgcgtggccgcagcagaaccggc agctccacccgcgggaggtggcagacggctgtgtacggggggaaaggtgagtcctgtgtt ctggatctgttctcctctccctatccctccccgttggctgtcaatgttgtaaaagggaca tttccaacagcagccttgaatagcaggatcccacgttacaccgagttgtcatgtcttctg aggttcctgtgggttgtgacaatgtctcagaccatgtttttgatgaccttgacaattttg agtactggtcag >gi568815591f:90312677_90513408|GENSCAN_predicted_peptide_2|229_aa MIFYEKDCQPWKRTLIEKTSGKSEVYIIEDAIIIEKAMKAISPKQIYKRANQGNHERGCE YAKNGGKVGKGLQDMDLEETQELTATKPEELTEDSLMKLQTLPLPPNNRPNHARYTHHCH RLLTKRHYRRGGEVYGKTKQGKGRHEDSSGETGEQRRTRAVSAKGTRLRGPARTFLNNTQ LQCTMAATGQETFLRKRHKPQYSAAFLRHPLPPTRFLLRPHLRTGACVV >gi568815591f:90312677_90513408|GENSCAN_predicted_CDS_2|690_bp atgattttctacgaaaaggattgtcaaccatggaaaagaacactaatagagaaaacttca ggaaaatctgaagtttacataattgaagatgccattattattgaaaaagccatgaaagcc ataagccccaaacagatttacaaaagagccaatcaaggaaatcatgaaagaggctgtgaa tatgccaaaaacggcgggaaagtgggtaaaggacttcaagatatggatcttgaagaaact caagagctaacagccaccaaaccagaggaattaacagaagacagcttgatgaaacttcaa accctgcccctgcccccaaacaaccgtccaaaccacgccaggtacacacaccactgtcac cgcttactgacaaagcgccactacaggaggggcggagaagtctatgggaaaaccaagcaa ggaaaaggccgccacgaagacagtagtggagagacaggggagcaaagaagaaccagagct gtcagcgctaaggggaccaggctgaggggacccgcacggacctttctgaacaacacgcaa ctgcaatgcaccatggctgcaacaggccaggaaaccttcttgcggaagcggcacaagccg cagtacagcgcggcctttttacgtcatcccctgccgccgacgcgatttctcctccgccca cacctgcggactggcgcatgcgttgtgtga >gi568815591f:90312677_90513408|GENSCAN_predicted_peptide_3|206_aa MTLLLGSTASITSGTLYGFHGGVQGLQCSAEHDEKCKRTYGNFIDKLRLFTRGGSGGMGY PRLGGEGGKGGDVWVVAQNRMTLKQLKDRYPRKRFVAGVGANSKISALKGSKGKDCEIPV PVGISVTDENGKIIGELNKENDRILVAQGGLGGKLLTNFLPLKGQKRIIHLDLKLIADVG LVGNIMGLIGSYGQDWQSSQSQLHFA >gi568815591f:90312677_90513408|GENSCAN_predicted_CDS_3|621_bp atgactttacttcttgggagcactgccagcatcactagtggcactttgtatgggttccat ggtggtgttcaaggtttacagtgtagcgctgaacatgatgaaaaatgcaagagaacttat ggaaatttcatcgataagctaagactcttcaccaggggaggatccggtggaatgggttat cctcgtttaggtggagaaggtggaaaaggtggtgatgtctgggttgtagcccagaacaga atgactttaaaacaacttaaagacaggtatcctcggaaacggtttgtggctggagtagga gcaaacagcaaaattagtgcactgaaaggctccaaaggaaaagactgtgaaatccctgtg cctgtgggtatttcagtaactgatgaaaatggtaaaattataggagaactcaataaagaa aatgacagaattttggtagctcaaggaggtcttggtggtaaattacttacaaatttctta ccattgaaaggccagaaacgaataattcaccttgatctaaaacttatagctgatgtaggc ctagtagggaacattatgggcttaatagggtcttatggacaagactggcagtccagccaa tcacagttgcattttgcataa >gi568815591f:90312677_90513408|GENSCAN_predicted_peptide_4|146_aa MGKDFMSKTPKAMATKDKIDKWDLIKLKSFCTAKETTIGVNRQPTEWEKMFAIYSSDKGL ISRIYKVLKQIYKKKTNNPIKKWAKDMNRHFSKEDIYASNKHMKKCSSSLAVKCKSKPQG DTISHQLEWRSLKSQETTGAGEDVEK >gi568815591f:90312677_90513408|GENSCAN_predicted_CDS_4|441_bp atgggcaaggacttcatgtctaaaacaccaaaagcaatggcaacaaaagacaaaattgac aaatgggatctaattaaactaaagagcttctgcacagcaaaagaaactaccatcggagtg aacaggcaacctacagaatgggagaaaatgtttgcaatctactcatctgataaagggcta atatccagaatctacaaagtactcaagcaaatttacaagaaaaaaacaaacaaccccatc aaaaagtgggcgaaggatatgaacagacacttctcaaaagaagacatttatgcatccaac aaacatatgaaaaaatgctcatcatcactggccgtcaaatgcaaatcaaaaccacaagga gataccatctcacaccagttagaatggcgatccttaaaaagtcaggaaacaacaggtgct ggagaggatgtggagaaatag >gi568815591f:90312677_90513408|GENSCAN_predicted_peptide_5|205_aa MKQEENFREKRVKRNEQSLQEIWDYVKRPNLRLIGVPESDGENGTKSENTLQDIIQENFL NLARQANIQIQEIQRPPQRYSSKRATQRHIIVRFTKVEMKEKMLRAAREKGRVTHKGKLI TLTADLSAETLKARREWGPIFNILKEKNFQPRISYPAKLRFISEGEIKSFIDKQMLRDFV NTSPALQELLKEALNMERNNRYQPL >gi568815591f:90312677_90513408|GENSCAN_predicted_CDS_5|618_bp atgaagcaagaagagaattttagagaaaaaagagtaaaacgaaatgaacaaagcctccaa gaaatatgggactatgtgaaaagaccaaatctacgtctgattggtgtacctgaaagtgat ggggagaatggaaccaagtcagaaaacactctgcaggatattatccaggagaacttcctc aacctagcaaggcaggccaacattcaaattcaggaaatacagagaccaccacaaaggtat tcttcaaaaagagcaactcaaagacacataattgtcagattcaccaaagttgaaatgaag gaaaaaatgttaagggcagccagagagaaaggtcgggttacccacaaagggaagctcatc acactaacagctgatctctcggcagaaactctaaaagccagaagagagtgggggccaata ttcaacattcttaaagaaaagaattttcaacccagaatttcatatccagccaaactaaga ttcataagtgaaggagaaataaaatcctttatagacaagcaaatgctgagagattttgtc aacaccagccctgccctacaagagctcctgaaggaagcactaaacatggaaaggaacaac cggtaccagccactgtaa >gi568815591f:90312677_90513408|GENSCAN_predicted_peptide_6|335_aa MGRNQSRKAENSKNQSAYSPPKDCSSSPAMEQNWMENEFDKLTEVDFRSDWENGSKLENT LQDIIEENFPNLAKTANIQIQEIRRTPQRYSSRRATARHIIVRFTKVEMKKKILRAAREK EIQITIRQYYKQFYANKLENLEEMDKFLDTYMLPRLKQEEFESLNRPVTGPENEAIINCL PTKKSPGRGGFTAQFYQSLAEIQQQQQQQKENSRPISLTNINAKILNKILANRTQQHIKK LIHHDQVGFILGMQGWFNVHKSINLIHHINRSNDKNHMIISIDAEKAFEKIQQRFMLKTL KTLGINRMYLKILRAIYDKPTANLIRNKKNWKHSL >gi568815591f:90312677_90513408|GENSCAN_predicted_CDS_6|1008_bp atggggagaaaccagagcagaaaagctgaaaattccaaaaaccagagtgcctattctcct ccaaaggactgcagctcctcgccagcaatggaacaaaactggatggagaatgagtttgac aaattaacagaagtagacttcagaagtgactgggagaatggatccaagttagaaaacact cttcaggatattattgaggagaacttccccaacctagcaaagacggccaacattcaaatt caggaaatacggagaacaccacaaagatactcctcgagaagagcaaccgcaagacacata attgtcagattcaccaaagttgaaatgaagaaaaaaatattaagggcagccagagagaaa gaaatacaaattaccatcagacaatactataaacaattctatgcaaataaactagaaaat ctagaagaaatggataaattcctggacacatacatgctcccaagactaaaacaggaagaa tttgaatctctgaatagaccagtaacaggtcctgaaaatgaggcaataattaattgctta ccaaccaaaaaaagtccaggacgaggtggattcacagctcaattctaccagagcctggca gagatacaacaacaacaacaacaacaaaaagagaattctaggccaatatccctgacgaac atcaatgcgaaaatcctcaataaaatactggcaaaccgaacccagcagcacatcaaaaag cttatccaccatgatcaagttggcttcatccttgggatgcaaggctggttcaatgtacac aaatcaataaacctaatccatcacataaacagaagcaacgacaaaaaccacatgattatc tcaatagatgcagaaaaggccttcgagaaaattcaacagcgcttcatgctaaaaactctc aaaacactaggtatcaacagaatgtatctcaaaatattaagggctatttatgacaaaccc acagccaatctcatacggaataagaaaaactggaagcattccctttga >gi568815591f:90312677_90513408|GENSCAN_predicted_peptide_7|204_aa MNLSLPFIFPNAGKSSLLSCVSHAKPAIADYAFTTLKPELGKIMYSDFKQVDISGFQLSS HTQYRTAFETIILLTKELELYKEELQTKPALLAVNKMDLPDAQDKFHELMSQLQNPKDFL HLFEKNMIPERTVEFQHIIPISAVTGEGIEELKNCIRKSLDEQANQENDALHKKQLLNLW ISDTMSSTEPPSKHAVTTSKMDII >gi568815591f:90312677_90513408|GENSCAN_predicted_CDS_7|615_bp atgaatttatctttaccttttatattcccaaatgctggaaaatcctctttgctaagttgt gtttctcatgcaaaacctgcaattgcagattacgcatttacaacattaaagcctgaactt ggaaagataatgtacagtgatttcaaacaggttgatatttctggatttcagctttcttct cacactcaatacaggacagcttttgaaaccataatactgcttacaaaagagttggaattg tacaaagaggaacttcagacaaaacctgcactcttggcagttaataaaatggacttgcca gatgcccaagataagttccatgaattgatgagccagctccagaatcctaaagattttctg catttatttgaaaaaaacatgattccagagaggactgtagagttccaacatatcatcccc atatctgcagttactggagaaggaatcgaagaattaaagaattgtataagaaagtcactg gatgaacaggccaaccaggaaaatgatgcacttcataagaaacagttgcttaatttgtgg atttctgatacaatgtcttctactgagccaccatcaaagcatgctgttactacttccaaa atggatataatttaa >gi568815591f:90312677_90513408|GENSCAN_predicted_peptide_8|262_aa MLPAAGGPVDSGNCSPSHPGHCRSGPRENLTALPEASGVPRWERTSWDAERETLQEAPQC GRVKCPACEKQYSTSLPAMGCRDVHAATVLSFLCGIASVAGLFAGTLLPNWRKLRLITFN RNEKNLTVYTGLWVKCARYDGSSDCLMYDTTWYSSVDQLDLRVLQFALPLSMLIAMGALL LCLIGMCNTAFRSSVPNIKLAKCLVNSAGCHLVAGLLFFLAELKGSSGVLEPSGTGPQEP VCTSPPNCPTCDIILVAWNWPG >gi568815591f:90312677_90513408|GENSCAN_predicted_CDS_8|789_bp atgttgcctgcggctggcggcccagtggattctgggaattgtagtcccagccatccaggg cattgccgttcagggccacgggaaaacctgactgcgctcccagaagcctccggtgtacct cgctgggaacgcacttcctgggacgctgagagggagacgctccaagaggctcctcagtgt gggcgagtaaaatgccctgcgtgtgagaagcagtactccacaagcttgcctgccatgggc tgtcgggatgtccacgcagccacagtcctttccttcctgtgtggaatcgcctcagtagca ggcctctttgcagggactctgcttcccaactggagaaaattacgattgatcacattcaac agaaacgagaagaacctgactgtttacacaggcctgtgggtgaaatgtgcccggtatgac gggagcagtgactgcctgatgtacgacactacttggtactcatcagttgaccagctggac ctgcgtgtcctccagtttgccctacccctcagcatgctgatcgccatgggtgccctgctg ctctgcctgattggaatgtgcaacactgccttcaggtcctcggtgcccaacatcaaactg gccaagtgtctggtcaatagtgcaggttgccacctggtggctgggctgctatttttcctg gcagagctcaaaggtagcagtggagtgctggagccaagtggtactggcccacaagaacca gtgtgtacatctcctcccaactgcccaacctgtgacatcatattggttgcttggaattgg ccagggtga >gi568815591f:90312677_90513408|GENSCAN_predicted_peptide_9|152_aa MPVIAPEELPAGQDVEVGGSDADDFDPVYAWANVLSGEIQPDIRRNSPLIFHGLLQPIKV VLTVWFGFGKLVLQYHSTFRSNKQGISILAATAIRSHENPGKSDSRGNSTLTQDELRNLW SPLKNENAGSLVQKLRISRHPPQNIKTRAGPF >gi568815591f:90312677_90513408|GENSCAN_predicted_CDS_9|459_bp atgcctgttattgcccctgaagaacttccagcgggacaagatgtagaagttggaggcagt gatgctgatgatttcgaccctgtgtatgcctgggctaatgtcttatcgggggaaattcag ccagatatcaggcgaaattcacccctgatatttcacggactacttcagcccatcaaggta gtacttacagtttggtttgggtttggtaaattggtgctccaatatcactctacctttaga agtaataaacaaggaatctctattttggctgcaactgcaataagaagccatgagaatcct ggcaaaagcgatagccgtggcaatagcaccctcacacaggatgagctacgtaatttgtgg agcccactgaaaaatgaaaatgcaggatcccttgttcaaaaattacgaatttcaagacac ccaccacagaatattaaaacaagagcagggcccttctga >gi568815591f:90312677_90513408|GENSCAN_predicted_peptide_10|122_aa MVNWKPSEKDIKEDGPACPQLDLTLVIGLAAKRRGRGSPERTPIKIPLLCFPAERSAPRC KFLDSWRICRVNIRSIHPFTKAIRNVLHSQQQERQYGSTKSPAPKSHGAMQCIQIVLMNA AR >gi568815591f:90312677_90513408|GENSCAN_predicted_CDS_10|369_bp atggtgaactggaagccaagtgaaaaagatatcaaggaagatggcccagcatgtccccag ctggatttaaccctagttattggcctagcagccaagagaagaggtcggggttctcctgag aggacacctattaagatacccttattgtgttttccagcggaaaggtcagccccccgctgc aagttcttagattcctggaggatctgcagagtcaacattaggagtatccatccattcacc aaggccatcaggaatgtgctgcacagccaacagcaggaacgtcagtatggcagcaccaaa tctcctgcccccaagtcccatggagcaatgcaatgtatccagattgtgcttatgaatgca gccagatga >gi568815591f:90312677_90513408|GENSCAN_predicted_peptide_11|159_aa MESRYIVRASLELLASGKKCGPRRALLPPILKNNCLHSGSLRVSLTDYIANLKECVELYS NTIKSNAHLSEISDLELGAVADSDLGCIHADVDEICSSSNQLVMHSKRKYKPKDQSELRN VGPVPLKIRGSESILEIPMGKLSERKGKVNGKDSAASAM >gi568815591f:90312677_90513408|GENSCAN_predicted_CDS_11|480_bp atggaatctcgctatattgtccgggctagtctcgaactcctggcctcaggaaagaagtgt ggaccaagaagggcactattaccaccaatcctgaaaaataactgtcttcacagtgggtct ttgagagttagtttaactgattatatagctaacttgaaggagtgtgtggagctgtattca aacaccatcaagagtaatgcacatctctcagagatttcagacttagagctgggggcagta gctgatagtgaccttggctgtattcatgcagatgttgatgaaatatgctcaagcagcaat cagctagtaatgcacagcaaacgaaaatataaaccaaaggaccaatcagagttgcggaat gtgggtccagtgccgttaaaaataagagggtccgaaagtatccttgaaatacccatggga aagttgagtgaaagaaaaggcaaggtgaatggtaaagattctgctgcctcagctatgtga