GENSCAN 1.0 Date run: 8-Nov-116 Time: 00:28:18 Sequence gi568815588f:114838254_115075376 : 237123 bp : 39.98% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 4680 4973 294 2 0 70 76 189 0.779 12.18 1.02 Intr + 5488 5684 197 0 2 58 86 71 0.863 1.19 1.03 Intr + 7114 7228 115 1 1 93 76 83 0.989 7.13 1.04 Intr + 7760 7836 77 1 2 60 92 70 0.998 1.99 1.05 Intr + 7922 8114 193 2 1 66 99 157 0.998 13.17 1.06 Intr + 8306 8475 170 1 2 63 84 105 0.999 5.42 1.07 Intr + 8837 8980 144 2 0 5 95 130 0.936 3.88 1.08 Intr + 10394 10484 91 2 1 36 96 85 0.974 3.18 1.09 Intr + 16944 17087 144 2 0 124 90 0 0.851 3.36 1.10 Intr + 22978 23081 104 0 2 77 92 155 0.798 12.85 1.11 Intr + 23182 23231 50 1 2 86 56 49 0.231 -1.09 1.12 Intr + 29921 29998 78 0 0 111 105 99 0.352 12.50 1.13 Intr + 36854 36986 133 0 1 33 5 143 0.346 -0.72 1.14 Intr + 37976 38246 271 2 1 99 53 138 0.762 8.02 1.15 Term + 38785 39141 357 0 0 50 43 254 0.594 10.63 1.16 PlyA + 41185 41190 6 1.05 2.00 Prom + 46252 46291 40 -4.75 2.01 Init + 47617 47619 3 0 0 109 101 0 0.701 3.45 2.02 Term + 61237 61368 132 0 0 123 46 91 0.618 5.51 2.03 PlyA + 61546 61551 6 1.05 3.03 PlyA - 61769 61764 6 -0.45 3.02 Term - 63118 62813 306 1 0 31 43 275 0.419 11.43 3.01 Init - 77700 77482 219 0 0 35 91 131 0.290 6.78 3.00 Prom - 93427 93388 40 -3.65 4.00 Prom + 95615 95654 40 -7.15 4.01 Init + 99859 99913 55 0 1 82 60 58 0.621 3.90 4.02 Intr + 99984 100286 303 1 0 31 84 394 0.472 28.94 4.03 Intr + 104392 104490 99 2 0 100 86 91 0.852 9.26 4.04 Intr + 112841 112896 56 0 2 69 116 26 0.322 1.28 4.05 Intr + 121473 121554 82 2 1 116 72 87 0.946 8.19 4.06 Intr + 132115 132187 73 2 1 56 95 70 0.571 2.05 4.07 Intr + 136038 136132 95 0 2 4 121 97 0.436 3.29 4.08 Term + 136870 137126 257 2 2 78 50 233 0.990 13.26 4.09 PlyA + 139399 139404 6 1.05 5.00 Prom + 148692 148731 40 -2.35 5.01 Init + 165573 165816 244 2 1 55 28 212 0.238 9.84 5.02 Intr + 180306 180570 265 1 1 56 80 137 0.011 5.35 5.03 Intr + 187208 187402 195 2 0 66 38 101 0.023 0.51 5.04 Intr + 198088 198137 50 1 2 88 100 61 0.079 4.71 5.05 Term + 200927 201549 623 0 2 85 32 238 0.236 11.49 5.06 PlyA + 201847 201852 6 -0.45 6.05 PlyA - 203155 203150 6 1.05 6.04 Term - 204335 204145 191 0 2 41 49 133 0.121 1.33 6.03 Intr - 204587 204446 142 2 1 77 101 141 0.093 13.31 6.02 Intr - 209747 209685 63 2 0 27 116 60 0.057 0.70 6.01 Init - 230368 230288 81 1 0 70 84 63 0.332 5.12 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588f:114838254_115075376|GENSCAN_predicted_peptide_1|805_aa NKMKSLASKGVPNVISEDTLKGQDSLSTDTGQSRQPEELSGATGMEQTELEDEPPHQMDH LSTSLDNLSVTSLPEASVVCPNQDYNLVNSLLNLTRSPDGRIAVKACEGLMLLVSLPEPA AAKCLTQSTCLCELLTDRLASLYKALPQSVDPLDIETVEAINWGLDSYSHKEDASAFPGK RALISFLSWFDYCDQLIKEAQKTAAVALAKAVHERFFIGVMEPQLMQTSEMGILTSTALL HRIVRQVTSDVLLQEMVFFILGEQREPETLAEISRHPLRHRLIEHCDHISDEISIMTLRM FEHLLQKPNEHILYNLVLRNLEERNYTEYKPLCPEDKDVVENGLIAGAVDLEEDPLFTDI SPENTLPNQEWLSSSPPATPDHPKNDGKTEVHKIVNSFLCLVPDDAKSSYHVEGTGYDTY LRDAHRQFRDYCAICLRWEWPGSPKALEKCNLEAAFFEGHFLKVLFDRMGRILDQVVGDL MLRIQRIQDFTPKLLLVRKRLLGLEPEGPIIDHITLLEGVIVLEEFFQGSPNPRATDQYQ SVAGWELGCIAGGTAGRSAALDGLSSEGRKAAGGGSAVLQKDCCAAEGLLWILTADGENR RSPRSAVDQACRSTSSLVLLRPLALSLLMQDLACSCDSSPSSPATGYCSPDSITAQLWCF RHPTSPPRASVSLSVKWGWYQLPQRVVLRTRLKFCSVYASKRVADTSGQGQTYNELKGAF TGWNGLRAPLFLEEQSMGSKGVPRPLNSSRVLSLEGDVSKQSLSPLGTAPPEGSGQARVS KFIQLVGQLEPMQLLQKSRKIWIRY >gi568815588f:114838254_115075376|GENSCAN_predicted_CDS_1|2418_bp aataagatgaaatcattggcttccaaaggagtaccaaatgtaatttcagaagatacatta aaaggtcaggattccttgtcaacagatacaggacagtcccgtcaaccagaggaactatct ggtgctactggaatggagcaaacagaattggaagatgagcctcctcatcagatggatcac ctgtccacaagcttggataacctcagtgtcacctcactgccagaggcctcggttgtttgt ccaaatcaggattacaatttagtgaattctttgttaaatcttactagaagtcctgatggc agaatagctgtgaaggcatgtgaaggcttgatgctgttagtaagtttgccagagcctgcg gctgcaaagtgccttacacagagcacttgcttgtgtgaactactgacagacagacttgcc tccctgtacaaggccctacctcagtcagtggatccgttagatattgaaaccgtggaagca attaactggggcttggactcatatagtcataaagaagatgcttcagcatttccaggaaaa cgagccttaatttcatttctttcctggtttgattattgtgatcagctcattaaggaagcc caaaagactgctgctgttgctcttgccaaagctgttcatgaaagatttttcattggtgtt atggaacctcaattaatgcaaacttctgagatgggtattctcacatccactgctctgctt catcgcatcgttcggcaagtgacctctgatgttttgcttcaagaaatggtgttttttatc cttggagaacagagggaaccagaaactctggcagaaatcagcagacatcctttaaggcat aggttaattgaacattgtgatcacatatctgatgagataagcataatgacattacgaatg tttgaacatcttttacaaaaacccaatgagcacattctttacaacttggtcttgagaaat cttgaagaaagaaattatacagaatataaacctttgtgcccagaagataaagatgtggta gaaaatggattgatagcaggagcagtagatctggaagaagatccattatttactgacatt tcaccagaaaacactttgccaaaccaagagtggcttagttcttcacctcctgctactcca gaccaccccaaaaatgatggaaaaactgaagttcataaaattgtaaatagttttctctgt ctggtaccggatgacgcaaaatcctcctaccatgttgagggcacaggatatgacacttac ctccgagacgctcataggcagttccgagactactgtgctatctgcttaagatgggagtgg cctgggtctccaaaagcattggaaaagtgcaatttagaagctgctttctttgaaggtcat tttttgaaagtgctgttcgacagaatgggaagaattcttgatcaggttgttggagacctt atgcttcgaatccagcgtattcaagactttactcccaagcttctgttagtcagaaagcgg ttacttggtttggaacctgaaggccctattattgaccacatcacactgctagagggtgtg attgtgttagaagagttctttcaggggtcccccaacccccgggccacagaccagtaccag tccgtggccggttgggaactgggctgcatagcaggagggactgcaggccgctctgcagcc ctggatgggctcagttccgagggaaggaaggcggccggcggaggcagtgctgtgctgcag aaggactgctgtgctgcggaaggactgctgtggattctgactgcagacggggagaacaga aggtccccaaggagcgctgtggaccaggcatgcaggtccacgtcatctctggtcctgctc cgacctttggcactgagcctcctaatgcaggatttggcttgttcctgtgacagctccccc tcatcgcccgcaacagggtactgcagtccagactccatcactgctcagctctggtgcttt aggcaccccacatcacctccccgagcctcagtttccttatctgtaaaatggggatggtac caactaccacagagggttgttttaagaacacgtctgaagttttgttctgtttatgcctcc aagcgtgttgccgacacatccgggcaaggacaaacgtacaacgaactcaagggagctttc acaggctggaacgggctcagagcacccctgttcttagaggaacaatcaatgggaagcaaa ggtgtacccaggcctctcaacagttcccgagtgctgtcactggagggcgatgtctccaaa cagtctctttccccactgggaacggcgccgcccgagggttcaggtcaagcacgagtttcc aaattcattcaattagtagggcaactggaacccatgcagctccttcagaaatcccgcaaa atatggatacgctattaa >gi568815588f:114838254_115075376|GENSCAN_predicted_peptide_2|44_aa MADVAFMVVRGIPESLMSSKLQAQPKESRWFPQKILGLGSTAHP >gi568815588f:114838254_115075376|GENSCAN_predicted_CDS_2|135_bp atggcagacgttgccttcatggtggtaagagggattccagagtctcttatgtcctcaaag cttcaagctcagccaaaagaatctcgatggtttccacaaaaaatcttgggccttggatca actgcccacccttga >gi568815588f:114838254_115075376|GENSCAN_predicted_peptide_3|174_aa MLLNTLQCIGQPPEQNYPGQNTGTVTAEKLCTSQTVLSEGRIRNFNLLITAQSVISGFKV IFKPGLLIHQIKQEGILQTLLISAVTVELESIVSSPRCTARTTNSVVELMALSKATALLA CRCQPTHLPVLVDWFGDPLGVRISSDGFMEWINEDNLRKFVCGIFTNPVRIQDS >gi568815588f:114838254_115075376|GENSCAN_predicted_CDS_3|525_bp atgctgctaaacaccttgcaatgcataggacagccccctgaacagaattacccaggccaa aacaccggtactgttactgctgagaaactgtgcactagtcaaacagtactcagtgaaggg agaattaggaactttaacctgctaataacagcccaatccgttatttcgggcttcaaagta atatttaaacctggcctgttgattcatcaaattaagcaggaaggaattttacaaacttta cttatatcagcagtaacggtggagctggagagtattgtgtcttctccacgctgcacggcg agaaccaccaacagtgtggtggaacttatggccctttccaaggccacggctcttttggcc tgcagatgtcagcccacacatctccctgtgcttgtggattggtttggtgatccactgggt gtcaggatttcttccgatggctttatggaatggatcaatgaggataacctcagaaaattt gtatgtggaatcttcaccaacccagtaagaattcaggactcttag >gi568815588f:114838254_115075376|GENSCAN_predicted_peptide_4|339_aa MDYNSHKSVRPLRSAQLRGLGYKSMAASEAAVVSSPSLKTDTSPVLETAGTVAAMAATPS ARAAAAVVAAAARTGSEARVSKAALATKLLSLSGVFAVHKPKGPTSAELLNRLKEKLLAE AGMPSPEWTKRKKQTLKIGHGGTLDSAARGVLVVGIGSGTKMLTSMLSGSKRYTAIGELG KATDTLDSTGRVTEEKPYDKITQEDIEGILQKFTGNIMQVPPLRLAIYYVTVNIPDVECG GGFYIRSLVSDIGKELSSCANVLELTRTKQGPFTLEEHALPEDKWTIDDIAQSLEHCSSL FPAELALKKSKPESNEQVLSCEYITLNEPKREDDVIKTC >gi568815588f:114838254_115075376|GENSCAN_predicted_CDS_4|1020_bp atggactacaattcccacaagtccgtgcgacctctccgttctgcgcagcttcgcggtctg ggctacaaaagtatggccgcttctgaggcggcggtggtgtcttcgccgtctttgaaaaca gacacatcccctgtccttgaaactgcaggaacggtcgcagcaatggctgcgaccccgtca gcaagggctgcagccgcggtggttgcggccgcggccaggaccggatccgaagccagggtc tccaaggccgctttggctaccaagctgctgtccttgagcggcgtgttcgccgtgcacaag cccaaagggcccacttcagccgagctgctgaatcggttgaaggagaagctgctggcagaa gctggaatgccttctccagaatggaccaagaggaaaaagcagactttgaaaattgggcat ggagggactctagacagcgcagcccgaggagttctggttgttggaattggaagcggaaca aaaatgttgaccagtatgttgtcagggtccaagagatatactgccattggagaactgggg aaagctactgatacactagattctacggggagggtaacagaagaaaaaccttacgataaa ataacacaagaagatattgaaggcattctacagaaatttactggaaatataatgcaagtg ccccccctgaggttagcaatttactatgtcactgttaacattccagatgttgaatgtgga ggaggtttttatatcagaagcttggtcagtgacattggaaaagaactatcttcctgtgcc aatgtgctagagctgacccgaaccaaacagggaccatttacgctagaagaacatgccctt cctgaagacaaatggacaattgatgacattgcacagtctcttgagcattgctcatctctt ttcccagcagagttggcacttaaaaaatcaaaacctgagtctaatgaacaggttttgagc tgtgaatatataactctaaatgagccaaagagagaagatgatgtaattaagacgtgttga >gi568815588f:114838254_115075376|GENSCAN_predicted_peptide_5|458_aa MEELCLMKLHPRGGGGGCSCKNSELPYPDQGPGRAFWRKQPLNCQLIEELYLNSSGSPGL LFHVHLLALKFEYLQGPESLPAWWPQGSGLCIVAEGFAAEYSSKQSGSCIASYDPTLEST QHHFRVFYGLKSHKNQPVSRRRNRDHISQLEKGQSHIVIRAYGIRSIAVSQFLRVTIEER VEMEFRSRSHRSECDIVLKVSTCQCGVSHAVVNHHLSPIMEDLLWASCHMHYKDVVMTAF PEAKLLVQGKAKGNKQRNLQRTTKTPGLSVMSPSSCSGGGEFGPTQVHVPFSLSDLKQIK ADQGKLSDDPDRYTDVLQGLGQTFNLTWRDVMLLLDQTLAFNLKNVALATAREFGDTWYL SQVNDRMTAGERDKVSPGQQAIPSVDPHWDLDSDHWDWSRKHLLTCVLERLRRIRKEPMN YSMMSTITQEKEESLAFLEWLQEALRKYTPLSPNSLKG >gi568815588f:114838254_115075376|GENSCAN_predicted_CDS_5|1377_bp atggaagagctgtgcctaatgaaacttcatccacgtggaggtggaggaggctgcagctgc aagaactcagagctgccttacccagaccagggaccagggagggctttctggaggaaacag cctctgaactgccagctgatagaggagctctacctcaactcttctggttccccagggctg cttttccacgtccatttattggcactgaagtttgaataccttcaggggcccgaaagcctg ccagcatggtggcctcagggcagtggactctgcatagtggctgaaggcttcgcagctgag tattccagcaagcaaagtgggagctgtattgcctcatatgacccaaccttggaatccaca cagcatcacttccgtgtattctacgggttgaaaagtcacaaaaaccaaccagtttcaagg agaaggaacagagatcacatttctcaattggagaagggtcaaagtcacattgtaatcaga gcctatgggatacgaagtattgcggtcagccagtttctgagggtgacaatagaggaaagg gtggagatggagttcaggtccagaagccatagaagcgagtgtgacattgtgctcaaggtc agcacatgtcagtgtggggtgtcacatgctgttgtgaaccatcatttatcaccaattatg gaagacctcctatgggcatcttgccatatgcattataaagatgtagtgatgactgccttc cctgaagcaaagcttctggttcaaggaaaggccaaaggaaacaagcaaagaaatctccaa aggaccacaaaaacccctgggctatcggttatgtccccttcaagctgtagcgggggaggg gaatttggcccaacccaggtacatgtccccttctccctctctgatttaaagcagatcaag gcagaccaggggaagctttcagatgatcctgataggtatacagatgtcctacagggtcta gggcaaaccttcaatctcacttggagagatgtcatgctattgttagatcaaaccctggcc tttaatttaaagaatgtggctttagccacagcccgagagtttggagatacctggtatctt agtcaagtaaatgatagaatgacagctggggaaagggacaaagtctctcccggtcagcaa gccatccctagtgtggatccccactgggacctagactcagatcattgggactggagtcgc aaacatctgttgacctgtgttctagaaagactaaggagaattaggaaagagcctatgaat tattcaatgatgtccaccataactcaggaaaaggaagaaagtcttgccttccttgagtgg ctacaggaggccttaagaaaatatactcccctgtcacccaactcactcaagggttaa >gi568815588f:114838254_115075376|GENSCAN_predicted_peptide_6|158_aa MHLHTTTTATTTTNNNNNNNNNSIEWQLADKSRFLSLGQNLDCGKLTKAAMETSMTGVTL KATDQKYRFPTVSGICSFRWVLGLADFKNEATDPCGVKLQSFAVSVTALKGSTSGVLCSF QWVRGLLTSGMKPQTLAVSVTAHKGSVDTKNEQQQDLL >gi568815588f:114838254_115075376|GENSCAN_predicted_CDS_6|477_bp atgcatcttcatacaacaactactgctactactactaccaataataataacaacaacaac aataatagcattgagtggcagcttgctgacaaatccaggtttctaagtctaggtcagaac ctggactgtggtaaattaacaaaggctgcaatggagacttcaatgactggagtaaccctg aaagccacagatcaaaaatacagatttcctactgtgtctggaatttgttccttccggtgg gttcttggtcttgctgacttcaagaatgaagccacagacccttgcggagtgaagctgcag agcttcgcagtgagtgttacagctcttaaaggcagcacatcgggagttctttgttccttc cagtgggttcgtggtctgttgacttcaggaatgaagccacagaccctcgcagtgagtgtt acagctcataaaggtagtgtggacacaaagaatgagcagcagcaagatttattgtga