GENSCAN 1.0    Date run: 7-Nov-116    Time: 21:38:16

Sequence gi568815594r:56710142_56921662 : 211521 bp : 44.76% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Sngl +    198    473  276  2  0   30   32   226 0.772   6.58
 1.02 PlyA +    486    491    6                               1.05

 2.00 Prom +   1533   1572   40                              -5.66
 2.01 Init +   2111   2240  130  1  1   75   89    18 0.236  -1.08
 2.02 Term +   5053   6242 1190  2  2   18   41   393 0.505  19.51
 2.03 PlyA +   6329   6334    6                               1.05

 3.00 Prom +   6697   6736   40                              -6.56
 3.01 Sngl +   7039   7914  876  0  0   86   50   289 0.521  20.88
 3.02 PlyA +   7960   7965    6                              -0.45

 4.00 Prom +   8145   8184   40                              -2.46
 4.01 Init +   8269   8401  133  0  1   78   47    79 0.217   3.10
 4.02 Term +  28789  28928  140  2  2  -37   42   347 0.817  15.73
 4.03 PlyA +  30050  30055    6                               1.05

 5.05 PlyA -  31601  31596    6                               1.05
 5.04 Term -  49346  49274   73  1  1   96   39    59 0.723  -0.82
 5.03 Intr -  51348  50782  567  2  0   50   93   242 0.117  12.79
 5.02 Intr -  64791  64656  136  1  1   66   26   142 0.286   5.43
 5.01 Init -  65969  65900   70  2  1   77   73    33 0.667   1.91
 5.00 Prom -  75903  75864   40                              -1.26

 6.05 PlyA -  76020  76015    6                               1.05
 6.04 Term -  76830  76824    7  2  1  131   45     0 0.026  -2.66
 6.03 Intr - 101653 101544  110  1  2  100   92    27 0.343   3.38
 6.02 Intr - 110438 110395   44  0  2   99  127   -21 0.727   0.86
 6.01 Init - 111521 111317  205  2  1   80   49   256 0.896  18.30
 6.00 Prom - 129601 129562   40                              -2.16

 7.00 Prom + 171586 171625   40                              -4.96
 7.01 Init + 175175 175320  146  1  2   88   86    84 0.660   7.79
 7.02 Term + 177915 179094 1180  0  1    3   42   478 0.871  25.96
 7.03 PlyA + 179355 179360    6                               1.05

 8.00 Prom + 180726 180765   40                              -2.46
 8.01 Init + 196796 196852   57  1  0   88   64    29 0.532   1.81
 8.02 Intr + 199019 199075   57  1  0   81   62    68 0.612   2.58
 8.03 Intr + 200489 201395  907  1  1   67  110   578 0.576  48.76

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815594r:56710142_56921662|GENSCAN_predicted_peptide_1|91_aa
MLKNAESNAELKGLDVDSLVIEYNRVNKAPKMRPWTYRAHGWINPYMSSPCHTEKILTEN
EQVVPRPEKEATQKKKISQKKLKKQKLMAQE

>gi568815594r:56710142_56921662|GENSCAN_predicted_CDS_1|276_bp
atgcttaaaaatgcagagagtaatgctgaactcaagggtttagatgtagattctctggtc
attgagtataaccgagtaaacaaagcacctaagatgcgcccctggacctacagagctcat
ggttggattaacccatacatgagctccccctgccacacagagaagatccttactgaaaat
gaacaggttgttcctagaccagaaaaggaggctacccagaagaaaaagatatcccagaag
aaactgaagaaacaaaaacttatggcacaggagtaa

>gi568815594r:56710142_56921662|GENSCAN_predicted_peptide_2|439_aa
MPGCAHGQTLCLLTHPLVLCTWLTLGRHGIQAGSTSQRQPSGPNRSTRQKVNKDIQELNS
ALHQADLIDIYRTLHSKSTEYTFFSAPHHTYSKIDHIVGSKALLSKCKRTEIITNCLSDN
SAIKLELRIEKLTQNLSTTWKLNNLILNDYWVNNKMKAEIKIFFETSENKDTTYKNLWDT
VKAVCRGKFIALTAHKRKQERSKTDTLTSQLKELEKQEQTHSKASRRQEITKIGAELKET
ETQKSLQKINESRSWLFEKINKIDRPLARLIKKKREKNQIDTIKNDKGDITTDPTEIQTT
IREYYKHLYANKLENLEEMDKFLDTYTLPRLNQEEVESLNRPITDSAIEAIINSLPNKKS
PGPDRFSAEFYQRYKEELVPFLLKLFQSIEKEGILPNSFYEASIILTPKPGRDTTKKENF
RPISLMNINAKILNKILAN

>gi568815594r:56710142_56921662|GENSCAN_predicted_CDS_2|1320_bp
atgcctggctgtgcacatggccagaccctgtgcttgctcacacaccccttggtgctctgc
acctggctcacccttggcaggcatgggatccaggctggtagcacaagccaaaggcagcct
tctgggccaaacagatcaacgagacagaaagttaacaaggatatccaggaattgaactca
gctctgcaccaagcagacctaatagacatctacagaactctccactccaaatcaacagaa
tatacattcttctcagcaccacatcacacttattccaaaattgaccacatagttggaagt
aaagcactcctcagcaaatgtaaaagaacagaaattataacaaactgtctctcagacaac
agtgcaatcaaactagaactcaggattgagaaactcactcaaaatctctcaactacatgg
aaactgaacaacctgatcctgaatgactactgggtaaataacaaaatgaaggcagaaata
aagatattctttgaaaccagtgagaacaaagacacaacatacaagaatctctgggacaca
gttaaagcagtgtgtagagggaaatttatagcactaactgcccacaagagaaagcaggaa
agatctaaaactgataccctaacatcacaattaaaagaactagagaagcaagagcaaaca
cattcaaaagctagcagaaggcaagaaataactaagatcggagcagaactgaaggagaca
gagacacaaaaatcccttcaaaaaatcaatgaatccaggagctggctttttgaaaagatc
aacaaaattgatagaccgctagcaagactaataaagaagaaaagagagaagaatcaaata
gacacaataaaaaatgataaaggggatatcaccactgatcccacggaaatacaaactacc
atcagagaatactataaacacctctatgcaaataaactagaaaatctggaagaaatggat
aaattcctggacacatacaccctcccaagactaaaccaggaagaagttgaatccctgaat
agacctataacagactctgcaattgaggcaataattaatagcctaccaaacaaaaaaagt
ccaggaccagacagattctcagccgaattctaccagaggtacaaggaggagctggtacca
ttccttctgaaactattccaatcaatagaaaaagagggaatcctccctaactcattttat
gaggccagcatcatcctgacaccaaagcctggcagagacacaacaaaaaaagagaatttt
agaccaatatccctgatgaacatcaatgcaaaaatcctcaataaaatactggcaaactga

>gi568815594r:56710142_56921662|GENSCAN_predicted_peptide_3|291_aa
MPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRALIAKTILSQKNKAGGITLPDFKL
CYKATVTKTAWYWYQNRDIDQWNRTEPSEIIPHIYNHLIFDKPDKNKKWGKDSLFNKWCW
ENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGNTIQDTGMGKDFMSK
TPKAMATQAKIDKWDLIKLKNFCTAKETTITVNRQPTEWEKIFAIYPSDKGLNIQNLQRT
QTNLQEKNNPIKKWAKDMNRHFSKEDIYTVNRHMKNAHHHWPSEKCKSKPQ

>gi568815594r:56710142_56921662|GENSCAN_predicted_CDS_3|876_bp
atgcccaaggtaatttatagattcaatgccatccccatcaagctaccaatgactttcttc
acagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagccctcattgcc
aagacaatcctaagccaaaagaacaaagctggaggcatcacactacctgacttcaaacta
tgctacaaggctacagtaaccaaaacagcatggtactggtaccaaaacagagatatagac
caatggaacagaacagagccctcagaaataataccacacatctacaaccatctgatcttt
gacaaacctgacaaaaacaagaaatgggggaaggattccctatttaataaatggtgctgg
gaaaactggctagccatatgtagaaagctgaaactggatcccttccttacaccttataca
aaaattaattcaagatggattaaagacttaaatgttcgacctaaaaccataaaaacccta
gaagaaaacctaggcaataccattcaggacacaggcatgggcaaggacttcatgtctaaa
acaccaaaagcaatggcaacacaagccaaaattgacaaatgggatctaattaaactaaag
aacttctgcacagcaaaagaaactaccatcacagtgaacaggcaacctacagaatgggag
aaaatttttgcaatctacccatctgacaaagggcttaatatccagaatctacaaagaact
caaacaaatttacaagaaaaaaacaaccccatcaaaaagtgggcaaaggatatgaacaga
cacttctcaaaagaagacatttatacagtcaacagacacatgaaaaatgctcatcatcac
tggccatcagagaaatgcaaatcaaaaccacaatga

>gi568815594r:56710142_56921662|GENSCAN_predicted_peptide_4|90_aa
MEYYAAIQKDEFMSFVGTWMKLEAIILSKLSQGQKIKHRMFSLIEGRRKKKEEGEGGGGG
EEEEEDEEEEEEEEEEEEEEEEEEEEEEEI

>gi568815594r:56710142_56921662|GENSCAN_predicted_CDS_4|273_bp
atggaatactatgcagccatacaaaaggatgagttcatgtcctttgtagggacatggatg
aagctggaagccatcattctcagcaaactatcgcaaggacagaaaatcaaacaccgcatg
ttctcactcatagaaggaagaagaaagaagaaagaagaaggagaaggaggaggtggagga
gaagaagaggaagaagatgaagaagaagaagaggaggaggaggaggaggaagaagaagaa
gaagaggaagaggaagaagaagaagaaatttaa

>gi568815594r:56710142_56921662|GENSCAN_predicted_peptide_5|281_aa
MEQETDHLVKKIPGSTQELERNIYLVSCEPEVEAHYEDQDDASNDNEKQLTGRRQFDSNI
RYPVATALGRMEADASVDMFSKVLENQLLQTTKLVEEHLDSEIQKLDQMDEDELEHIKEK
RLEALRKAQQQKQEWLSKGHGEYREIPSERDFFQEVKESKKVVCHFYRDSTFRCKILDRF
LAILSEKHLETKFLKLNVEKAPFLCERLHIKVIPILALVKDGKTQDYVVGFTDLGNTDDF
TTETLEWGLSCSDILNYRPYRVLADIAMAFVNCQCAGGSVF

>gi568815594r:56710142_56921662|GENSCAN_predicted_CDS_5|846_bp
atggagcaagaaactgaccacctcgtgaaaaaaattccgggaagtactcaggaactagaa
aggaacatatacttggtgagctgtgagccagaagtagaagcccattatgaagaccaagat
gatgcatctaatgataatgagaagcagctaactggcagaagacaatttgacagcaacata
cgatatcccgttgctactgcacttggaagaatggaagctgatgcatctgttgacatgttt
tccaaagtcctggagaatcagctgcttcagactaccaaactggtggaagaacatttggat
tctgaaattcaaaaactggatcagatggatgaggatgaattggaacacattaaagaaaag
agactcgaggcactaaggaaagctcaacagcagaaacaagaatggctttccaaaggacat
ggggaatacagagaaatccctagtgaaagagacttttttcaagaagtcaaggagagtaaa
aaagtggtttgccatttctacagagactccacattcaggtgtaaaatactagacagattt
ctggcgatattgtccgagaaacacctcgagaccaaatttttgaagctgaatgtggaaaaa
gcacctttcctttgtgagagactgcatatcaaagtcattcccatactagcactggtaaaa
gatgggaaaacacaagattatgttgttgggtttactgacctaggaaatacagatgacttc
accacagaaactttagaatgggggctcagttgttctgacattcttaattacagaccatat
agggtacttgctgacattgccatggcatttgtaaactgtcagtgcgctggtggcagtgtc
ttttag

>gi568815594r:56710142_56921662|GENSCAN_predicted_peptide_6|121_aa
MALSVLRLALLLLAVTFAGSARSGPGERGPPEKSGFGSQTGGGPCPAPGGLGDGTRAPVT
GGSPEDLPASLIPQFGLFSKYRTPNCSQYRLPGCPRHFNPVCGSDMSTYANECTLCMKIR
T

>gi568815594r:56710142_56921662|GENSCAN_predicted_CDS_6|366_bp
atggcgctgtcggtgctgcgcttggcgctgctgctcctggcagttaccttcgcaggtagc
gctcggagcggtcctggcgagcggggacctccggagaaaagcgggtttgggagtcagacc
ggcggcggaccctgccctgctccgggcggcctcggcgacggtacccgcgcgcccgttact
ggcggttccccagaggacctgccagcctctctgatccctcaatttggtctgttttcaaaa
tatagaacgccaaactgctctcagtatagattaccaggatgtcccagacactttaaccct
gtgtgtggcagtgacatgtccacttatgccaatgaatgtactctgtgcatgaaaatcagg
acataa

>gi568815594r:56710142_56921662|GENSCAN_predicted_peptide_7|441_aa
MGRNQSRKAENSKNQSASFLPKDCSSSPAMEESWTENDFDELTEVGFRRPITGSEIEAII
NTLQTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQTIEKEGLVPNSFYEASIILIPKPVR
DTTEKDNFRPISLMNIDAKILNKILANQIQQHIKKLIQHDQVCFIPGMQGLFNICKSINV
IHDINRTNDKNHMIILIYAEKAFDKIQQPFMLKTLNKLSIDGSYLKIIRAIYDKPAANII
LNGQKLEAFPSKTDTRQGCPFSPLLFNIVLEVLARAIRQEKEIKRIQLGKEEVKLSLFAD
DMVVYLENPIVSAQNLLKLISNFSKVSGYISVQKSQAFLYTNNRQTESQIMSELPFTIAT
KRIKYLGIQLTRDVKDLFRENYKPLLNEIKEDTNKWKNIPCSWIGRINIMKMAILPKVFC
RFNAIPIKLPMTFFTDWKKLL

>gi568815594r:56710142_56921662|GENSCAN_predicted_CDS_7|1326_bp
atggggagaaaccagagcagaaaagctgaaaattctaaaaaccagagcgcctcttttctt
ccaaaggattgcagctcctcgccagcaatggaagaaagctggacagagaatgactttgac
gagttgacagaagtaggcttcagaagaccaataacaggctctgaaattgaggcaataatt
aatactctacaaaccaaaaaaagtccaggaccagatggattcacagccgaattctaccag
aggtacaaggaggagctggtaccattccttctgaaactattccaaacaatagaaaaagag
ggactcgtccctaactcattttatgaggccagcatcatcctgataccaaagccggtcaga
gacacaacagaaaaagataattttagaccaatatctctgatgaacattgatgcgaaaatc
ctcaacaaaatactggcaaaccaaatccaacagcatatcaaaaagcttatccagcacgat
caagtctgcttcatccctgggatgcaaggcttgttcaacatatgcaaatcaataaacgta
atccatgacataaacagaaccaacgacaaaaatcacatgattatcttaatatatgcagaa
aaggcctttgacaaaattcaacagcccttcatgctaaaaactctcaataaactaagcatt
gatggaagttatctcaaaataataagagctatttatgacaaacctgcagccaatatcata
ctgaatgggcaaaaactggaagcattcccttcaaaaactgacacaagacaaggatgcccc
ttctcaccactcctattcaacatagtgttggaagttctggccagggcaatcaggcaagag
aaagaaataaagcgcattcaattaggaaaagaggaagtcaaattgtccctgtttgcagat
gacatggttgtatatttggaaaaccccatcgtctcagcccaaaatctccttaagctgata
agcaacttcagcaaggtctcaggatacatcagtgtgcaaaaatcacaagcattcctatac
accaataacagacaaacagagagccaaatcatgagtgaactcccattcacaattgctaca
aagagaataaaatacctaggaatccaacttacaagggacgtgaaggacctcttcagggag
aactacaaacccctgctcaacgaaataaaagaggacacaaacaaatggaagaacattcca
tgctcatggataggaagaatcaatatcatgaaaatggccatactgcccaaggtattttgt
agattcaatgccatccccatcaagctaccaatgactttcttcacagattggaaaaaatta
ctttaa

>gi568815594r:56710142_56921662|GENSCAN_predicted_peptide_8|341_aa
MGSHYASEDRCDLRLTFRKCWRAQEKAQDECRGDSRELNTVMATQVMGQSSGGGGLFTSS
GNIGMALPNDMYDLHDLSKAELAAPQLIMLANVALTGEVNGSCCDYLVGEERQMAELMPV
GDNNFSDSEEGEGLEESADIKGEPHGLENMELRSLELSVVEPQPVFEASGAPDIYSSNKD
LPPETPGAEDKGKSSKTKPFRCKPCQYEAESEEQFVHHIRVHSAKKFFVEESAEKQAKAR
ESGSSTAEEGDFSKGPIRCDRCGYNTNRYDHYTAHLKHHTRAGDNERVYKCIICTYTTVS
EYHWRKHLRNHFPRKVYTCGKCNYFSDRKNNYVQHVRTHTX

>gi568815594r:56710142_56921662|GENSCAN_predicted_CDS_8|1023_bp
atggggagccactatgcttctgaggacaggtgtgacttacgactgacgtttaggaagtgt
tggagagcgcaggaaaaggctcaggacgagtgtcggggcgactcccgcgagttgaataca
gttatggccacccaggtaatggggcagtcttctggaggaggagggctgtttaccagcagt
ggcaacattggaatggccctgcctaacgacatgtatgacttgcatgacctttccaaagct
gaactggccgcacctcagcttattatgctggcaaatgtggccttaactggggaagtaaat
ggcagctgctgtgattacctggtcggtgaagaaagacagatggcagaactgatgccggtt
ggggataacaacttttcagatagtgaagaaggagaaggacttgaagagtctgctgatata
aaaggtgaacctcatggactggaaaacatggaactgagaagtttggaactcagcgtcgta
gaacctcagcctgtatttgaggcatcaggtgctccagatatttacagttcaaataaagat
cttccccctgaaacacctggagcggaggacaaaggcaagagctcgaagaccaaacccttt
cgctgtaagccatgccaatatgaagcagaatctgaagaacagtttgtgcatcacatcaga
gttcacagtgctaagaaattttttgtggaagagagtgcagagaagcaggcaaaagccagg
gaatctggctcttccactgcagaagagggagatttctccaagggccccattcgctgtgac
cgctgcggctacaatactaatcgatatgatcactatacagcacacctgaaacaccacacc
agagctggggataatgagcgagtctacaagtgtatcatttgcacatacacaacagtgagc
gagtatcactggaggaaacatttaagaaaccattttccaaggaaagtatacacatgtgga
aaatgcaactatttttcagacagaaaaaacaattatgttcagcatgttagaactcataca
gnn