GENSCAN 1.0 Date run: 4-Nov-116 Time: 05:22:29 Sequence gi568815575f:116072281_116273369 : 201089 bp : 36.46% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 PlyA - 380 375 6 1.05 1.01 Sngl - 20125 19763 363 1 0 87 55 145 0.800 7.17 1.00 Prom - 20450 20411 40 -3.85 2.03 PlyA - 20665 20660 6 1.05 2.02 Term - 36137 35602 536 0 2 29 43 508 0.955 33.82 2.01 Init - 36498 36219 280 0 1 61 -46 470 0.994 28.12 2.00 Prom - 40283 40244 40 -1.55 3.00 Prom + 40521 40560 40 -2.35 3.01 Init + 44386 44527 142 0 1 62 48 139 0.142 7.64 3.02 Intr + 49638 49928 291 1 0 6 71 158 0.082 2.08 3.03 Intr + 52244 52491 248 0 2 49 41 195 0.013 7.16 3.04 Term + 69196 69678 483 0 0 15 43 247 0.758 6.66 3.05 PlyA + 71562 71567 6 1.05 4.00 Prom + 71672 71711 40 -6.75 4.01 Sngl + 87953 88969 1017 1 0 88 43 695 0.985 61.77 4.02 PlyA + 89197 89202 6 1.05 5.00 Prom + 89366 89405 40 -5.95 5.01 Sngl + 89459 90112 654 1 0 49 39 194 0.529 6.52 5.02 PlyA + 90173 90178 6 1.05 6.04 PlyA - 92626 92621 6 1.05 6.03 Term - 101017 100712 306 1 0 13 42 413 0.060 23.33 6.02 Intr - 107826 107627 200 1 2 91 49 137 0.276 8.25 6.01 Init - 107983 107887 97 1 1 68 54 55 0.541 0.71 6.00 Prom - 114573 114534 40 -3.75 7.07 PlyA - 114640 114635 6 1.05 7.06 Term - 130970 130866 105 2 0 75 45 78 0.335 -0.37 7.05 Intr - 132874 132794 81 1 0 78 91 54 0.599 3.62 7.04 Intr - 143672 143499 174 2 0 32 76 152 0.694 7.61 7.03 Intr - 144938 144788 151 1 1 57 72 58 0.264 0.34 7.02 Intr - 160740 160652 89 0 2 122 32 110 0.215 6.65 7.01 Init - 161570 161493 78 2 0 70 59 65 0.818 2.91 7.00 Prom - 162012 161973 40 -1.35 8.00 Prom + 165161 165200 40 -6.35 8.01 Init + 167231 167289 59 1 2 81 81 -9 0.435 -1.47 8.02 Intr + 167999 168124 126 2 0 56 103 112 0.605 8.37 8.03 Intr + 172738 172848 111 1 0 43 77 115 0.254 4.48 8.04 Intr + 173291 173527 237 2 0 36 44 142 0.217 0.41 8.05 Intr + 174098 174225 128 2 2 66 50 136 0.786 7.00 8.06 Intr + 174469 174647 179 2 2 10 92 78 0.286 -0.88 8.07 Intr + 178138 178294 157 0 1 47 49 149 0.353 5.56 8.08 Term + 182413 182915 503 2 2 44 32 289 0.322 12.56 8.09 PlyA + 182933 182938 6 1.05 9.00 Prom + 184109 184148 40 -5.65 9.01 Sngl + 184163 184606 444 1 0 71 49 208 0.864 11.19 9.02 PlyA + 185508 185513 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 49947 50123 177 2 0 88 46 184 0.806 8.90 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575f:116072281_116273369|GENSCAN_predicted_peptide_1|120_aa MRDSAIRLGYYAFPMAFAICRSDPLVCLHHQDPGFQAQNQVAVWADTKLVAGVFFSYPSG TWNPSKTEPFTSLEKGLKPGSQVVLLSGSHSLEAQQAKNHWIEIQQAQQSEVNLGRLSLV >gi568815575f:116072281_116273369|GENSCAN_predicted_CDS_1|363_bp atgagggactctgctatcaggctgggttactatgcttttcctatggcttttgcaatctgc agatcagatcctcttgtgtgcctacaccaccaggaccctgggtttcaagcacaaaaccag gtggctgtttgggcagacaccaagctagtggcaggagtatttttttcataccccagtggc acctggaaccccagcaagacagaaccattcacttccctggaaaaagggctgaagccaggg agccaagtggtcttgctcagcgggtcccactcccttgaagcccagcaagctaagaaccac tggattgaaattcagcaagcacagcagtctgaggtcaacctgggacgattgagcttggtg tga >gi568815575f:116072281_116273369|GENSCAN_predicted_peptide_2|271_aa MKEERSKKEEEEEEEEEEKEEGEGEEEGEEEGGGGGRRRKRRRRRRRRRRRRRRRRKKKE EERRREKKKEEEDYRPTSLMNLDAKILKKIPANQELYRNYGILADAMEQVVQHKNAYEVI PGGVKGGTKEKRLVAQFIPKFFKQFPELVESAINALLDLCEDEDVSIRRQAIKELPQFAT GENLPRVADILMQLLQTNNSAEFNLVNNALLSIFKMVAKGTLGGLFSQILRGEDIVRERA IKFLSTKLKTLPDEVLTKEVKELILTESKMS >gi568815575f:116072281_116273369|GENSCAN_predicted_CDS_2|816_bp atgaaagaagaaagaagcaagaaggaagaggaggaggaggaggaagaagaagaaaaagaa gaaggagaaggagaagaagaaggagaagaagaaggaggaggaggaggaagaagaagaaaa agaagaaggagaaggagaagaagaaggagaagaagaaggaggaggaggaagaagaaggaa gaagaaagaagaagagagaagaaaaaagaagaagaagactacagaccaacatccctgatg aatttggatgcaaaaattcttaagaaaataccagctaaccaggagctttaccgcaattat ggcatcctggccgatgccatggagcaagtggtccagcataaaaatgcctatgaagtgata ccgggtggtgtgaaaggtggtactaaggaaaaacgattagtagcccaatttattccgaaa ttctttaagcagtttccagaattggttgagtctgctatcaatgcactgttagacctctgt gaggatgaagatgtatctattcgacgtcaagcaattaaagaactgcctcaatttgccact ggagaaaatcttcctcgagtggcagatatattaatgcaacttttgcagacaaataactct gcagaatttaacctagtgaacaacgccctgttaagtatatttaaaatggttgcaaaaggg actttaggtgggttgttcagccaaatacttcgaggagaggacattgttagagaacgagca attaaattcctttctacaaaacttaagactttaccagatgaagtcttaacaaaggaagtg aaagagcttatactaactgaatccaaaatgtcctag >gi568815575f:116072281_116273369|GENSCAN_predicted_peptide_3|387_aa MINITNDQGNANQNHNVIPPNSCKNVHNQKTVDVGMDPVIMEHFYTAALEAPHSRGRLTP HTARYSSETKLPEEQSGSNICCSTTSAILQPLLLIPRQKRSGVDLQQTPKDLQLRVLTVR RKTNKQKGHPHQKPICTSPSSKTKEIQTTIREYYKHLYANKLENLEEMDKFLDTYTLPSL NQEEAESLKRPITGSEIEAVIHSLPTKKTPGPDGFTAELYQRYKEELRLGALCPAAPEGQ GTAQAVASEGGSPKPWQLPHGVEPAGVQKSRIEVWEPLSRFQKMYGNDWMPRQKFAEGAG PSWRTSARTVWKENVESEPPHRVPIGALPSGAVRREALSSRPHNGRSTGSLCHAPGKATG SWEGGCTLQRHRGGAAQNHGNPPLASA >gi568815575f:116072281_116273369|GENSCAN_predicted_CDS_3|1164_bp atgatcaacattactaatgatcagggaaatgcaaatcaaaatcacaatgtgataccacct aactcctgcaagaatgtccataatcaaaaaacagtagatgttggaatggatccagtgatc atggaacacttctacactgctgccttagaggcaccccacagtaggggcagactaacacct cacacggcccggtactcctctgagacaaaacttccagaggaacaatcaggcagcaacatt tgctgttcaacaacatctgctattctgcagcctctgctgctgatacccaggcaaaaaaga tctggagtggacctccagcaaactccaaaagacctgcagttgagggtcctgactgttaga aggaaaactaacaaacagaaaggacatccacaccaaaaacccatctgtacgtcaccatca tcaaagaccaaagaaatacaaactaccatcagagaatactataaacacctctacgcaaat aaactagaaaatctggaagaaatggataaattcctcgacacatacaccctcccaagtcta aaccaggaagaagctgaatctcttaagagaccaataacaggctctgaaattgaggcagta attcatagcttaccaaccaaaaaaactccaggaccagatggattcacagccgaattatac cagaggtacaaggaggagctgagacttggtgccctgtgtccagccgctccagagggccaa ggtacagctcaggctgttgcttcagagggtggaagccccaagccttggcagcttccacat ggtgttgagcctgcaggtgtgcagaagtcaagaattgaggtttgggaacctctgtctaga tttcagaagatgtatggaaatgactggatgcccaggcaaaagtttgctgaaggggcgggc ccttcctggagaacctctgctaggacagtgtggaaggaaaatgtggagtcagagccccca cacagagtccctattggggcactgcctagtggagctgtgagaagagaggctctgtcctcc agaccccacaatggtagatccactggcagcttgtgtcacgcacctggaaaagccacaggc agctgggagggaggctgtaccctgcaacgccacaggggtggagctgcccaaaaccatgga aacccacctcttgcatcagcgtga >gi568815575f:116072281_116273369|GENSCAN_predicted_peptide_4|338_aa MGKKQSRKTGNSKNQSASPPPKKHSSSPAMEQSWTENDFDELREEGFRRSNNSELQEEIQ TKGKEVKNFEKNLDECVTRITNTEKCLKELMELKAKAQELREECRSLRSRCDQLEERVSV MEDEMNEMKREGKFREKRIKRNEQSLQEIWDYVKRLNLHLIGVPESDEENGTKLENTLQD IIQENFPNLAREANIQFQEIQRTPQRYSPRRATPRQIIVRFTKVEMKEKMLRAAREKGQV THKGKPIRLTADLLAETLQARREWKPIFNILKEKNFQPRISYPAKLSFISEGEIKYFTHK QMLRDFVTTRPALKELLKEALNMERNNQYQPLQNHAKL >gi568815575f:116072281_116273369|GENSCAN_predicted_CDS_4|1017_bp atggggaaaaaacagagcagaaaaactggaaactctaaaaatcagagtgcctctcctcct ccaaagaaacacagttcctcaccagcaatggaacaaagctggacagagaatgactttgac gagttgagagaagaaggcttcagacgatcaaacaactctgagctacaggaggaaattcaa accaaaggcaaagaagttaaaaactttgaaaaaaatttagacgaatgtgtaactagaata accaatacagagaagtgcttaaaggagctgatggagctgaaagccaaggctcaagaacta cgtgaagaatgcagaagcctcaggagccgatgcgatcaactggaagaaagggtatcagtg atggaagatgaaatgaatgaaatgaagcgagaagggaagtttagagaaaaaagaataaaa agaaacgaacaaagcctccaagaaatatgggactatgtgaaaagactaaacctacatctg attggtgtacctgaaagtgacgaggagaatggaaccaagttggaaaacactctgcaggat attatccaggagaacttccccaatctagcaagggaggccaacattcagtttcaggaaata cagagaacgccacaaagatactccccgagaagagcaactccaagacaaataattgtcaga ttcaccaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaaggtcaggtt acccacaaagggaagcccatcagactaacagctgatctcttggcagaaactctacaagcc agaagagagtggaagccaatattcaacattcttaaagaaaagaattttcaacccagaatt tcatatccagccaaactaagcttcataagtgaaggagaaataaaatactttacacacaag caaatgctgagagattttgtcactaccaggcctgccctaaaagagctcctgaaggaagca ctaaacatggaaaggaacaaccagtaccagccactgcaaaatcatgccaaattgtaa >gi568815575f:116072281_116273369|GENSCAN_predicted_peptide_5|217_aa MGDFNTPLSTLDRSTRQKVNKDTQELNSVLHQVDLIDIYRTLHPKSTEYTFFSAPNHTYS KIDHIVGSKALLSKCKRTEIITNCLSDHSAIKLELRIKKLTQNCSTTWKLNNLLLNDCWV HNKMKAEIKMFFETNENKDTTYHNLWDTFKAVCRGKFIALNVHKRKQERSKIDTLTSQLK ELEKQEQTHSKASRRQEITKMRAELKEIETQKNPSKI >gi568815575f:116072281_116273369|GENSCAN_predicted_CDS_5|654_bp atgggagactttaacaccccactgtcaacattagacagatcaacgagacaaaaagttaac aaggatacccaggaattgaactcagttctgcaccaagtggacctaatagacatctacaga actctccaccccaaatcaacagaatatacatttttttcagcaccaaaccacacctattcc aaaattgaccacatagttgggagtaaagctctcctcagcaaatgtaaaagaacagaaatt ataacaaactgtctctcagaccacagtgcaatcaaactagaactcaggattaagaaactc actcaaaactgctcaactacatggaaactgaataacctgctcctgaatgactgctgggta cataacaaaatgaaggcagaaataaaaatgttctttgaaaccaacgagaacaaagacaca acataccacaatctctgggacacattcaaagcagtatgtagagggaaatttatagcacta aatgtccacaagagaaagcaggaaagatccaaaattgacaccctaacatcacaattaaaa gaactagagaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaaa atgagagcagaactgaaggaaatagagacacaaaaaaacccttcaaaaatttaa >gi568815575f:116072281_116273369|GENSCAN_predicted_peptide_6|200_aa MISFDPMSHIQVTLMQGVGSHGLVKLPTVALQGAWFKLSVDLSFWGLEDGGPLLTAPVGS ALVGALCGGSDPTFPFHTALVEVVREGPTPTANFFLGIQPSNWNPKHTAELLLEPVSNKT IQKRINAAVGESQEDGKRKCQVNDCYNFAAINDTHPGQSIQEGQNMEGKPANDEGQNNSS CHLQDLVTGYPVLPIAIRLQ >gi568815575f:116072281_116273369|GENSCAN_predicted_CDS_6|603_bp atgatctcctttgaccccatgtctcacatccaggtcacgctgatgcaaggggtgggttcc catggtcttgtgaagctccccactgtggctttgcagggagcatggttcaagctgtcagtg gacttatcattctggggtctggaggacggtggccctcttctcacagctccagtaggcagt gccctagtaggtgctctctgtgggggctctgaccccacatttcccttccacactgcccta gtggaggttgtccgtgaaggacccacccctacagcaaacttcttcctgggcatccagcca agtaattggaaccctaaacacactgcggagcttctgttggaaccggtttccaacaaaaca atacagaaacggattaacgcagctgttggtgaatcccaagaggatggcaaaaggaagtgc caggtcaatgactgctataacttcgcagctattaatgacacccatccaggccagagcatc caggaaggtcagaacatggaagggaagccagcaaatgatgaaggccagaacaacagcagc tgccatcttcaggacttggtcacgggttatcctgttcttcccatagctattcgtcttcag taa >gi568815575f:116072281_116273369|GENSCAN_predicted_peptide_7|225_aa MAITKKSKITDAGKVVEKREHLYTVGETSFIKIVERGLTSRERLISQYPVNSLYKGDINE AGNHHSQQTNTGTENQTPHVLTRKWEFNNENTWTHGGEHHTLWGLSHHPNTETWKRYNNS NVKLQTDTLDEQQSKNPQQNIVIPNAAAHQKTYLLRSSRLYPLDLPLVSVSNTTVTATLL SFVLSGTQTSKVESKDWQHSPQADRRALGILVNTSGGLAQQSVDE >gi568815575f:116072281_116273369|GENSCAN_predicted_CDS_7|678_bp atggctattactaaaaagtcaaaaataacagatgctggcaaggttgtggagaaaagggaa cacttatacactgttggtgaaacatcattcatcaagattgtggagagaggacttactagt agagagcggttgatttcccaatacccggttaactcactctacaaaggggacataaatgaa gctggaaaccatcattctcagcaaactaacacaggaacagaaaaccaaacaccacatgtt ctcactcgtaagtgggagtttaacaatgagaacacatggacacatggaggggaacatcac acactctggggcctgtcgcatcatcctaataccgaaacctggaagagatacaacaacagc aacgtaaaacttcagactgatacccttgatgagcaacaatccaaaaatcctcagcaaaat attgtcataccaaatgcagcagcacatcaaaaaacttatctactacgatcaagtaggctt tatcccttggatctccctctggtgtctgtatctaacactacagttactgcgacactactt tcctttgttctaagtggaacccagacatccaaggttgagtccaaggactggcagcactca ccacaagctgacagaagggctttggggattttagtgaataccagtggaggcctggcacaa caatctgtggatgagtga >gi568815575f:116072281_116273369|GENSCAN_predicted_peptide_8|499_aa MDIAILQCTGKSVHHQKGSREKHYGTIKYYRASLELAEQELPKRVEKKEDNVSLNSFSAM LPRKTSVSAAVLVSATIPISRVQGPLQVLGQEVFLLLHCLPIAHLPINVKPPLISPAQKE TIKEISKGPQNPLGYRLCPLQAVGGGEFGPTRVHVPFSLSDLKQINADLGKLSDDPDRDQ EEEAQKEKQDQRKAAALVMALRQTNLGDSERTENGAGQSLAVEGQEIDFLLDNGAAFSVL ISCPRQLSSRSVTIRGILGQPVTRYFSHLLSCNWETFLQISSPLEDTTTAGPFFTPIQEE VARVVIIQFPTAIGVSCLEGGLTGEANRASESETQTTIREYYKHLYANKLENLEEMDKFL NTYTLPRLNQEEVESLNRPITGSEIESIINSLPTKKSPGPDGFTAKFYQRYKEELVPFLL KLFQSIEKEGILPNSFYEASIILIPKTGRDTTKKENFRPISLMNIDAKILNKILANRIQQ HIKKLIYHDQVGFIPGMQG >gi568815575f:116072281_116273369|GENSCAN_predicted_CDS_8|1500_bp atggatatagcaattttacagtgcacaggaaagagtgtccatcaccagaaggggtcaaga gaaaagcattatggaacaattaaatattaccgagccagtttagaacttgctgagcaggaa ttgcccaaaagagtggagaagaaagaagacaatgtttcactgaattcattctctgctatg ttaccgaggaaaactagtgtttctgctgctgtgttggtgagtgcaactattccgatcagt agggtccagggaccgttgcaggttcttgggcaagaggtgtttctgctgttgcattgcctc cctatagctcaccttcctattaatgttaagcctcctctaatctcccctgcccagaaggaa acaataaaagaaatctccaaaggaccacaaaaccccctgggctatcggttatgtcccctt caagctgtagggggaggggaatttggcccaacccgggtacatgtccctttctccctctct gatttaaagcagatcaatgcagacctgggaaagctttcagatgatcctgatagggatcaa gaggaagaggcccaaaaggaaaagcaagatcagagaaaggctgcagcattagtcatggcc ctcagacaaaccaaccttggtgattcagagaggacagaaaatggagcaggccaatcactt gccgttgagggccaggaaattgacttcctcctggacaatggtgcggctttctcagtgtta atctcctgtcccagacagctgtcctcaaggtccgttaccatccggggaatcctgggacag cctgtaaccaggtatttctcccacctcctcagttgtaattgggagacttttctacagata agttcccctctggaggacactacaactgcaggccccttcttcacccctatccaggaggaa gtagctagagtggtcatcatccaattcccaacagcaattggggtgtcctgtttagagggg ggattgacaggtgaagccaaccgggcttctgagtcagaaacacaaactaccatcagagaa tactacaaacacctctatgcaaataaactagaaaatctagaagaaatggataaattcctc aacacatacactctcccaagactaaaccaggaagaagttgaatctctgaatagaccaata acaggatctgaaattgagtcaataattaacagcttaccaaccaaaaaaagtccaggacca gatggattcacagccaaattctaccagaggtacaaggaggagctggtgccattccttctg aaactattccaatcaatagaaaaagagggaatcctccctaactcattttatgaggccagc atcatcctgataccaaagactggcagagacacaacaaaaaaagagaattttagaccaata tccttgatgaacattgatgcaaaaatcctcaataaaatactggcaaaccgaatccagcaa cacatcaaaaagcttatctaccatgatcaagtgggcttcatccctgggatgcaaggctag >gi568815575f:116072281_116273369|GENSCAN_predicted_peptide_9|147_aa MGKDFMSKTPKAVATKAKIDKWDLIKLKSFCTAKETTIRVNRKPTEWEKIFAIYSSDKGL ISRIYNELKQIYKKKTNNPINKWAKDMNRHFSKEDIYAAKRHMKKCSSSLAIREMRIKTT MRYHLTPVRMVIIKTSGNNRCGEDVEK >gi568815575f:116072281_116273369|GENSCAN_predicted_CDS_9|444_bp atgggcaaggacttcatgtctaaaacacccaaagcagtggcaacaaaagccaaaattgac aaatgggatctaattaaactaaagagcttctgcacagcaaaagaaactaccatcagagtg aataggaaacctacagaatgggagaaaatttttgcaatctactcatctgacaaagggtta atatccagaatctacaatgaactcaaacaaatttacaagaaaaaaacaaacaaccccatc aacaagtgggcgaaggatatgaacagacacttctcaaaagaagacatttatgcagccaaa agacacatgaaaaaatgctcatcatcactggccatcagagaaatgcgaatcaaaaccaca atgagataccatctcacaccagttagaatggtgatcattaaaacgtcaggaaacaacagg tgtggagaggatgtggagaaatag