GENSCAN 1.0 Date run: 7-Nov-116 Time: 19:52:25 Sequence gi568815578f:37084179_37336709 : 252531 bp : 46.02% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.23 Intr - 11061 10923 139 2 1 98 27 79 0.513 3.27 1.22 Intr - 11847 11581 267 2 0 24 75 353 0.132 24.25 1.21 Intr - 18657 18522 136 1 1 67 28 112 0.167 2.73 1.20 Intr - 24486 24421 66 1 0 91 127 38 0.970 6.88 1.19 Intr - 28268 28142 127 2 1 88 47 64 0.792 2.55 1.18 Intr - 29967 29812 156 0 0 95 94 140 0.999 15.51 1.17 Intr - 31159 30995 165 1 0 38 58 132 0.469 5.36 1.16 Intr - 36440 36326 115 1 1 92 94 30 0.730 4.45 1.15 Intr - 39621 39431 191 0 2 104 29 156 0.476 9.68 1.14 Intr - 44989 44854 136 0 1 58 61 82 0.874 3.07 1.13 Intr - 53792 53632 161 2 2 111 92 12 0.539 2.69 1.12 Intr - 57136 56984 153 1 0 62 92 50 0.842 3.07 1.11 Intr - 59663 59539 125 0 2 84 90 96 0.996 9.70 1.10 Intr - 63731 63596 136 2 1 84 56 36 0.931 0.14 1.09 Intr - 70989 70885 105 0 0 98 105 83 0.910 11.41 1.08 Intr - 73777 73682 96 1 0 81 92 15 0.709 1.41 1.07 Intr - 74858 74714 145 1 1 52 78 83 0.956 3.98 1.06 Intr - 75998 75919 80 2 2 136 64 78 0.978 8.55 1.05 Intr - 77488 77451 38 2 2 83 100 29 0.698 1.58 1.04 Intr - 83362 83279 84 2 0 50 70 77 0.568 1.89 1.03 Intr - 87879 87745 135 1 0 28 89 124 0.843 7.04 1.02 Intr - 89938 89817 122 0 2 112 98 88 0.994 12.34 1.01 Init - 90108 90092 17 0 2 53 88 -26 0.076 -6.22 1.00 Prom - 93649 93610 40 -5.66 2.00 Prom + 93779 93818 40 -8.16 2.01 Init + 94060 94073 14 0 2 72 66 12 0.086 -4.19 2.02 Intr + 95074 95191 118 1 1 73 76 139 0.121 11.67 2.03 Intr + 95297 95387 91 1 1 44 -5 94 0.085 -4.73 2.04 Intr + 100002 100195 194 1 2 47 116 216 0.869 19.51 2.05 Intr + 114219 114314 96 2 0 53 105 71 0.957 5.51 2.06 Intr + 114872 115047 176 1 2 99 69 124 0.987 10.34 2.07 Intr + 119707 119782 76 1 1 90 48 76 0.996 3.32 2.08 Intr + 120589 120723 135 0 0 79 67 135 0.942 11.26 2.09 Intr + 123095 123271 177 1 0 53 94 261 0.885 23.32 2.10 Intr + 125869 125987 119 0 2 80 110 44 0.937 5.06 2.11 Intr + 129582 129687 106 0 1 77 92 131 0.989 12.62 2.12 Intr + 139700 139791 92 1 2 98 94 97 0.995 10.09 2.13 Intr + 141510 141624 115 0 1 81 71 145 0.986 12.55 2.14 Intr + 144372 144566 195 2 0 39 68 201 0.914 12.81 2.15 Intr + 145795 145881 87 0 0 98 76 75 0.954 7.47 2.16 Intr + 148118 148213 96 1 0 72 105 85 0.935 8.81 2.17 Intr + 149842 149917 76 0 1 67 119 60 0.999 6.09 2.18 Intr + 152402 152531 130 0 1 81 94 184 0.914 18.05 2.19 Term + 155435 155516 82 2 1 117 38 13 0.278 -3.63 2.20 PlyA + 157419 157424 6 1.05 3.12 PlyA - 158690 158685 6 1.05 3.11 Term - 158831 158802 30 2 0 103 39 37 0.264 -1.85 3.10 Intr - 159553 159415 139 0 1 97 19 96 0.227 4.07 3.09 Intr - 170151 170032 120 2 0 54 80 95 0.635 4.91 3.08 Intr - 172320 172216 105 2 0 106 86 218 0.990 22.73 3.07 Intr - 172491 172436 56 0 2 73 84 -27 0.421 -6.82 3.06 Intr - 172730 172629 102 2 0 129 115 8 0.513 7.97 3.05 Intr - 177494 177446 49 1 1 13 105 60 0.205 -1.12 3.04 Intr - 177677 177565 113 2 2 18 94 125 0.479 5.28 3.03 Intr - 192063 191980 84 0 0 7 111 66 0.086 0.82 3.02 Intr - 195935 195913 23 0 2 46 102 36 0.030 -1.94 3.01 Init - 202492 202339 154 1 1 101 45 113 0.437 8.44 3.00 Prom - 207605 207566 40 -4.26 4.00 Prom + 209314 209353 40 -3.26 4.01 Init + 217086 217235 150 2 0 92 101 233 0.996 25.04 4.02 Intr + 229658 229751 94 1 1 19 37 134 0.019 0.94 4.03 Intr + 232327 232492 166 2 1 77 58 45 0.047 -0.58 4.04 Intr + 244039 244175 137 1 2 85 94 38 0.360 4.31 4.05 Intr + 249631 249711 81 2 0 72 96 45 0.792 3.31 4.06 Term + 249829 249971 143 2 2 52 47 117 0.863 2.09 4.07 PlyA + 251055 251060 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578f:37084179_37336709|GENSCAN_predicted_peptide_1|965_aa MKKIRRICSQEEVVIPCAYDSDSESVDLELSNLEIIKKGSSSIELTDLDIPDIPGLHCEP LSHSPRHLTQQDPLSEAIVEKLIQSIQKVFNVPDSSRNCLGNLGYKDKEDKIPIYAAKQG KRNPLEAAETQKVLVQEERPHSLSSSMRQEVFVTIADLSYQDVHLLLGSEDRAELFSLTI KSIITLPSVRTLTQIQEIMPNGTCNTECLYRQTFQAFSEMLQSLVVKDPHLENLDTIIKH LVPWLQSVKDHERERATASMAQVLKCLSKHLNLKLPLRFQRLGHLVALMALLCGDPQEKV AEEAAEGIHSLLHITLRLKYITHDKKDQQNLKRALTKCREFLELHSSAAKCFYNCPFRIA QVFEGFLDSNELCQFIMTTFDTLKTLKHPCIQRSAGELLLTLAKNTESQFEKVPEIMGVI CAQLSIISQPRVRQQIINTVSLFISRPKYTDIVLSFLLCHPVPYNRHLAEVWRMLSVELP STTWILWRLLRKLQKCHNEPAQEKMAYVAVAATDALYEVFLGNRLRAATFRLFPQLLMTL LIQIHHSIGLTMSDVDIPSGLYTEQEVPSEVTPLCALLERNQLLAQKVMYLLVPLLNRGN DKHKLTSAGFFVEREDIKSLLPYIVDSLRETDEKIVLSAIQILLQLVRTMDFTTLAAMMR TLFSLFGDVRSDVHRFSVTLFGAAIKSVKNPDKKSIENQVLDSLVPLLLYSQDENDAVAE ESRQVLTICAQFLKWKLPQEVYSKDPWHIKPTEAGTICRFFVCLLSKYMDHNELRRMGTD WIEDDLRDLLCDPEPSLCIIASQTLLLVQMARAEPKPKQRVNWLQKLMGRFSRALAQVVV GSAPGREKEVGGRGAQPAGPEGMFEDKPHAEGAAVVAAAGEALQALCQELNLDEGSAAEA LDDFTAIRGNYSLEVSGSRVELPVAVQQFDQSQCAHEADLSPTFPPPSEHRPDALHQCLA LNKTN >gi568815578f:37084179_37336709|GENSCAN_predicted_CDS_1|2895_bp atgaaaaaaataagaaggatctgtagtcaggaagaagtagtgatcccctgtgcctatgac agtgattcagaaagtgtggatttggagctgagcaacttagagattattaaaaaaggctca agtagcattgaactgacagacttggacatccctgacatccctggactccattgtgagccc ctgtcacatagccccagacacctgacccaacaggacccgctcagtgaggccattgttgag aaactgatccagtccatccagaaggttttcaatgtgcctgacagttccaggaactgtctt gggaatttgggctacaaagacaaagaagacaaaatccctatttatgcagccaagcaaggt aagagaaatcctctagaagcagctgaaacacaaaaggtactggtacaagaggaacgcccg cattctctgtccagttccatgcgccaggaggtctttgtcaccatcgctgatctcagttac caagatgtccatttgctgttgggctctgaagatcgagctgagttgttcagtcttaccatc aagagtataatcactctgccctctgtaaggacccttacccagatacaggaaatcatgccc aatgggacctgcaacacagagtgtctttacaggcagacgtttcaggcattctctgagatg ctccagagtttggtggtaaaagacccacatttggaaaatcttgacaccattattaagcac ttggtcccctggttacagtcagtcaaagaccatgagcgggaacgggccacggccagcatg gctcaagttctgaagtgcctatccaaacatctcaacttgaagcttccactgcgattccaa agacttggacacctagtggctctgatggcactgctctgtggggacccacaggaaaaggtg gctgaggaggctgcagagggcattcactccctgctgcatatcaccctgaggctgaagtat atcactcatgacaagaaagatcagcaaaacttgaaaagagcattgacaaaatgtcgagaa ttcctggagctccacagctctgccgctaaatgcttctacaactgtcccttcagaattgcc caggtctttgaaggttttcttgattcaaatgagctctgccagtttataatgactacattt gataccctgaaaaccctgaaacatccctgcatccagcgatcagcaggagaattactgcta actttggcaaaaaatacagagtcccaatttgagaaggtgccagaaattatgggagttatc tgtgcccagttatccataatcagccagcctagagtccgccaacaaatcataaataccgtg agtttatttatatccagacccaagtacacagatatagtgctcagcttccttctgtgtcat ccagtgccgtataacaggcacctggctgaggtgtggagaatgctgtcggtggagcttccc agcacgacctggattctgtggaggctcctgaggaagctgcagaaatgccataatgagcct gcacaggagaagatggcatatgtggctgtggctgcaacagatgccctttatgaggtgttt ttgggaaacaggcttcgagcagctacgttccgactctttcctcagcttctcatgacactg cttatccagattcatcacagcatcggcctcaccatgtctgatgtcgacatcccaagtggc ctgtacacagaacaggaagtgccttcagaggtcacccctttgtgcgcattgctggaaaga aatcagctccttgcacagaaggtcatgtacttattagtccctcttcttaaccgagggaat gataaacataaactcacatctgcaggcttttttgtggagagagaagacatcaagagcctg ttgccatacattgtagacagcttgcgtgaaaccgatgagaagatcgttctgtcagccatc cagatactcctgcaacttgttagaacaatggatttcactaccctggctgccatgatgagg accctgttctccttatttggtgatgtgagatctgatgttcatcgtttctccgtgactctc tttggagccgccataaagtctgtaaaaaacccagataagaagagtatagagaaccaagtc ctggacagcttggtcccactacttctgtattctcaggatgaaaatgatgcagtagctgag gagagcaggcaagtcctaactatatgtgcccagttcctgaagtggaagctgccccaagaa gtgtactccaaagatccctggcacatcaaacctactgaagcaggaacaatctgcagattc tttgtatgccttttatcgaagtacatggatcacaatgagctcaggaggatgggtactgac tggatagaggacgatctgagagacctgctgtgtgaccctgagccctcgctgtgcatcatc gcttcccagactctgttactagtccagatggcgagggccgaaccaaaacctaagcagaga gtgaactggttgcagaagctcatgggcagattttcgcgcgctttggcgcaggtggttgtg ggtagcgcgcctgggagggagaaagaagtcgggggccgtggcgcgcagcccgcggggcct gaagggatgttcgaggacaagccccacgctgagggggcggcggtggtcgccgcagccggg gaggcgctacaggccctgtgccaggagctgaacctggacgaggggagcgcggccgaagcc ctggacgactttactgccatccgaggcaactacagcctagaggtgagcggcagcagggtg gagctgccggtcgctgtgcagcagtttgatcaaagccaatgtgcacacgaagctgatctc agccctacgtttcctcctccgtcagagcatcgtcctgatgctttgcatcagtgtctggct ctgaataaaacaaat >gi568815578f:37084179_37336709|GENSCAN_predicted_peptide_2|724_aa MSRLQACRESLASPVAGSWSHFPERKSARGSDSGGTCSEEWRRRGHGHKLWLGRSRIEGP KEGCELVGVPATWRGSSTVFLLALTIIASTWALTPTHYLTKHDVERLKASLDRPFTNLES AFYSIVGLSSLGAQVPDAKKACTYIRSNLDPSNVDSLFYAAQASQALSGCEISISNETKD LLLAAVSEDSSVTQIYHAVAALSGFGLPLASQEALSALTARLSKEETVLATVQALQTASH LSQQADLRSIVEEIEDLVARLDELGGVYLQFEEGLETTALFVAATYKLMDHVGTEPSIKE DQVIQLMNAIFSKKNFESLSEAFSVASAAAVLSHNRYHVPVVVVPEGSASDTHEQAILRL QVTNVLSQPLTQATVKLEHAKSVASRATVLQKTSFTPVGDVFELNFMNVKFSSGYYDFLV EVEGDNRYIANTVELRVKISTEVGITNVDLSTVDKDQSIAPKTTRVTYPAKAKGTFIADS HQNFALFFQLVDVNTGAELTPHQTFVRLHNQKTGQEVVFVAEPDNKNVYKFELDTSERKI EFDSASGTYTLYLIIGDATLKNPILWNVADVVIKFPEEEAPSTVLSQNLFTPKQEIQHLF REPEKRPPTVVSNTFTALILSPLLLLFALWIRIGANVSNFTFAPSTIIFHLGHAAMLGLM YVYWTQLNMFQTLKYLAILGSVTFLAGNRMLAQQAVKRFNGVVIYLQKNVPFLVYNSVNP EKPL >gi568815578f:37084179_37336709|GENSCAN_predicted_CDS_2|2175_bp atgtcgaggctgcaagcctgccgcgagtccctggcgtcccctgtggcgggctcttggagc cactttcccgagcggaagtcagcccgcggctcggactccggcgggacctgctcggaggaa tggcgccgccgggggcatgggcacaagctctggctggggcgctctcggatcgagggtccg aaggagggctgcgagctggtgggagtgcccgcgacctggcggggttcaagcactgtcttc ctgttggccctgacaatcatagccagcacctgggctctgacgcccactcactacctcacc aagcatgacgtggagagactaaaagcctcgctggatcgccctttcacaaatttggaatct gccttctactccatcgtgggactcagcagccttggtgctcaggtgccagatgcaaagaaa gcatgtacctacatcagatctaaccttgatcccagcaatgtggattccctcttctacgct gcccaggccagccaggccctctcaggatgtgagatctctatttcaaatgagaccaaagat ctgcttctggcagctgtcagtgaggactcatctgttacccagatctaccatgcagttgca gctctaagtggctttggccttcccttggcatcccaagaagcactcagtgcccttactgct cgtctcagcaaggaggagactgtgctggcaacagtccaggctctgcagacagcatcccac ctgtcccagcaggctgacctgaggagcatcgtggaggagattgaggaccttgttgctcgc ctggatgaactcgggggcgtgtatctccagtttgaagaaggactggaaacaacagcgtta tttgtggctgccacctacaagctcatggatcatgtggggactgagccatccattaaggag gatcaggtcatccagctgatgaacgcgatcttcagcaagaagaactttgagtccctctcc gaagccttcagcgtggcctctgcagctgctgtgctctcgcataatcgctaccacgtgcca gttgtggttgtgcctgagggctctgcttccgacactcatgaacaggctatcttgcggttg caagtcaccaatgttctgtctcagcctctgactcaggccactgttaaactagaacatgct aaatctgttgcttccagagccactgtcctccagaagacatccttcacccctgtaggggat gtttttgaactaaatttcatgaacgtcaaattttccagtggttattatgacttccttgtc gaagttgaaggtgacaaccggtatattgcaaataccgtagagctcagagtcaagatctcc actgaagttggcatcacaaatgttgatctttccaccgtggataaggatcagagcattgca cccaaaactacccgggtgacatacccagccaaagccaagggcacattcatcgcagacagc caccagaacttcgccttgttcttccagctggtagatgtgaacactggtgctgaactcact cctcaccagacatttgtccgactccataaccagaagactggccaggaagtggtgtttgtt gccgagccagacaacaagaacgtgtacaagtttgaactggatacctctgaaagaaagatt gaatttgactctgcctctggcacctacactctctacttaatcattggagatgccactttg aagaacccaatcctctggaatgtggctgatgtggtcatcaagttccctgaggaagaagct ccctcgactgtcttgtcccagaaccttttcactccaaaacaggaaattcagcacctgttc cgcgagcctgagaagaggccccccaccgtggtgtccaatacattcactgccctgatcctc tcgccgttgcttctgctcttcgctctgtggatccggattggtgccaatgtctccaacttc acttttgctcctagcacgattatatttcacctgggacatgctgctatgctgggactcatg tatgtctactggactcagctcaacatgttccagaccttgaagtacctggccatcctgggc agtgtgacgtttctggctggcaatcggatgctggcccagcaggcagtcaagaggtttaat ggagttgtaatttatctgcagaaaaatgtaccctttttagtgtacaattctgtgaatcct gaaaaacctctgtag >gi568815578f:37084179_37336709|GENSCAN_predicted_peptide_3|324_aa MDTDAHIERTSCEDEGRYRDDASTAKEHHRLPAGHQKLVKRHGTGSSQPSEGEHNVGGKV FTESENSTLKGNDEVLWSNTLTLQTAQENEEINDGNARRLPEQTPSPGPLDLSSASEQRD ICRIRNTESKRPTAILFAHGLVPPRVKDATLGVLLCDPHPQQQLPLLPTSPFDPQRRQTQ LHRGHSEGSESLLGMRRYADAIFTNSYRKVLGQLSARKLLQDIMSRQQGESNQERGARAR LGRQVDSMWAEQKQMELESILVALLQKHSHWLSFVPSDQDTQQPLYFHPQALLPLLHTTV HQPRTQSVFQARTGLDPIQNGIEF >gi568815578f:37084179_37336709|GENSCAN_predicted_CDS_3|975_bp atggacacagatgcacacatagagagaacgtcatgtgaagatgaaggcagatatcgggac gatgcctctacagccaaggaacaccacagattgccagcaggccaccagaagctagtcaag aggcatgggacaggctcctcacagccctcagaaggagagcacaacgtgggcggtaaggta ttcactgaatcagagaactcgactcttaaaggaaatgatgaggtcttatggtctaatacc ctcactttacagacagctcaggaaaatgaagagataaatgatgggaacgccaggcggctg ccagagcaaacacccagcccagggcccctggatttgagcagtgcctcggagcagagggat atctgccgcatcagaaacactgagtccaagaggcccaccgccatcctctttgcccatgga ctggtgccaccccgggtgaaggatgccactctgggtgttcttctttgtgatcctcaccct cagcaacagctcccactgctccccacctccccctttgaccctcagaggaggcagacacag cttcacagaggtcactcagaggggtctgagtctctcttggggatgcggcggtatgcagat gccatcttcaccaacagctaccggaaggtgctgggccagctgtccgcccgcaagctgctc caggacatcatgagcaggcagcagggagagagcaaccaagagcgaggagcaagggcacgg cttggtcgtcaggtagacagcatgtgggcagaacaaaagcaaatggaattggagagcatc ctggtggccctgctgcagaagcacagccactggctgtcctttgttcccagtgaccaggac acccagcagcctttgtacttccaccctcaggccctactacccctgctccacaccactgtc caccagccccgcacccagtctgtcttccaggcccgcacaggtctggatcccattcagaac ggcatagagttctag >gi568815578f:37084179_37336709|GENSCAN_predicted_peptide_4|256_aa MASDLDFSPPEVPEPTFLENLLRYGLFLGAIFQLICVLAIIVPIPKSHEAGQTMGAVCRV AKETGIPVDEGDQRKLWGLTQCSGSEEAVTPYDRRDLDSRNSPQAPAGQSTTSSSFCFCD GLESRGLKHTVSIDCIRDPESLLLCSHLVETPNLKCGTLLLKPEKDPGLWPLAKAQAVQF SSAQAQCKGASLPHYPLLPYPGNVCGRRKGAKCTTDVTEHPQQVPPEGPAQSGCGKAKCM TSRLEDFGPQEPWAVD >gi568815578f:37084179_37336709|GENSCAN_predicted_CDS_4|771_bp atggcctctgacctagacttctcacctccggaggtgcccgagcccactttcctggagaac ctgctacggtacggactcttcctgggagccatcttccagctcatctgtgtgctggccatc atcgtacccattcccaagtcccacgaggcgggccaaaccatgggagcagtttgccgggtg gccaaggaaacaggtattcctgttgatgaaggtgaccagcggaagctctggggcctgact cagtgctcagggtctgaggaggctgtgacgccctatgaccgcagagatctagacagtcgt aacagtccccaggctccagctgggcaatccaccacttcctcttccttctgcttctgtgac ggtttagagtcaagggggctgaaacacactgtgagcatagactgtattagggatcctgag tctttgctcctatgtagtcacttggtagaaacgccgaacctgaaatgtggcactttgctt ctcaagccagagaaggatccaggcttatggcccttagcaaaagcccaagcggttcagttc agctcagcccaggcccagtgcaaaggagcctcccttcctcattacccgcttctgccctac cccgggaacgtgtgtggacgtagaaagggtgcaaaatgcacaacggatgtcacagagcat ccccagcaggtgcccccagagggaccagcccagagcgggtgcggaaaggccaagtgcatg accagccgccttgaggactttggcccccaggaaccttgggctgtggactag