GENSCAN 1.0 Date run: 7-Nov-116 Time: 02:38:12 Sequence gi568815580f:12208293_12426127 : 217835 bp : 46.84% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 3084 3146 63 2 0 67 59 93 0.573 5.45 1.02 Intr + 10841 10916 76 1 1 35 92 69 0.085 1.09 1.03 Intr + 22436 22491 56 0 2 84 76 59 0.142 3.00 1.04 Intr + 23037 23129 93 2 0 95 54 75 0.275 4.96 1.05 Intr + 24903 24981 79 2 1 102 80 51 0.268 4.82 1.06 Intr + 26245 26450 206 2 2 1 23 155 0.100 -0.78 1.07 Intr + 28908 28981 74 2 2 47 80 86 0.150 1.90 1.08 Intr + 31894 32014 121 1 1 96 88 18 0.491 3.10 1.09 Term + 34657 34734 78 0 0 150 37 -3 0.175 -1.44 1.10 PlyA + 35862 35867 6 1.05 2.00 Prom + 36745 36784 40 -8.96 2.01 Init + 37584 37759 176 2 2 76 100 175 0.309 16.34 2.02 Intr + 38528 38677 150 2 0 18 52 114 0.157 0.08 2.03 Intr + 39035 39235 201 2 0 53 72 148 0.166 8.10 2.04 Intr + 45583 45858 276 1 0 29 72 171 0.017 6.23 2.05 Intr + 50446 50481 36 1 0 139 79 -10 0.338 0.58 2.06 Intr + 54533 54677 145 2 1 93 44 99 0.850 6.28 2.07 Intr + 56015 56161 147 1 0 61 119 231 0.982 23.93 2.08 Intr + 65801 65982 182 1 2 89 77 319 0.984 29.57 2.09 Intr + 68831 68962 132 2 0 82 25 164 0.357 9.26 2.10 Intr + 71941 71990 50 1 2 106 70 17 0.190 0.02 2.11 Intr + 78861 79046 186 1 0 70 99 78 0.773 6.76 2.12 Intr + 79285 79339 55 2 1 95 103 21 0.875 2.34 2.13 Intr + 79678 79743 66 1 0 74 34 113 0.639 2.52 2.14 Intr + 96647 96756 110 2 2 85 89 73 0.813 7.03 2.15 Intr + 96935 96973 39 0 0 20 94 89 0.543 0.90 2.16 Term + 98194 98348 155 2 2 33 42 99 0.518 -2.12 2.17 PlyA + 98788 98793 6 -0.45 3.00 Prom + 99187 99226 40 -8.46 3.01 Init + 100001 100057 57 1 0 99 86 117 0.996 13.91 3.02 Intr + 100395 100503 109 2 1 40 67 270 0.994 19.96 3.03 Intr + 102651 102761 111 1 0 95 74 121 0.993 11.75 3.04 Intr + 113267 113392 126 0 0 41 83 52 0.537 0.65 3.05 Term + 116775 117838 1064 1 2 109 38 2543 0.999 243.70 3.06 PlyA + 118100 118105 6 1.05 4.19 PlyA - 119151 119146 6 1.05 4.18 Term - 121491 121273 219 0 0 72 44 304 0.815 21.44 4.17 Intr - 129243 129049 195 0 0 84 100 117 0.988 12.11 4.16 Intr - 132109 131909 201 1 0 64 89 132 0.484 10.38 4.15 Intr - 135955 135840 116 2 2 52 67 146 0.419 9.07 4.14 Intr - 140091 139981 111 1 0 90 79 76 0.833 7.25 4.13 Intr - 142918 142793 126 2 0 23 109 40 0.450 0.25 4.12 Intr - 143121 143014 108 1 0 107 77 126 0.370 13.66 4.11 Intr - 144866 144713 154 2 1 75 101 164 0.921 16.05 4.10 Intr - 148539 148402 138 0 0 54 70 188 0.994 14.26 4.09 Intr - 150651 150378 274 2 1 101 93 335 0.999 32.84 4.08 Intr - 151759 151635 125 1 2 57 24 193 0.999 9.18 4.07 Intr - 155564 155490 75 2 0 78 51 102 0.970 5.21 4.06 Intr - 159090 158984 107 1 2 60 42 67 0.595 -0.77 4.05 Intr - 162634 162557 78 2 0 94 77 38 0.608 2.82 4.04 Intr - 163399 163300 100 1 1 87 94 14 0.817 1.58 4.03 Intr - 168822 168677 146 1 2 64 113 173 0.158 17.40 4.02 Intr - 200085 199477 609 1 0 47 45 360 0.372 19.99 4.01 Init - 207865 207601 265 1 1 49 -11 203 0.113 3.68 4.00 Prom - 208210 208171 40 -6.76 5.00 Prom + 209963 210002 40 -10.94 5.01 Init + 210231 210241 11 2 2 46 75 2 0.315 -5.25 5.02 Intr + 212033 212201 169 2 1 96 89 303 0.878 31.15 5.03 Intr + 213248 213337 90 1 0 93 75 85 0.312 7.89 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815580f:12208293_12426127|GENSCAN_predicted_peptide_1|281_aa MAAPSGGVNCEEFTEFQELLKEVNSVPHENGKDFTAELPSGLDLFVELGLATGDKTPAVT MMTLQVLLQQLSASAPSADSCFEGTPVAVELPESASVLAIMNEAAISVCAQALCGHTFST HVARRVKEMDSAFRKAPAHFRVKGFCDTQHNPISTWSVTTPTRQAVHQHKRESARPRSAQ LVHTPKNPYSGGLTNPLLTAVAGEPPRKPECGKMARGCTILHSHQQCTMGSNSSAYSPAF VTSLFVENCHPNGYEATPSVAEAGLYCRDLNSLQPPCLGLP >gi568815580f:12208293_12426127|GENSCAN_predicted_CDS_1|846_bp atggcagcacccagtggcggtgtgaactgtgaggaattcaccgagttccaggaattactc aaggaagtaaatagtgttccccatgagaatgggaaagacttcacagcggagctgccatct gggctggacctgtttgtggagcttggattggcaacaggagacaagactccagcagtgacg atgatgacgctccaggtgttgttgcagcaacttagtgcttctgccccctctgctgactcc tgctttgaaggcactcccgtggctgtggaacttccagaatcagcctcggtcttagcaatc atgaacgaagctgctataagcgtctgcgcacaggctttgtgtggacacacgttttcaacc catgtggccaggagggtcaaggagatggactctgctttccgcaaagctcctgcacacttc cgcgtcaagggcttctgtgatactcaacacaatcccatctccacatggtcagtgacaacc ccaaccaggcaagctgtccaccagcacaagagggagtctgcccgcccccgatcagctcag ctggtccacactcccaaaaacccatattcagggggtctcaccaatcctctattgacagct gtggctggagaacctccacgcaagccagaatgtgggaagatggcccgtggctgcaccatt ttacattcccatcagcaatgcacgatgggctccaattcctccgcatattcaccagcattt gttacttccctttttgttgaaaattgccatcctaatgggtatgaagcgactccctctgtc gccgaggctggactgtactgccgtgatctcaactcgctgcaacctccctgcctcgggctc ccgtga >gi568815580f:12208293_12426127|GENSCAN_predicted_peptide_2|701_aa MRNNDAKPAGCRAAATPQPPPTSFLPFLHETCRLSGPNTHQGCRGCQKRFSEVELVEERE KIDDKDFILIHLGPGVLSMANAGPNTNRSQYLTGAARTEWVASTWSLASEAPTETGCMRA CSGEHCGCQGPTGNSAEGSKREMFAVMRCTPAGNSRQLQQRGSPRARGQEELHTVGFRIP AGNQRLAEPGAGLWGTPELEASGAHIRLLTCPTRGLGLKSTYPSFRGQSDAARIGYDPNC PDPQKCKTNRLASDQKRPGEHLGFQGPRPRFHCKVDPKEWPLTFMGSQTKRVLFTPLMHP ARPFRVSNHDRSSRRGVMASSLQELISKTLDALVIATGLVTLVLEEDGTVVDTEEFFQTL GDNTHFMILEKGQKWMPGSQHVPTCSPPKRSGIARVTFDLYRLNPKDFIGCLNVKATMYE MYSVSYDIRCTGLKGLLRSLLRFLSYSAQVTGQFLIYLGTYMLRVLDDKEERPSLRSQAK GRRTNTGSCHEHPTRMRPALETVTWTTRAARVSGGHGLLTAQGFLKVSSRPAGLPYGKPS RLSLDAGGRRALMRRRCWQPGAFLGLLLPARLTQAQALRLGTLICKNVNDVRPGQDCGRE RDFSAFGEDETLLSYSLVQPRKGRERKKHSSAISYGPEHQVMNAITAFDPDSAERTASRS PETSRSACPLSNPSDNGLPEATLTPTFWLDQTPGLKFQPKG >gi568815580f:12208293_12426127|GENSCAN_predicted_CDS_2|2106_bp atgaggaacaacgacgcaaagccagccgggtgcagagccgccgcaaccccacagccgcca ccaacctccttcctgccctttctgcacgagacctgccgactgagtggaccaaatacacac caggggtgccgcggctgccagaaacggttctcggaagtggaattggtggaggaaagggag aaaattgatgacaaggacttcatcctgattcatctgggtcctggtgtcttgtccatggca aatgctggacctaacacaaacaggtcccagtatctcaccggcgcagccaggactgagtgg gtggcaagcacgtggtctttggcaagcgaggcacccacggagacaggatgcatgcgagca tgctcaggagagcactgtggatgccaaggaccaactggaaacagcgcagagggcagcaag agggagatgttcgctgtgatgcgctgtacacctgccgggaactccaggcagttacaacag cggggcagtccccgtgcccggggacaggaagagctacacactgttggcttccgcatccct gcagggaaccagcggctcgctgagcctggggctgggctctgggggactccagagctggag gcaagtggcgctcacatccgtctcctcacctgccccacccgtggcctgggtttaaaatcc acatacccgtctttccgcggccaaagtgatgctgccaggattggttatgaccccaactgc cccgacccccagaagtgcaaaacgaacaggctggcaagtgaccaaaagagacccggggag catctgggcttccaaggtcctcggcctaggtttcactgtaaggtggatcccaaggagtgg cccctgacatttatgggatcacagactaagcgagtcctgttcaccccgctcatgcatcca gctcgccctttccgggtctccaaccatgacaggagcagccggcgtggggtgatggcaagc agcctgcaggagctcatcagcaagactctggatgccctcgtcatcgctaccggactggtc actctggtgctggaggaagatggcaccgtggtggacacagaagagttctttcagaccttg ggagacaacacgcatttcatgatcttggaaaaaggacagaagtggatgccgggcagccag cacgtccccacttgctcgccgccgaagaggtcgggaatagcgagagtcaccttcgacttg tacaggctgaaccccaaggacttcatcggctgccttaacgtgaaggccaccatgtatgag atgtactccgtgtcctacgacatccggtgcacgggactcaagggcctgctgaggagtctg ctgcggttcctgtcctactccgcccaggtgacgggacagtttctcatctatctgggcaca tacatgctccgggtgctggatgacaaggaagagcggccatccctccggtcacaagccaag ggcaggcgcacaaacacaggtagctgccatgagcatcccacgcgcatgcgtcctgctctg gagactgtcacttggacgaccagggctgccagggtttccggagggcacggcctcctgact gctcagggcttcctcaaggtgagctcaagacccgcagggcttccctatggcaagccgtcg aggctttctttggatgcaggtggccgcagagcgctcatgcggcgtcggtgctggcagcca ggggcattccttggcctgctgctgcctgccaggctgacccaggcgcaggcgctcagactc ggcaccctcatctgtaagaacgtgaacgatgtccgccctggacaggactgcggtcgcgag agggacttctcagcatttggggaggacgagaccctcctgtcttattccttggtgcagccc agaaaagggcgagagaggaagaaacactcatcagccatctcctacggtccagagcaccaa gtcatgaacgccatcactgccttcgacccagacagcgcagagcgcacagcgagcaggagc ccagaaaccagcaggtcggcttgtcccttgtcaaatccaagtgacaacggacttccggaa gccacgctgacacctactttctggctcgatcagacccctggacttaaattccagcccaag ggctag >gi568815580f:12208293_12426127|GENSCAN_predicted_peptide_3|488_aa MREIVHIQAGQCGNQIGTKFWEVISDEHGIDPAGGYVGDSALQLERINVYYNESSSQKYV PRAALVDLEPGTMDSVRSGPFGQLFRPDNFIFVQCPQGECSIRDGPAHVKPERLVFGSGC ELLPPAVCRQKSSWSQTGAGNNWAKGHYTEGAELVDAVLDVVRKECEHCDCLQGFQLTHS LGGGTGSGMGTLLISKIREEFPDRIMNTFSVMPSPKVSDTVVEPYNATLSVHQLVENTDE TYCIDNEALYDICFRTLKLTTPTYGDLNHLVSATMSGVTTSLRFPGQLNADLRKLAVNMV PFPRLHFFMPGFAPLTSRGSQQYRALTVPELTQQMFDARNMMAACDPRHGRYLTVATVFR GPMSMKEVDEQMLAIQSKNSSYFVEWIPNNVKVAVCDIPPRGLKMASTFIGNSTAIQELF KRISEQFSAMFRRKAFLHWFTGEGMDEMEFTEAESNMNDLVSEYQQYQDATANDGEEAFE DEEEEIDG >gi568815580f:12208293_12426127|GENSCAN_predicted_CDS_3|1467_bp atgagggagatcgtgcacatccaggcgggccagtgcgggaaccagatcggcaccaagttt tgggaagtgatcagcgatgagcacggcatcgacccggccggaggctacgtgggagactcg gcgctgcagctggagagaatcaacgtctactacaatgagtcatcgtctcagaaatatgtg cccagggccgccctggtggacttagagccaggcaccatggacagcgtgcggtctgggcct tttgggcagcttttccggcctgacaacttcatctttgttcagtgcccccagggtgagtgc agcatcagggatggacctgcccacgtgaaaccagagaggctggtctttggcagtggctgt gaattgttgccaccagccgtgtgtagacaaaagtcaagttggagccagacgggtgcaggg aacaactgggcgaaagggcactacacggagggcgcggagctggtggacgcagtgctggac gtggtgcggaaggagtgcgagcactgcgactgcctgcagggcttccagctcacgcactcg ctgggcggcggcacgggctcaggcatgggcacgctgctcatcagcaagatccgtgaggag ttcccggaccgcatcatgaacaccttcagcgtcatgccctcgcccaaggtgtcggacacg gtggtggagccctacaatgccacactgtcggtgcaccagctggtggagaatacagacgag acctactgcatcgacaacgaggcgctctatgacatctgcttccgcactctgaagctgaca acgcccacctacggggacctcaaccacctggtgtccgccaccatgagtggggtcaccacc tcgctgcgcttcccgggccagctcaatgctgacctgcgcaagctggcggtgaacatggtg cccttcccgcgcctgcacttcttcatgcctggcttcgcgccgctcaccagccgcggcagc cagcagtaccgggccctgaccgtgcccgagctcacccagcagatgttcgacgccaggaac atgatggccgcctgcgatccgcgccatggccgctacctgaccgtggccaccgtgttccgc gggcccatgtccatgaaggaggtggacgagcagatgctggccatccagagtaagaacagc agctacttcgtggagtggattcccaacaacgtgaaggtggccgtgtgcgacatcccgccc cgcggcctgaagatggcctccaccttcatcggcaacagcacggccatccaggagctgttc aagcgcatctccgagcagttctcagccatgttccggcgcaaggccttcctgcactggttc acgggtgagggcatggatgaaatggagttcaccgaggcggagagcaacatgaacgacctg gtatccgagtaccagcagtaccaggatgccaccgccaatgacggggaggaagcttttgag gatgaggaagaggagatcgatggatag >gi568815580f:12208293_12426127|GENSCAN_predicted_peptide_4|1048_aa MAAVYPEKLKMDTPPLPKSVRVLSCLVEAEEECAEVYFQEEQRTAYSEIPNQAAGASSAA VRGDLWGILSPKGKRPPLPPSMDGSPHTREQRAPGPRPPPRLRPPPPRPPPLRLPPPWLS TTSGASVPAARPLRTPGERPYPAQSLAQTAVPGKRACPLRDAEVAVWERPARGGDLREAR CEVRPFLDRSSRAREQEPASGRPHRGPQPGLTQTRAPSSRSSLPPAQGAGSAIRAGCFGP RGSGRGVGGASMGGAPLDAAHPVWWLGWLQRLKIALRARLWRSPRPALLPWASRAERATA AAMAHRCLRLWGRGGCWPRGLQQLLVPGGVGPGEQPCLRTLYRFVTTQARASRNSLLTDI IAAYQRFCSRPPKGFEKYFPNGKNGKKASEPKEVMGEKKESKPAATTRSSGGGGGGGGKR GGKKDDSHWWSRFQKVDRLEVVNKRFVRVTFTPGKTPVDGQYVWFNIGSVDTFERNLETL QQELGIEGENRVPVVYIAESDGSFLLSMLPTVLIIAFLLYTIRRGPAGIGRTGRGMGGLF SVGETTAKVLKDEIDVKFKDVAGCEEAKLEIMEFVNFLKNPKQYQDLGAKIPKGAILTGP PGTGKTLLAKATAGEANVPFITVSGSEFLEMFVGVGPARVRDLFALARKNAPCILFIDEI DAVGRKRGRGNFGGQSEQENTLNQLLVEMDGFNTTTNVVILAGTNRPDILDPALLRPGRF DRQIFIGPPDIKGRASIFKVHLRPLKLDSTLEKDKLARKLASLTPGFSGADVANVCNEAA LIAARHLSDSINQKHFEQAIERVIGGLEKKTQVLQPEEKKTVAYHEAGHAVAGWYLEHAD PLLKVSIIPRGKGLGYAQYLPKEQYLYTKEQLLDRMCMTLGGRVSEEIFFGRITTGAQDD LRKVTQSAYAQIVQFGMNEKVGQISFDLPRQGDMVLEKPYSEATARLIDDEVRILINDAY KRTVALLTEKKADVEKVALLLLEKEVLDKNDMVELLGPRPFAEKSTYEEFVEGTGSLDED TSLPEGLKDWNKEREKEKEEPPGEKVAN >gi568815580f:12208293_12426127|GENSCAN_predicted_CDS_4|3147_bp atggcagcagtttacccagagaagctgaaaatggacaccccgcccctgccgaaatcagtc agggttctcagctgcctggttgaggcagaagaggaatgtgctgaagtttattttcaagag gagcagagaactgcctactcagagatccccaaccaggcggcaggagcctcctctgctgct gttcgtggagacctctggggaatcctatcccccaagggcaagaggcctccgctgccaccc agcatggatggctccccacatacacgagagcagcgagcgccaggcccccggccgcccccc cggctccggccgcccccgccccggccgcccccgctccggctgcccccgccctggctgtcg accaccagcggggcctctgtgcctgcagccaggcccctgcgcacgccgggggagaggccg tacccggcgcagagcctcgcccagaccgcggtcccggggaagcgcgcgtgccctctccgg gacgccgaagttgcggtctgggaacggccggcgcggggcggggatctccgcgaggcccgc tgtgaggtccggcctttcctggaccgctccagccgggctcgggagcaggagccggcgtct ggacggccccaccgcggccctcagcccggcctcacccaaacacgtgctccgagctccaga tcttcattgccgccggcgcagggcgcgggctctgcgatccgggccgggtgcttcgggccg cgcggctccgggcgcggcgtgggcggggcctcgatgggcggggcgcccttggatgccgcc caccccgtctggtggctaggctggctgcagcggctaaaaatagctctgcgggcgcggctg tggcgcagccccagacctgcgctgctaccctgggcgtcccgggccgagagggccacggcg gcggccatggcgcaccgctgtttgcggctgtggggccggggcggctgctggccccgcggc ctacagcagctcctcgtgcctggcggcgtgggcccgggcgagcagccctgcctccggacg ctttaccgatttgttacaactcaagcaagggccagcagaaattctcttttgacagatata attgctgcttatcaaagattctgttctcgacccccaaaaggatttgaaaaatactttcct aatggaaaaaatggaaaaaaagctagtgaacctaaagaagttatgggagagaaaaaagaa tcaaagccagctgctaccacacgctcttctggaggaggaggtggtggcggtggaaaacga ggtggcaagaaagatgattctcactggtggtccaggtttcagaaggtagacagattggaa gtcgtcaacaagcgttttgttcgagtgacctttacaccaggaaaaactcctgttgatggg caatacgtttggtttaatattggcagtgtggacacctttgaacggaatctggaaacttta cagcaggaattgggcatagaaggagaaaatcgggtgcctgttgtctacattgctgaaagt gatggctcttttctgctgagcatgctgcctacggtgctcatcatcgccttcttgctctac accatcagaagagggcctgctggcattggccggacaggccgagggatgggcggactcttc agtgtcggagaaaccactgccaaggtcttaaaggatgaaattgatgtgaagttcaaagat gtggctggctgtgaggaggccaagctagagatcatggaatttgtgaatttcttgaaaaac ccaaagcagtatcaagacctaggagcaaaaatcccaaagggtgccattctcactggtcct ccaggcactgggaagacgctgctagctaaggccacagccggagaagccaatgtccccttc atcaccgttagtggatctgagtttttggagatgttcgttggtgtgggccctgctagagtc cgagacttatttgcccttgctcggaagaatgccccttgcatcctcttcatcgatgaaatc gatgcggtgggaaggaagagaggaagaggcaactttggagggcagagtgagcaggagaac acactcaaccagctgctggtggagatggatggttttaatacaacaacaaatgtcgtcatt ttggccggcaccaatcgaccagatatcctggaccccgcgctgcttaggccggggcgtttc gacaggcagatctttattggaccaccagacataaaaggaagagcttctattttcaaagtt catctccgaccgctaaaactggacagtaccctggagaaggataaattggcaagaaaactg gcatctttaactccagggttttcaggtgctgatgttgctaatgtctgtaatgaagctgcg ttgattgctgcaaggcatctgtcagattccataaatcagaaacactttgaacaggcaatt gaacgagtgattggtggcttagagaagaaaacgcaggttctgcagcctgaggagaagaag actgtggcataccacgaagcaggccatgcggttgccggctggtatctggagcacgcagac ccgcttttaaaggtatccatcatcccacgtggcaaaggactaggttatgctcagtattta ccaaaagaacaatacctctataccaaagagcagctcttggataggatgtgtatgacttta ggtggtcgagtctctgaagaaatcttctttggaagaattacaactggtgctcaagatgac ttgagaaaagtaactcagagtgcatatgcccaaattgttcagtttggcatgaatgaaaag gttgggcaaatctcctttgacctcccacgtcagggggacatggtattggagaaaccttac agtgaagccactgcaagattgatagatgatgaagtacgaatacttattaatgatgcttat aaaagaacagtagctcttctcacagaaaagaaagctgacgtggagaaggttgctcttctg ttgttagaaaaagaagtattagataagaatgatatggttgaacttttgggccccagacca tttgcggaaaaatctacctatgaagaatttgtggaaggcactggcagcttggatgaggac acctcacttccagaaggccttaaggactggaacaaggagcgggaaaaggagaaagaggag cccccgggtgagaaagttgccaactag >gi568815580f:12208293_12426127|GENSCAN_predicted_peptide_5|90_aa MVHSHPWDTVIQAAMRKYPNPMNPSVLGVDVLQRRVDGRGRLHSLRLLSTEWGLPSLVRA ILGTSRTLTYIREHSVVDPVEKKMELCSTN >gi568815580f:12208293_12426127|GENSCAN_predicted_CDS_5|270_bp atggtgcatagccacccgtgggacacggtcatccaggcggccatgcgcaagtacccgaac ccgatgaacccgagcgtgctgggcgtggatgtgctacagcgccgcgtggacggccgcggc cgcctgcacagcttgcgcctgctcagcaccgagtgggggctgcccagcctcgtgagagcg attttgggaaccagtaggacattgacatacatccgagaacattctgtggtggatccagtg gaaaagaaaatggaactttgttctaccaat