GENSCAN 1.0 Date run: 5-Nov-116 Time: 16:57:31

Sequence gi568815590f:11443225_11664105 : 220881 bp : 48.89% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

1.01  Intr +    230    445  216  1  0   53   35   129 0.458   2.80
1.02  Intr +   1050   1247  198  2  0   78   60    71 0.760   2.85
1.03  Intr +   2116   2251  136  0  1  109  -20    99 0.254   1.34
1.04  Intr +   6561   6630   70  1  1   64   75    92 0.358   3.74
1.05  Intr +  14695  14880  186  1  0   58   86   102 0.394   5.80
1.06  Intr +  20409  20548  140  0  2   79   61    91 0.896   5.61
1.07  Intr +  23094  23240  147  1  0   41   58    85 0.528   1.01
1.08  Intr +  47186  47236   51  0  0   77   81    42 0.002   1.28
1.09  Intr +  59712  59879  168  1  0   37   88    98 0.060   4.62
1.10  Intr +  61625  61685   61  0  1   47   47    94 0.084  -1.01
1.11  Intr +  65926  66129  204  1  0  103   41    71 0.096   2.22
1.12  Intr + 100000 100123  124  1  1  106  105   136 0.114  17.69
1.13  Intr + 102828 102879   52  2  1   76  116    92 0.999   9.28
1.14  Intr + 104808 104901   94  1  1   87   60   179 0.999  14.02
1.15  Intr + 105800 105898   99  2  0   95  100    98 0.997  10.93
1.16  Intr + 106935 107038  104  0  2  113  102    12 0.967   4.92
1.17  Intr + 111519 111649  131  1  2   82   21   193 0.359  12.41
1.18  Intr + 112167 112260   94  2  1  102   92    35 0.357   4.84
1.19  Intr + 113434 113613  180  2  0   57   26   364 0.834  26.94
1.20  Intr + 114738 114814   77  1  2   65   77    58 0.576   1.63
1.21  Intr + 117806 117856   51  1  0  115   68     7 0.572   0.50
1.22  Intr + 118078 118228  151  0  1   94   98   271 0.984  28.44
1.23  Intr + 119755 119886  132  2  0   86   96   173 0.982  18.52
1.24  Term + 120679 120884  206  2  2  120   42   632 0.994  59.33
1.25  PlyA + 121355 121360    6                                 1.05

2.11  PlyA - 121767 121762    6                                -0.45
2.10  Term - 122508 122401  108  0  0   86   39    38 0.172  -2.79
2.09  Intr - 123903 123816   88  2  1   97   80    31 0.556   3.17
2.08  Intr - 124126 123977  150  0  0   65   90    89 0.441   6.08
2.07  Intr - 124688 124565  124  0  1   45   66    50 0.204  -1.86
2.06  Intr - 137456 137289  168  0  0   81   96    34 0.204   3.42
2.05  Intr - 139994 139929   66  0  0  123   68    40 0.416   4.38
2.04  Intr - 145471 145373   99  2  0   99   33    58 0.412   1.48
2.03  Intr - 148489 148411   79  1  1   97   94    27 0.656   3.32
2.02  Intr - 150413 150250  164  0  2   24   45   154 0.465   4.39
2.01  Init - 152897 152795  103  2  1   99   49    87 0.758   6.30
2.00  Prom - 154668 154629   40                                -5.56

3.00  Prom + 167639 167678   40                                -0.36
3.01  Init + 170629 170635    7  0  1   85   63    10 0.594  -1.33
3.02  Intr + 174031 174225  195  2  0   88   44   119 0.829   6.89
3.03  Term + 182801 182949  149  0  2   76   44   151 0.873   7.56
3.04  PlyA + 184140 184145    6                                 1.05

4.00  Prom + 187162 187201   40                                -6.96
4.01  Init + 189087 189165   79  2  1   61  105    52 0.608   5.49
4.02  Intr + 189995 190103  109  0  1   83   89   -10 0.374  -2.06
4.03  Intr + 195506 195644  139  2  1   92   54    82 0.256   5.67
4.04  Intr + 207925 208037  113  0  2   81   90   101 0.429   8.78
4.05  Intr + 214845 214878   34  0  1  111  113    -6 0.235   2.43
4.06  Intr + 214918 215012   95  0  2   60  100    41 0.194   1.26
4.07  Intr + 216411 216530  120  0  0   50   44   131 0.211   4.51
4.08  Intr + 218072 218157   86  2  2   46   84    55 0.071   0.36
4.09  Intr + 219699 219758   60  1  0   88   78    39 0.034   1.61

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 100001 100123  123  1  0   83  105   142 0.878  15.57

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815590f:11443225_11664105|GENSCAN_predicted_peptide_1|1023_aa
AGPIPPTLGSSTTPHHSSTFHTEPAPHSSALHSHTPATVPHGATRPAPSTRRGRSRAEEW
SPEWQAAGVDSRVGGPAGLQPQFLGEGLQAPEVVIGWCGCSRPLFCTHFFHVDLGHRHST
VCPGSRTCEGTGGAGGGLVLSSSSRFYIQVLLYAPSFFLQEQHLTSPLQRQKQINLQLWR
EEGGPQRLLGGAFANHELTPLAAGGARHQLLEGQDTAAPFVSRAQGPVNSRCSVCADDAD
AVKSANKVQPESFTAPDDISPADLPQLHGDAQGTSPAREPLGIYCQLPPLDQDALEGTSG
HTEEMGAWSMPPVGEDGRPRTSTQNPSPDTASRELGDEVQPWLPGSSGRAANPRERMSGL
SRGGELCVAEETHSHVQVLTEAAASHCGVRKSPLAKRLLFSTTMGYKDVPLGSFSSLLSA
GLFAQIDTWDAEPGPEGGFYDKKHQNEDISVAMFQARGLHRTKSWQSSLSVAAVGAANLT
APEGMSSGPQLGFQHQLMPARSQPGLGLGSPGTLPSTIHRAVALDRMGLVSSKKPDKEKP
IKEKDKGQWSPLKVSAQDKDAPPLPPLVVFNHLTPPPPDEHLDEDKHFVVALYDYTAMND
RDLQMLKGEKLQVLKGTGDWWLARSLVTGREGYVPSNFVARVESLEMERWFFRSQGRKEA
ERQLLAPINKAGSFLIRESETNKGAFSLSVKDVTTQGELIKHYKIRCLDEGGYYISPRIT
FPSLQALNPWAQDEWEIPRQSLRLVRKLGSGQFGEVWMGYYKNNMKVAIKTLKEGTMSPE
AFLGEANVMKALQHERLVRLYAVVTKEPIYIVTEYMARGCLLDFLKTDEGSRLSLPRLID
MSAQTLGHSCVERERGGGGAQIAEGMAYIERMNSIHRDLRAANILVSEALCCKIADFGLA
RIIDSEYTAQEGAKFPIKWTAPEAIHFGVFTIKADVWSFGVLLMEVVTYGRVPYPGMSNP
EVIRNLERGYRMPRPDTCPPELYRGVIAECWRSRPEERPTFEFLQSVLEDFYTATERQYE
LQP

>gi568815590f:11443225_11664105|GENSCAN_predicted_CDS_1|3072_bp
gcaggacccatcccgccgactctgggcagctccacaactcctcaccactcgagcacattc
cacaccgagccagctcctcacagcagcgccctgcacagccacaccccagccactgtgccc
catggagccacccggccagcgcccagcacaagaagaggccgctcacgtgcagaagagtgg
tccccggagtggcaggcagctggggtggacagcagggtaggagggcctgcgggtctccag
cctcagtttctcggtgagggccttcaggctccggaggtggtcatcgggtggtgcggctgc
tcccgccccctcttctgcacccacttcttccacgtggatctggggcacagacattctaca
gtctgccctggcagccggacatgcgagggcacggggggcgcagggggaggcttggtcctg
tcctccagcagccgtttctacatccaggtcttactctacgctcctagcttcttcctccaa
gaacaacaccttacctcacctcttcagagacagaagcaaataaacctgcagctgtggagg
gaagaaggcggcccacagcggctccttggaggagccttcgccaaccatgagctaactcct
ctggctgctggaggagccagacatcagctccttgaggggcaggacactgctgcacctttt
gtgtcccgagcacagggccctgtaaacagcagatgttcagtgtgtgctgatgacgctgat
gcggtgaagtcagcgaacaaagtccagccagagtcgttcacagctccagatgacatctcc
ccagcagacctcccacagcttcatggggatgcccaagggacaagccctgcgagggagccc
ttgggcatctactgccagcttccacctttggatcaggatgccctggagggaacttccggc
cacactgaagaaatgggggcttggtccatgcccccagtgggagaagacggcagaccccga
acctccacccagaatccgagcccggacacggccagcagggagctaggagatgaagtccag
ccctggctgcccggctcctctgggagggcagcgaacccgcgggagcgcatgagcggcctc
tcccgggggggcgagctgtgtgtggccgaagaaacacattcacatgtacaagtcctcacg
gaagctgctgccagccactgtggggttcgcaagtccccactcgcaaaacggctactgttt
tccacaaccatgggatacaaggatgtccccttgggctccttctcatctctactctctgct
ggcctctttgcacagattgatacctgggatgctgagccaggaccagagggaggattttac
gacaagaaacaccaaaatgaggacatctctgtggccatgtttcaggcaagggggttacac
cgcacgaaaagttggcaatcctctctctcggtggctgctgtgggagcagctaacctcaca
gcacccgaagggatgtcatcgggaccccagctcggtttccagcatcagctgatgccagca
agaagccagccaggcctggggcttggcagcccagggactttaccatccaccatccacagg
gctgttgccctggacaggatggggctggtaagtagcaaaaagccggacaaggaaaagccg
atcaaagagaaggacaagggccaatggagccccctgaaggtcagcgcccaagacaaggac
gccccgccactgccgcccctggttgtcttcaaccaccttactcctccaccgcccgatgaa
cacctggatgaagacaagcatttcgtggtggctctgtatgactacaccgctatgaatgat
cgggacctgcagatgctgaagggggagaagctacaggtcctgaagggaactggagactgg
tggctggccaggtcactcgtcacaggaagagaaggctatgtgcccagtaactttgtggcc
cgagtggagagcctggaaatggaaaggtggttctttagatcacagggtcggaaggaggct
gagaggcagcttcttgctccaatcaacaaggccggctcctttcttatcagagagagtgaa
accaacaaaggtgccttctccctgtctgtgaaggatgtcaccacccagggggagctgatc
aagcactataagatccgctgcctggatgaagggggctactacatctccccccggatcacc
ttcccctcgctccaggccctgaatccctgggcccaggatgaatgggagatcccccggcag
tctctcaggctggtcaggaaactcgggtctggacaattcggcgaagtctggatgggttac
tacaaaaacaacatgaaggtggccattaagacgctgaaggagggaaccatgtctccagaa
gcctttctgggtgaggccaacgtgatgaaggctctgcagcacgagcggctggtccgactc
tacgcagtggtcaccaaggagcccatctacattgtcaccgagtacatggccagaggatgc
ctgctggatttcctgaagacagatgaagggagcagattgtcactcccaaggctgattgac
atgtcggcgcagacccttggacacagctgtgtggagcgagaacgaggagggggaggggca
cagattgctgaagggatggcatacattgagcgcatgaattccatccaccgcgacctgcgg
gcggccaacatcctggtgtctgaggccttgtgctgcaaaattgctgattttggcttggct
cgaatcatcgacagtgaatacacggcccaagagggggccaagttccccatcaagtggaca
gccccggaagccatccacttcggggtcttcaccatcaaagcagacgtgtggtcgtttgga
gtcctcctgatggaagttgtcacttatgggcgggtgccatacccagggatgagcaacccc
gaggtcatccgcaacctggagcgcggctaccgcatgccgcgccccgacacctgcccgccc
gagctgtaccgcggcgtcatcgccgagtgctggcgcagccggcccgaggagcggcccacc
ttcgagttcctgcagtcggtgctggaggacttctacacggccaccgagcggcagtacgag
ctgcagccctag

>gi568815590f:11443225_11664105|GENSCAN_predicted_peptide_2|382_aa
MEKASSMYEWLLTPPPPPPPSSSSSSPPSQVVFSGRKDSSFPVLPTEDTAALRGCDVLRD
TTLQLRHRPRSPNLRGLTNKEVRSAEETQAGVLLPLLISKGPPALCLASQPLCRLDGSPI
WAAILKLWMCIKEPPSPTPAFYLQGQRIGTHSAGKAPEWPLMQPDPGASAECVCRGKEMA
ANIDPAVEILMSSSDQEEEIKTSSTHPDGLPPPSMGLEAGAWALVQASLQCRCQHCAHCT
DEETEAQRNEAKSPRSNIRGEEVPDFERHVYVPNKNTPGYRFSSGHHSGKCDPSWRNSPS
SFKISSLTFPRKTKPEERLPLAYQSWKKPQASAGRGVSGPEHQPGPQCLPTVSGGCEGQE
LAERSLVAWSLSVSSSAETASW

>gi568815590f:11443225_11664105|GENSCAN_predicted_CDS_2|1149_bp
atggagaaggcttccagcatgtatgagtggttgctaacaccaccaccaccaccaccacca
tcatcatcatcatcatcaccaccatctcaagtggtcttcagtgggaggaaggacagctcc
ttccctgtgttaccgaccgaggacacagcagccctgcgaggctgtgatgtgctcagggac
accacattgcagctgcgacacaggccccggtctcctaatctgcgtgggctgaccaacaag
gaggtccgctccgcagaggagactcaggctggtgtccttctgcccctgctcatctccaag
ggcccccctgccctttgcttggcatctcagcccttgtgtcgcctggatggttccccaatc
tgggctgccatcttgaaactctggatgtgcatcaaagagcccccatctccaacaccggcc
ttctacctccaggggcagaggattggaacacactcagcagggaaagctcccgagtggccg
ctcatgcagcctgaccctggagcatctgcagaatgtgtgtgcagaggaaaagaaatggca
gcaaatatcgatcctgcagtggagattcttatgagcagttctgaccaagaagaggagatc
aaaacctcctccacacatccagatgggttgcctcctccttctatgggcctggaggcagga
gcatgggcattagtccaggcaagcctacaatgcaggtgtcagcattgtgcccattgcaca
gatgaggaaaccgaggctcagagaaatgaagcaaagtctccgaggtcaaacatccgggga
gaggaagtgcctgattttgaaagacatgtttatgtcccgaacaagaacacaccaggatac
aggttcagctccgggcaccattcgggcaagtgcgacccctcgtggagaaactctccctca
tccttcaagatctccagcctcacctttcccaggaaaaccaagccggaggagaggttgcca
cttgcataccagagctggaaaaagccacaggcatctgctggtcgtggagtttcaggacct
gaacatcagcctggcccccagtgcttgcccacggtcagtgggggctgtgagggccaagaa
ctggctgagaggagcctggtggcatggagtctgtccgtttcatcatctgcagaaacagct
tcgtggtag

>gi568815590f:11443225_11664105|GENSCAN_predicted_peptide_3|116_aa
MADWTFQLAAIRISPSHIYGAPTKSGLWWGRQELQKIQRRKSRVPPTQSSPSTKRIKQVP
EIPSPKHASCLQNGAAVGRQESMPGKHVACGPQQMMMTLIITDNDAIDDNDDMDNH

>gi568815590f:11443225_11664105|GENSCAN_predicted_CDS_3|351_bp
atggcggactggaccttccagctggctgccattcgcattagcccctcacatatctatgga
gctcctactaagagcggtctctggtggggccgtcaggagctgcagaagattcagagaagg
aaaagtcgtgtcccacccacccagagctctccatctaccaaaaggataaagcaggtcccc
gaaatccccagcccaaagcatgcttcctgcctgcagaatggcgctgctgtggggaggcaa
gagagcatgcctggaaagcacgtggcatgtgggccacagcaaatgatgatgacattgatc
attactgacaatgatgccatcgatgataatgatgacatggataatcattga

>gi568815590f:11443225_11664105|GENSCAN_predicted_peptide_4|279_aa
MAPNLNLQLLVGCGHLNPNSAFGIFKAKPGLCASGSPATGGKAWTCLSLLLSELGVGKTA
EMCSNHIQGLNFSQNYVNLYNQIGFPKILVLAKPSWDKEASYAYFLICNSPSQTHRSRYA
RDVPVVTTGSSWDTGASGQYLPSNSSRRCLPGSSGKRKRPGSLIRSCLVMCDKVEGMPGF
QAAPTREEKRSTFLISPEAPAGTIGRLPLLLPSGKHGGLSPQGTFGEVTGDSNALVSIKS
FSDLHKSTQIPDSPDKEKSHKAPRFIGLMDSTSTIVVRX

>gi568815590f:11443225_11664105|GENSCAN_predicted_CDS_4|837_bp
atggctccaaatctcaacctccaactgcttgtcggatgtggccacctgaatcccaacagc
gcctttggtatctttaaagcaaagcctgggctgtgtgcgtcaggctctccagccactggg
ggcaaggcctggacctgcctatctctcctgctctctgaactcggtgttgggaagacagcg
gagatgtgctcaaatcacatccaggggctcaacttctcacaaaactatgtgaatttatac
aatcaaattggattccccaagatacttgtgctggctaaaccttcatgggacaaagaagcc
tcatatgcttatttcctcatctgtaatagccccagccagacacatcgatcccgctacgcc
agggatgtacccgtggtgaccacaggtagctcctgggacaccggagcctctggccaatat
ttacccagtaattcaagcagaagatgcctgcctgggtcttctgggaagagaaagaggcca
ggcagcctgattcggagctgccttgtcatgtgtgataaagtggaaggaatgccaggtttc
caggcagccccaaccagggaggagaaaagaagcaccttcctcatctccccagaagcacca
gctggcaccataggccgcctgcccctgctgctgcccagtggaaagcacggtggactttct
ccccagggcacctttggggaagtgacaggagactccaatgcactggtatccatcaaatcc
ttctctgatttacacaaatctacacaaatacctgattcccccgataaggaaaagtcccac
aaggcccccaggtttattggcctcatggactcaactagtaccattgtagtcagagnn