GENSCAN 1.0    Date run: 2-Nov-116    Time: 18:09:52

Sequence gi568815595r:11458559_11803034 : 344476 bp : 45.24% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.07 PlyA -    946    941    6                               -0.45
 1.06 Term -   3801   3638  164  1  2   88   44    78 0.325   1.50
 1.05 Intr -  15631  15521  111  2  0   91  111     2 0.650   3.15
 1.04 Intr -  25329  25183  147  1  0  111   65    52 0.827   5.41
 1.03 Intr -  29882  29672  211  2  1    1   53   132 0.056  -0.51
 1.02 Intr -  30568  30424  145  0  1   30   41   159 0.039   5.68
 1.01 Init -  32674  32463  212  1  2   59  102   127 0.649   9.46
 1.00 Prom -  33628  33589   40                               -3.06

 2.05 PlyA -  34587  34582    6                                1.05
 2.04 Term -  41506  41297  210  1  0   35   42   142 0.127   1.59
 2.03 Intr -  62001  61957   45  0  0  113   72    56 0.084   5.11
 2.02 Intr -  62447  62297  151  1  1   61   76    54 0.446   1.66
 2.01 Init -  65419  65310  110  1  2   98   62    50 0.371   1.26
 2.00 Prom -  68527  68488   40                               -5.26

 3.00 Prom +  69234  69273   40                               -3.36
 3.01 Init +  71173  71304  132  0  0   92   41    74 0.491   3.27
 3.02 Intr +  75886  76183  298  0  1  102   70    90 0.346   5.05
 3.03 Intr +  76802  76889   88  0  1   74   72    12 0.296  -2.77
 3.04 Term +  78118  78235  118  1  1   67   47   103 0.431   2.11
 3.05 PlyA +  78459  78464    6                                1.05

 4.12 PlyA -  78758  78753    6                               -0.45
 4.11 Term -  79589  79148  442  1  1   66   46   164 0.099   4.73
 4.10 Intr -  90320  90151  170  2  2  104   79   100 0.492   9.44
 4.09 Intr - 100269 100034  236  1  2  118   49   351 0.047  31.61
 4.08 Intr - 100897 100774  124  1  1  107  101   116 0.999  14.96
 4.07 Intr - 106461 106239  223  2  1  121   80   273 0.988  27.93
 4.06 Intr - 109979 109879  101  2  2   83   47    56 0.140  -0.09
 4.05 Intr - 136274 136176   99  2  0   80   49    48 0.049   0.41
 4.04 Intr - 138815 138702  114  2  0   46   89    51 0.088   1.64
 4.03 Intr - 142134 142062   73  2  1   52   53    63 0.127  -1.39
 4.02 Intr - 143464 143275  190  2  1   90   65   225 0.477  19.04
 4.01 Init - 145672 145537  136  1  1   68   36   120 0.609   5.13
 4.00 Prom - 149812 149773   40                               -4.26

 5.03 PlyA - 149941 149936    6                                1.05
 5.02 Term - 156241 156174   68  2  2  121   53    41 0.233   2.00
 5.01 Init - 184945 184879   67  1  1   75   71   134 0.378  11.63
 5.00 Prom - 197783 197744   40                               -0.36

 6.00 Prom + 204421 204460   40                               -6.46
 6.01 Init + 212705 212849  145  1  1   70  103   134 0.881  13.38
 6.02 Term + 214122 214144   23  1  2   69   32    61 0.083  -2.93
 6.03 PlyA + 214269 214274    6                                1.05

 7.00 Prom + 214320 214359   40                               -0.96
 7.01 Init + 233294 233403  110  1  2   70   45   128 0.656   6.39
 7.02 Term + 237668 237692   25  2  1  109   43    21 0.390  -2.60
 7.03 PlyA + 239197 239202    6                                1.05

 8.04 PlyA - 240212 240207    6                                1.05
 8.03 Term - 243700 243687   14  2  2  101   39    11 0.208  -4.14
 8.02 Intr - 244489 244413   77  0  2  104  111    71 0.426  10.16
 8.01 Init - 261360 261272   89  0  2   78   64   112 0.142   5.92
 8.00 Prom - 263462 263423   40                               -3.26

 9.06 PlyA - 265510 265505    6                                1.05
 9.05 Term - 266914 266444  471  1  0    9   49   220 0.150   5.13
 9.04 Intr - 287888 287781  108  2  0   56  115    47 0.037   4.68
 9.03 Intr - 312720 312685   36  0  0   80   99    32 0.530   1.96
 9.02 Intr - 318029 317898  132  2  0   33   98    63 0.444   2.64
 9.01 Init - 321450 321337  114  0  0   60   68    87 0.501   4.11

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  92684  92643   42  2  0  107   69    35 0.930   3.94
S.002 Init + 175298 175318   21  1  0   95   99     4 0.869   2.03
S.003 Term + 175491 175724  234  2  0   98   49   100 0.860   3.42
S.004 Term + 216071 216180  110  0  2   92   53    58 0.856   1.37
S.005 Init + 293562 293682  121  2  1   60  101   140 0.847  12.88
S.006 Init + 333516 333577   62  2  2   55   94    66 0.870   4.62

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595r:11458559_11803034|GENSCAN_predicted_peptide_1|329_aa
MEDEMNEMKQEGKFREKRIKRNEQSLQEIWDYVKRPNLRLIAVPESDGENGTKLENTLQD
IIQEKFPNLARLNQEEVEFLNRPITGSEIVAIINSLPTKKSPGPDGFTAEFYQRYKEELP
PLPDRREDGVQRAAELSQSLPPRRRVPPGRQRLEERTDPAGPEGKEQPPALASQSAEIAA
SARPPPRLGKCWEHNVAYEAQGINIVPIRCSLNSTVESILIGCRVIVYKTHCIKADNILL
RRVLSIQKTLDEYVVSPSEPCKSPARWWHLAYYTLGLVTTINFVFLSLSCVSFLWLHLTD
YLIKEHKTALSNGDERALNLPRLQQKSLN

>gi568815595r:11458559_11803034|GENSCAN_predicted_CDS_1|990_bp
atggaagatgaaatgaatgaaatgaagcaagaagggaagtttagagaaaaaagaataaaa
agaaacgaacaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacgtttg
attgctgtacctgaaagtgacggggaaaatggaaccaagctggaaaacactctgcaggat
attattcaggagaaattccccaatctagcaagactaaaccaggaagaagttgaatttctg
aatagaccaataacaggctctgaaattgtggcaataatcaatagcttaccaaccaaaaag
agtccaggaccagatggattcacagccgaattctaccagaggtacaaggaggaactgccg
ccgctgcccgaccgccgggaggatggagttcagcgggcagcggagctgtctcagtctttg
ccgccgcgccggcgagtgccgcccgggaggcagcggctggaggagcggacggaccccgcg
gggcccgagggcaaggagcagccgcctgccttggcctcccaaagtgccgagattgcagcc
tctgcccggccgccaccccgtctgggaaaatgctgggaacacaatgtagcctacgaagcc
caaggaataaacattgtgccaataagatgttctttaaattctaccgtagagtccatcctg
attggctgcagggtcattgtgtacaaaacccactgcattaaagcagataacatcttgctt
agaagagtgctgagcattcagaagacactggatgaatatgtcgtatcccccagtgagccc
tgcaaatcacctgccaggtggtggcacttggcttactacacacttggacttgtgaccacg
ataaactttgtcttcctgagcctctcctgtgtctccttcttgtggctgcacctaacagac
tatctcataaaagaacataaaacagcccttagcaatggtgacgagagggccctgaacctt
ccaaggctccagcagaagtccttaaactga

>gi568815595r:11458559_11803034|GENSCAN_predicted_peptide_2|171_aa
MQSAMALDKGWALGLGTMKCCSYYQTSGNRRFCVQSSSCVGPGATTTPLMLIGSLKCKEM
LTCLQNVGPMEKTHNQLCLLLREECWRGQHGIFLIPMSPAHPHPQKMLKWILFELLLDQG
TQNTPELVAFVAHQEIRIKMLIKYLLATYYEDCIAQCILGDAQDWYDTTPV

>gi568815595r:11458559_11803034|GENSCAN_predicted_CDS_2|516_bp
atgcagagcgcaatggctctagacaaggggtgggcgctggggctggggaccatgaagtgt
tgcagttattaccagacaagtggaaacagaagattctgtgtccagtccagctcctgcgtg
gggcctggagcaactactactcccctgatgctaataggcagccttaagtgcaaggaaatg
ctcacttgtttgcagaatgttggccccatggagaagactcacaaccaactgtgcctactt
ctcagggaagaatgctggcggggccagcacggcatcttcctcatccccatgtccccagcc
catccgcatccccagaagatgttaaaatggattttatttgaattattgctggaccaggga
actcagaacactcctgaacttgtggcgtttgtggcccatcaggaaataaggattaagatg
ctcataaaatatttgttggccacctactatgaggactgcattgcccaatgcatcttgggg
gatgcacaagactggtatgatactacacctgtgtaa

>gi568815595r:11458559_11803034|GENSCAN_predicted_peptide_3|211_aa
MAGDTPHVIVVAVGGDRAERTRELMSLPEGTKDAEMALCSSAPQAGPVLPCGLDMLMEER
TPPQSQGGLERIRVLPECASFMPGGYVPPGRDCILRFRSRCAKGSCHTPSVSRTWSWAPN
GPSGVKSRCLQPTVGRCSLPAQPVAPSMVPKSLWRNPGFCDSQSLLRRPSLSRCGTNLLP
RPCGSDRGRAAKDHPGLVTYDSGSSSAGDLA

>gi568815595r:11458559_11803034|GENSCAN_predicted_CDS_3|636_bp
atggcaggggacaccccacacgtcatcgtggtagcagtgggaggggacagggcagaaagg
accagagagttgatgtctctgcccgaaggtaccaaggatgctgagatggctctgtgctcg
tcagcccctcaggcaggtcctgtgttgccctgtgggttagacatgctcatggaggagagg
acacctccccagagccaaggcggtctggaacgcatcagggttcttcctgaatgtgcgtcc
ttcatgccgggtggctacgtgccgccaggcagggactgcatactgcggtttcgtagtcgc
tgtgctaaggggagctgccacacgccgtctgttagcaggacatggagttgggccccaaat
ggccctagtggtgtcaaaagcagatgtcttcaacccacagtgggacgctgctccctccca
gcgcagcctgtggcacccagcatggtgcccaagtccctgtggaggaaccctggcttttgt
gattctcaaagtcttctgcgtcggccctccctgagcaggtgcggaacgaacctgctgccc
cgaccctgcggctccgaccggggccgagcagcgaaggaccaccctggtctggtcacctac
gattctgggtccagctcagctggtgaccttgcttag

>gi568815595r:11458559_11803034|GENSCAN_predicted_peptide_4|635_aa
MLKYYSTHLPPSPRDLVEASVTVPGEGHLECLHGACSPSPGPTPTREAALRGEPRIQTLP
VASALSSHRTGPPPISPSKRKFSMEPGDEDLDCDNDHVSKMSRIFNPHLSSVLAFRSFQS
AWTVVFSHKTLTEKSLKKMNVCLEVEKYSVFGKSGLKDGKRWSGIHLVEKSLLTLDFSCV
ALYYLHASKLIQPVPCVWTFWQFQCLCLTHSAHCAVFHGLSFQLVGLEVPFSLLPCVGNK
TANGDCRRDPRERSRSPIERAVAPTMSLHGSHLYTSLPSLGLEQPLALTKNSLDASRPAG
LSPTLTPGERQQNRPSVITCASAGARNCNLSHCPIAHSGCAAPGPASYRRPPSAATTCDP
VVEEHFRRSLGKNYKEPEPAPNSVSITGSVDDHFAKALGDTWLQIKAAKDGASSSPESAS
RRGQPASPSAHMESVLKTHLHYTKGYYPLLHDLLQEKTGNRPKRPSAGNQLTKPFDAHPH
LDYCGPETQNATNLPSGSLWPSGTSVSPGVHISPSQLDNRGHLESHTGTYALFPPPFPGL
FYIQGQISARYHFGSVALGIKETALDLGQDAPNFGSGLYQPPALCGLARFSGLSFPAGKY
ISPFYHQPYRGYQGYIKQLESTDPEPGTEQGLRKC

>gi568815595r:11458559_11803034|GENSCAN_predicted_CDS_4|1908_bp
atgctgaagtactacagcacgcatttaccaccgtctccccgggacctggtggaggcgtct
gtcacagtccctggtgaaggacacctggagtgccttcatggggcctgtagccccagccct
ggtcccacccccacccgcgaagctgctctcaggggagaacccagaatacagaccctgccg
gtggcctctgccctcagcagtcaccgcaccggccctcccccaatcagccccagcaagagg
aagttcagcatggagccaggtgacgaggacctagactgtgacaacgaccacgtctccaaa
atgagtcgcatcttcaacccccatctctccagcgtcctcgcattccgctcgttccagtca
gcttggactgttgtgttcagtcataagacactgacagagaaaagtcttaagaagatgaat
gtctgtttggaagttgaaaagtactctgtgtttggaaagagtggattaaaggatgggaag
aggtggagtgggattcatcttgtggaaaaatcactcctcactctggatttcagctgcgtg
gcactctattacctgcatgcatcaaagctcatccaaccagtcccctgtgtttggacattt
tggcaatttcagtgtctttgtctgacccatagtgcccactgtgctgtgttccacggactg
tctttccagttagtgggcctggaagtccccttctccctgctgccctgcgtagggaacaag
actgccaatggagactgccgcagagacccccgggagcggagccgcagccccatcgagcgc
gctgtggcccccaccatgagcctgcacggcagccacctgtacacctccctccccagcctt
ggcctggagcagcccctcgcactgaccaagaacagcctggacgccagcaggccagccggc
ctctcgcccacactgaccccgggggagcggcagcagaaccggccctccgtgatcacctgt
gcctcggctggcgcccgcaactgcaacctctcgcactgccccatcgcgcacagcggctgt
gccgcgcccgggcctgccagctaccggaggccaccgagcgctgccaccacctgtgacccc
gtggtggaggagcatttccgcaggagcctgggcaagaattacaaggagcccgagccggca
cccaactccgtgtccatcacgggctccgtggacgaccactttgccaaagctctgggtgac
acgtggctccagatcaaagcggccaaggacggagcatccagcagccctgagtccgcctct
cgcaggggccagcccgccagcccctctgcccacatggagtctgtcctgaagacacacctc
cactatacgaaaggatattatcccctgctgcatgatctcttacaggaaaagactgggaac
aggcccaagaggccctcagcggggaaccagctgactaaaccatttgatgcacatccacac
ctggactactgtggccctgaaacacaaaatgccaccaaccttccgtcaggttccctgtgg
ccctcaggaacatctgtcagccctggggtccacatatctcctagccagctggacaacagg
ggtcatctggaatcccacactgggacctatgccctcttccctccacctttccctggcctc
ttctacattcagggccagatctcggccaggtaccactttggctctgtggcattgggtata
aaggaaacagcactggacttggggcaagacgctcccaattttggatccgggctctaccag
ccaccagctctctgtgggctagctcgtttctcaggactcagtttccccgcaggaaaatac
atctctccattttaccaccaaccatacagaggttatcaaggatacataaaacaactggaa
agcacagacccagaacctggcaccgagcagggcctcaggaaatgttaa

>gi568815595r:11458559_11803034|GENSCAN_predicted_peptide_5|44_aa
MDLLNYQYLDKMNNNIGILCYEGLSGSRMVCVPKSLMLLPLCQD

>gi568815595r:11458559_11803034|GENSCAN_predicted_CDS_5|135_bp
atggacctgttgaactatcagtacttggacaagatgaacaacaatatcggcattctgtgc
tacgaagggttgtcaggctctcgaatggtgtgcgtgcccaagtccctaatgctgcttcca
ctgtgtcaggactga

>gi568815595r:11458559_11803034|GENSCAN_predicted_peptide_6|55_aa
MVQTWMSPTFECTGLRDFVKMRILSHQFRSEDYDSAFLTSSQVTLWATCEDTENR

>gi568815595r:11458559_11803034|GENSCAN_predicted_CDS_6|168_bp
atggtgcagacctggatgtctccaacttttgagtgcacaggactcagggattttgtcaaa
atgcgtattctcagtcatcagttccgcagcgaggactacgattctgcatttctaacaagt
tcccaggtgacgctgtgggccacatgtgaggacacggagaaccggtaa

>gi568815595r:11458559_11803034|GENSCAN_predicted_peptide_7|44_aa
MSSEDISNTGWLTALEMFTPDTDRGSDSQSGERRDDGAPEDLAL

>gi568815595r:11458559_11803034|GENSCAN_predicted_CDS_7|135_bp
atgagctcagaagacatctccaacacaggatggctgacagcactggagatgttcacacca
gatactgaccggggcagtgactctcaaagtggagagagaagagatgatggggcaccagag
gacttggcactctga

>gi568815595r:11458559_11803034|GENSCAN_predicted_peptide_8|59_aa
MSTRPARWRLSGAPAPTNLTAPARRHRPGRLQPGMETPLDVLSRAASLVHADDEKRYYQ

>gi568815595r:11458559_11803034|GENSCAN_predicted_CDS_8|180_bp
atgtccactcgccctgcccgctggcgcctctcgggagcgcccgccccgacaaacctcact
gccccagcgcgccggcatcggcccgggaggctccaaccaggaatggagacgccattggat
gttttgtccagggcagcatctctggtgcatgctgatgacgaaaaacgttactatcagtga

>gi568815595r:11458559_11803034|GENSCAN_predicted_peptide_9|286_aa
MDQKGQIEVMSRKWTQYNLMTDYGEGLAKAPAGTNGRMATWDGLLLRGHKPVQQVTVLNT
EAVVNTVVSTVCLDMSTHSKGTPAKKRERRVTHQLLALIGKAALTFVHKSSSNITFLLGK
QLTVHFWRHMNEKKKKEEEEEEEEEERRRRRRRGRRKRRRRKRRRRRRKKRRRRRRKKGG
GGEEKKEEEKKEKEEKREEEKEKEEGEEKEKEEREEKEEEKEEGAKEGREQGGRRRGRRR
GERRRGGGGGAGGGSVRGRGRRGKRGRGRRTGRRKRRIRFSEQSEL

>gi568815595r:11458559_11803034|GENSCAN_predicted_CDS_9|861_bp
atggaccaaaagggacagattgaagtgatgtctaggaagtggactcagtacaacttgatg
actgactatggagaagggcttgccaaggctcccgctggcacaaatgggagaatggctaca
tgggatggcctactgctccgaggccacaaacctgtacagcaggttactgtgctgaatacc
gaggcagttgttaacacagtggtgagcactgtgtgtctagacatgtccacacatagcaaa
ggtacaccagccaagaagagggaacgcagggtgactcaccagcttttggctcttattggt
aaagctgctctgacatttgtgcacaagtcttctagtaacattacatttctcttgggtaaa
cagctcactgtacacttctggcgtcatatgaatgagaagaagaagaaggaggaggaggag
gaggaggaggaggaggaaagaagaagaagaagaagaagaggaagaagaaaaagaaggagg
aggaagaggaggaggagaagaagaaaaaaaaggaggaggaggagaagaaaaaaaggagga
ggaggagaagaaaaaaaggaggaggagaagaaggagaaggaggagaagagggaggaagaa
aaggagaaggaggagggggaggagaaggagaaggaggagagggaggagaaggaggaggag
aaggaggagggagcaaaagagggaagagaacaaggagggaggagaagaggaaggagaaga
ggagagaggagaagaggagggggaggaggggcaggaggaggaagtgtaagaggaagagga
agaagaggaaaaagaggaagaggaagaagaacaggaagaagaaagagaagaatcagattc
tctgagcaatctgaactatga