GENSCAN 1.0    Date run: 8-Nov-116    Time: 00:29:34

Sequence gi568815584r:72571139_72871893 : 300755 bp : 45.94% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  12704  12764   61  1  1   96   75    42 0.624   5.38
 1.02 Intr +  16715  16804   90  0  0   69   63    62 0.424   1.77
 1.03 Intr +  19299  19396   98  1  2   31  105    50 0.094   0.73
 1.04 Term +  29786  29962  177  1  0   81   42   276 0.286  19.99
 1.05 PlyA +  30597  30602    6                               1.05

 2.03 PlyA -  31128  31123    6                               1.05
 2.02 Term -  33490  33359  132  1  0   93   54    64 0.823   1.59
 2.01 Init -  35823  35764   60  0  0   76   89    48 0.673   4.95
 2.00 Prom -  36295  36256   40                              -5.66

 3.06 PlyA -  37112  37107    6                               1.05
 3.05 Term -  48846  48712  135  0  0   95   42   199 0.564  14.02
 3.04 Intr -  58598  58486  113  0  2  117   76    92 0.985  11.00
 3.03 Intr -  59743  59651   93  2  0  100  113    38 0.876   7.44
 3.02 Intr -  75359  75235  125  1  2   45   43    81 0.020  -0.47
 3.01 Init -  90654  90425  230  0  2   73   56   130 0.649   6.43
 3.00 Prom -  90718  90679   40                              -5.56

 4.03 PlyA -  91130  91125    6                               1.05
 4.02 Term -  96974  96948   27  2  0  105   54    26 0.671  -1.03
 4.01 Init -  98879  98814   66  2  0   96   95    75 0.830  10.07
 4.00 Prom - 100025  99986   40                              -4.46

 5.05 PlyA - 100700 100695    6                               1.05
 5.04 Term - 100770 100748   23  1  2   91   49    49 0.374  -0.23
 5.03 Intr - 103230 103102  129  1  0   38  105   145 0.842  11.87
 5.02 Intr - 108162 108075   88  0  1   89   65    39 0.732   1.24
 5.01 Init - 110373 110335   39  0  0   91   85    65 0.791   4.74
 5.00 Prom - 116928 116889   40                              -2.66

 6.17 PlyA - 117248 117243    6                               1.05
 6.16 Term - 118064 117897  168  2  0  108   44   123 0.808   7.78
 6.15 Intr - 120212 120181   32  0  2   17   66    34 0.362  -8.05
 6.14 Intr - 122075 121938  138  0  0  105   69   163 0.972  16.54
 6.13 Intr - 132116 132075   42  0  0   88   86    39 0.427   1.91
 6.12 Intr - 143363 143285   79  2  1   69   98    69 0.699   5.12
 6.11 Intr - 152590 152495   96  1  0  113  114   138 0.999  19.11
 6.10 Intr - 160796 160669  128  0  2  120   66   138 0.969  15.20
 6.09 Intr - 162743 162647   97  2  1   76   91    14 0.096   0.08
 6.08 Intr - 166289 166093  197  0  2   99    3    96 0.021   1.33
 6.07 Intr - 167422 167391   32  0  2   76   77    21 0.058  -2.43
 6.06 Intr - 170852 170698  155  2  2   80   62    72 0.048   2.67
 6.05 Intr - 171088 171000   89  2  2   96   64    56 0.072   3.69
 6.04 Intr - 178075 178058   18  2  0   89  105    18 0.097   0.18
 6.03 Intr - 182233 182084  150  2  0  115   81   144 0.823  16.53
 6.02 Intr - 200755 200595  161  0  2  101   99   230 0.941  25.03
 6.01 Init - 211841 211828   14  2  2   76  105     7 0.129   1.02
 6.00 Prom - 214210 214171   40                              -6.06

 7.00 Prom + 215608 215647   40                              -5.36
 7.01 Init + 217072 217164   93  0  0   65   83   115 0.457   9.08
 7.02 Term + 222596 222703  108  1  0   90   42   112 0.757   5.31
 7.03 PlyA + 223264 223269    6                               1.05

 8.02 PlyA - 224785 224780    6                               1.05
 8.01 Sngl - 227689 227414  276  1  0  100   43   186 0.811  10.68
 8.00 Prom - 230375 230336   40                              -6.76

 9.04 PlyA - 233777 233772    6                               1.05
 9.03 Term - 239885 239820   66  2  0   56   44   100 0.730   0.34
 9.02 Intr - 242023 241860  164  2  2  116   66   111 0.914  11.39
 9.01 Init - 242885 242876   10  2  1   86   98     0 0.765   1.52
 9.00 Prom - 243257 243218   40                              -5.86

10.04 PlyA - 245690 245685    6                               1.05
10.03 Term - 264907 264774  134  2  2   54   51   126 0.884   3.75
10.02 Intr - 265041 264946   96  1  0   54  103    73 0.158   5.38
10.01 Init - 277820 277784   37  2  1   64   98    35 0.158   2.27

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815584r:72571139_72871893|GENSCAN_predicted_peptide_1|141_aa
MEDTRRLPALGPELSAHGDTEKDGFYSTLLHKSCSFDIPKASNNRETSSAMGCYSAITEQ
YNNLEESQKRYTEGKKARHKDVPLRERGKDEVGEGKEEEEEQGEEEEEEGGEEEEEGEEE
EEEKGEEEEEGEEEEEGEERE

>gi568815584r:72571139_72871893|GENSCAN_predicted_CDS_1|426_bp
atggaggacacccgccggcttcctgcactgggtccagagctgtctgctcatggggacaca
gagaaagatggattctactccactttgcttcacaaatcctgcagctttgacatcccaaaa
gcttccaacaacagggagacatcatcggccatgggatgttactcggcaataacagagcaa
tacaacaacctagaagaatctcaaaaacgttacactgagggaaaaaaagccagacataaa
gacgtacccctgcgggaaagaggcaaagatgaagtgggggaaggaaaggaggaggaggag
gagcagggagaggaagaggaggaggaggggggagaggaggaggaggagggggaggaggaa
gaggaggagaagggggaagaggaggaggagggggaggaggaggaagaaggggaggaaagg
gaatag

>gi568815584r:72571139_72871893|GENSCAN_predicted_peptide_2|63_aa
MAPTSASCKDLRKLSIEEEQRCMFRAVIQAILMDTTHRLMWLQTWRIKLDSNFRTSQWKQ
RAN

>gi568815584r:72571139_72871893|GENSCAN_predicted_CDS_2|192_bp
atggcaccaacatctgcttcttgtaaggacctcaggaagctttcaattgaggaggagcag
aggtgcatgttcagagcggtcatccaagccattttgatggacacaactcaccgtctcatg
tggctacagacttggaggataaaacttgacagcaactttagaacttcccagtggaagcaa
cgggccaactga

>gi568815584r:72571139_72871893|GENSCAN_predicted_peptide_3|231_aa
MRKGNLLLSWLLGPELPELSPRARKADLKDENLKFSCWWEPRKTAGVLTWPFLAELAEVG
VLADGMYLGAVSVAQQSLGVLSLTVAKGLLSEVVGIFLDALRKGSALAMLRALMGLESAS
FFKAYNPSNERGNTRHHIMGLAELEALENGHPTCLQFTLNMTEAVKTYKWQCIECKSCIL
CGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGNDDEVNNKNSNRYYYY

>gi568815584r:72571139_72871893|GENSCAN_predicted_CDS_3|696_bp
atgaggaaagggaaccttctgctgagctggcttctggggcctgagcttccagagctgtcc
ccaagggctaggaaggccgacctgaaggatgagaacctcaaattcagttgctggtgggag
ccaaggaagacggcgggtgttctaacatggccctttctggctgagctggcggaagtgggc
gttttggccgatgggatgtatctcggcgctgtgtctgtggcccagcaaagccttggagtc
ctgtccctgacagtggccaaagggcttttatctgaggttgtgggcatctttctggatgca
cttcgtaaaggcagtgcactggcgatgctgagggcactcatgggcttggagtcagcatcc
ttctttaaagcatataatccttcaaatgaaagaggcaacacaaggcaccatataatgggt
ttggcagaactggaagctctggagaacggtcacccaacctgcctgcagtttaccctgaac
atgaccgaggctgtcaagacctacaagtggcagtgcatagagtgcaaatcctgtatcctc
tgtgggacctcagagaatgatgaccagctactcttctgcgatgactgtgaccgaggctat
cacatgtactgtttaaatcccccggtggctgagcccccagaaggtaatgatgatgaagtc
aacaataagaacagcaacagatactactactactag

>gi568815584r:72571139_72871893|GENSCAN_predicted_peptide_4|30_aa
MEKATHLTDSPKNDRLRGPSQKIAVIYIFV

>gi568815584r:72571139_72871893|GENSCAN_predicted_CDS_4|93_bp
atggagaaggccacccacctcactgacagtcccaagaatgaccggctgcgaggtcctagt
caaaagatagcagtcatctacatttttgtgtga

>gi568815584r:72571139_72871893|GENSCAN_predicted_peptide_5|92_aa
MGPLFSLAPAESLDAWDAALVSAEPRRVAVADGWFVPASHSFSQKGPDGTVIPNNYCDFC
LGGSNMNKKSGRPEELVSCADCGRSVQMNDMA

>gi568815584r:72571139_72871893|GENSCAN_predicted_CDS_5|279_bp
atggggcctctgttctcactggctcctgctgagtccctggatgcttgggacgcagctctg
gtctcagcagaacccagacgagtggctgttgccgacggctggtttgtccctgccagtcat
tctttctcccagaaaggaccggatggaacagtcattcccaataactactgtgacttctgc
ttggggggctccaacatgaacaagaagagtgggcggcctgaagagctggtgtcctgcgca
gactgtggacgctctgtgcaaatgaacgacatggcgtaa

>gi568815584r:72571139_72871893|GENSCAN_predicted_peptide_6|531_aa
MKQERLGDQFYKEAIEHCRSYNSRLCAERSVRLPFLDSQTGVAQNNCYIWMEKRHRGPGL
APGQLYTYPARCWRKKRRLHPPEDPKLRLLEIKPGQCPLELGWVCDGKGHADARVSEPAA
IHLEACTGSFVSSGWPHEKGSWSELCWCSAGLWELGLAEEVGWQQAVGRTVARLLLSVHG
HVLQWDLAEPRVEHEFCIYLVCWQLSDSSRLHPVNELGGSNNPVEALLSVHWVAGLGHLC
QKLVALICTQEDGCPVCSPKEKKPLLFAGNFCVCICSVLMDARLPLPHGLVRLPLFLVEE
VESAEVELPLKKDGFTSESTTLEALLRGEGVEKKVDAREEESIQEIQRVLENDENVEEGN
EEEDLEEDIPKRKNRTRGRARGSAGGRRRHDAASQEDHDKPYVCDRIQLKISIKGGLKEV
CGKRYKNRPGLSYHYAHTHLASEEGDEAQDQETRSPPNHRNENHRRPYFQTKSQYELCLQ
QSPWLIPRLVLLLKGQSGLLEPALTFPAPAHAALCCKLSNPLPLLIAWPPS

>gi568815584r:72571139_72871893|GENSCAN_predicted_CDS_6|1596_bp
atgaagcaggaaaggctcggggaccagttctacaaggaagccattgagcactgccggagt
tacaactcacggctgtgtgcagagcgcagcgtgcgtcttcccttcctggactcacagact
ggggtggcccagaacaactgctacatctggatggagaagaggcaccgaggcccaggcctt
gccccgggccagctgtatacataccctgcccgctgctggcgcaagaagagacgattgcac
ccacctgaagatccaaaactgcggctgctggagataaaacctggtcagtgccctctggag
cttggctgggtctgtgatgggaaaggtcatgctgatgcaagagtgtcagagcctgcagcc
atccatctggaggcctgcaccggcagcttcgtcagctctggttggcctcatgagaagggg
agctggagcgagctctgctggtgttctgcagggctctgggagcttgggctggccgaggag
gtgggctggcagcaggcagtgggcaggacagtggcacggctgctgctgagtgtccacggt
catgttttgcagtgggacctggcagagccacgtgtagagcatgagttctgcatttacctg
gtctgctggcagctcagtgactcatctcgcctgcatcctgttaatgagctcggtggctcc
aacaacccagtggaagctttgctttcagtgcattgggtggcaggcttggggcacctgtgt
caaaagcttgtggctttgatctgcacccaagaggatggctgccctgtctgtagccctaaa
gagaagaagccattgctctttgctggaaatttctgtgtgtgcatttgttcagttttaatg
gatgcccgcttgccgcttcctcatggattggtcagactccctctgtttctggtggaggaa
gtggaaagtgcagaagtggagcttcccctgaagaaggatgggttcacctcagagagcacc
acgctggaagccttgctccgtggcgagggggttgagaagaaggtggatgccagggaggag
gaaagcatccaggaaatacagagggttttggaaaatgatgaaaatgtagaagaagggaat
gaagaagaggatttggaagaggatattcccaagcgaaagaacaggactagaggacgggct
cgcggctctgcagggggcaggaggaggcacgacgccgcctctcaggaagaccacgacaaa
ccttacgtctgtgacagaattcagcttaaaataagcatcaagggtggcctgaaagaagtc
tgtggcaagcgctacaagaaccgaccggggctcagctaccactatgctcacactcacctg
gccagcgaggagggggatgaagctcaagaccaggagactcggtccccacccaaccacaga
aatgagaaccacaggcgaccctacttccaaaccaagtcacagtatgagctttgtcttcag
cagagcccatggctcatcccgcgcctggtgctgctgctgaaagggcagagtggactgctg
gagccggcgctgaccttccccgcccccgcccatgctgccctgtgctgcaagctcagcaat
ccactgccactcctgatcgcctggcccccaagctga

>gi568815584r:72571139_72871893|GENSCAN_predicted_peptide_7|66_aa
MAAASRPPILLDHRMQEDYTVSWISSLQEQELPCIDDAGKFNVIFIGPAQQSRCGSLITN
NNASSM

>gi568815584r:72571139_72871893|GENSCAN_predicted_CDS_7|201_bp
atggcagctgcatctcgaccaccaatcctactggaccacaggatgcaggaggactacact
gtgtcctggatctcgtcattacaagagcaggagctcccttgcatagatgatgctggcaaa
ttcaacgtgattttcattggacctgcccagcagtccagatgtgggagcctaataacaaat
aacaatgccagctccatgtga

>gi568815584r:72571139_72871893|GENSCAN_predicted_peptide_8|91_aa
MKDFHLAVPDLGSELSYTDLKTWPKAGTEVVEEPYLECGASDFPYWAEVSGKGKGSGCIL
SSWQPLLQPTRDPEFSMGTLELEDTSCRYHV

>gi568815584r:72571139_72871893|GENSCAN_predicted_CDS_8|276_bp
atgaaggatttccatttagcagtgccagacctgggctcggagctcagctacactgatctc
aagacttggcccaaggcaggaactgaggttgtggaagagccatacttggagtgtggagct
tctgacttcccatactgggcagaagtctctggaaagggaaagggatctggctgcatcctt
agcagctggcagcccctgctgcagccaactagggaccccgagttctccatgggaactctg
gagctagaggatacatcttgcaggtatcatgtctga

>gi568815584r:72571139_72871893|GENSCAN_predicted_peptide_9|79_aa
MGKEFGAKPIRDKGMLGPCLDLRFGGLLPFLVSWFEAFSKEPPKSKDSVERLMPRGDALP
EASPEADAGTVVALQPAEL

>gi568815584r:72571139_72871893|GENSCAN_predicted_CDS_9|240_bp
atggggaaagaatttggagccaagcccatcagggacaaaggaatgctgggaccttgtttg
gatctcagatttggtggtttgctccccttcctggtgtcatggtttgaagcattttcgaag
gagcccccgaaaagcaaggacagtgtggagaggctgatgcccagaggcgatgcgcttcct
gaagcctcaccagaagcagatgccggcaccgtggttgctctacagcctgcagaactgtga

>gi568815584r:72571139_72871893|GENSCAN_predicted_peptide_10|88_aa
MYVKRLVQNLAHKAPTPSENTMISLLFLTGFFFGPGFPQQTWRPSSPRGTRVVTAGPRDV
WECGLRRVYLEHNAQVLGVEESNKGDEP

>gi568815584r:72571139_72871893|GENSCAN_predicted_CDS_10|267_bp
atgtatgtgaagcgcttagtacagaacctggcacacaaggctccaaccccttctgagaac
acaatgatcagcctcctgttcctcactggcttcttctttggacctggatttccacagcag
acctggaggccatcatctccccgaggcacccgtgtggtgacggctggccctcgggatgtc
tgggaatgtggtctccggcgggtgtatcttgaacacaacgcacaggttttaggtgtcgaa
gagtcaaataagggggatgaaccatga