GENSCAN 1.0    Date run: 6-Nov-116    Time: 02:23:16

Sequence gi568815591r:101215137_101421709 : 206573 bp : 50.51% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.05 Intr -    884    772  113  0  2   74   49   247 0.997  19.50
 1.04 Intr -   1190   1027  164  1  2  103   89   254 0.983  26.62
 1.03 Intr -   1410   1274  137  0  2   77   65   154 0.999  11.37
 1.02 Intr -   1650   1559   92  1  2   76   76   161 0.988  13.41
 1.01 Init -   2138   2030  109  2  1  106   94   228 0.988  23.58
 1.00 Prom -   4151   4112   40                              -4.06

 2.00 Prom +   4903   4942   40                             -12.30
 2.01 Init +   5398   5476   79  0  1   57   92     1 0.513  -1.38
 2.02 Intr +   7468   7638  171  2  0   75   81   295 0.921  27.41
 2.03 Intr +   8341   8420   80  2  2   46   94    75 0.999   3.17
 2.04 Intr +   8537   8706  170  1  2   73   99   147 0.993  13.04
 2.05 Term +   8801   8822   22  2  1  143   52    17 0.958   1.38
 2.06 PlyA +   9033   9038    6                               1.05

 3.06 PlyA -   9293   9288    6                               1.05
 3.05 Term -  17379  17274  106  2  1   94   43   194 0.999  13.38
 3.04 Intr -  17584  17468  117  0  0   93   99    73 0.997   8.48
 3.03 Intr -  17778  17697   82  1  1   90   89   182 0.806  17.20
 3.02 Intr -  19306  19142  165  2  0   49   50   184 0.792  10.63
 3.01 Init -  22445  22229  217  2  1   96   60   377 0.981  34.35
 3.00 Prom -  22898  22859   40                             -10.74

 4.06 PlyA -  24501  24496    6                               1.05
 4.05 Term -  24767  24670   98  0  2  141   49   121 0.999  11.63
 4.04 Intr -  25111  25006  106  1  1   28   56   240 0.663  14.59
 4.03 Intr -  25770  25694   77  1  2  108  100   149 0.998  17.33
 4.02 Intr -  29003  28871  133  2  1   93   72   301 0.949  29.22
 4.01 Init -  29868  29824   45  0  0  101   81   129 0.860  14.19
 4.00 Prom -  58091  58052   40                              -5.26

 5.04 PlyA -  58137  58132    6                               1.05
 5.03 Term -  63759  63656  104  1  2  109   34    61 0.118   1.24
 5.02 Intr -  72607  72347  261  2  0  116   73   274 0.717  26.16
 5.01 Init -  74628  74553   76  0  1   72  101   127 0.795  11.66
 5.00 Prom -  77235  77196   40                              -8.96

 6.09 PlyA -  77318  77313    6                               1.05
 6.08 Term -  79782  79676  107  1  2   92   49    66 0.011   1.67
 6.07 Intr -  87578  87550   29  1  2   79   89    51 0.001   2.06
 6.06 Intr -  88075  87952  124  2  1  100   32   134 0.082   8.64
 6.05 Intr -  93660  93481  180  1  0   99   46    86 0.080   5.34
 6.04 Intr - 101406 101204  203  2  2  147   47   148 0.801  15.53
 6.03 Intr - 103077 102988   90  2  0  120   44   105 0.985   8.41
 6.02 Intr - 103896 103820   77  0  2   60   83    65 0.816   1.51
 6.01 Init - 106573 106535   39  1  0   60  107    65 0.832   5.91
 6.00 Prom - 123836 123797   40                              -3.16

 7.00 Prom + 140960 140999   40                              -5.06
 7.01 Init + 147897 148054  158  2  2   89  117   396 0.996  39.98
 7.02 Intr + 163776 163883  108  0  0   40   64    83 0.074   0.50
 7.03 Intr + 164384 164421   38  2  2  128   92    32 0.737   5.51
 7.04 Intr + 178338 178379   42  1  0   80   59    65 0.004   1.01
 7.05 Intr + 184776 184829   54  1  0   86   80    26 0.138   0.55
 7.06 Intr + 185757 185892  136  1  1   55   89    23 0.112  -1.27
 7.07 Intr + 186729 186820   92  0  2   78   97    62 0.633   5.74
 7.08 Intr + 190667 190769  103  0  1   17   75    60 0.081  -3.17
 7.09 Intr + 204841 204963  123  1  0  115   98   152 0.675  18.60

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  87678  87735   58  2  1   91   94   110 0.880  11.38
S.002 Term - 100146  99998  149  1  2   71   42   102 0.847   1.96

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591r:101215137_101421709|GENSCAN_predicted_peptide_1|205_aa
MTSSGPGPRFLLLLPLLLPPAASASDRPRGRDPVNPEKLLVITVATAETEGYLRFLRSAE
FFNYTVRTLGLGEEWRGGDVARTVGGGQKVRWLKKEMEKYADREDMIIMFVDSYDVILAG
SPTELLKKFVQSGSRLLFSAESFCWPEWGLAEQYPEVGTGKRFLNSGGFIGFATTIHQIV
RQWKYKDDDDDQLFYTRLYLDPGLR

>gi568815591r:101215137_101421709|GENSCAN_predicted_CDS_1|615_bp
atgacctcctcggggcctggaccccggttcctgctgctgctgccgctgctgctgccccct
gcggcctcagcctccgaccggccccggggccgagacccggtcaacccagagaagctgctg
gtgatcactgtggccacagctgaaaccgaggggtacctgcgtttcctgcgctctgcggag
ttcttcaactacactgtgcggaccctgggcctgggagaggagtggcgagggggtgatgtg
gctcgaacagttggtggaggacagaaggtccggtggttaaagaaggaaatggagaaatac
gctgaccgggaggatatgatcatcatgtttgtggatagctacgacgtgattctggccggc
agccccacagagctgctgaagaagttcgtccagagtggcagccgcctgctcttctctgca
gagagcttctgctggcccgagtgggggctggcggagcagtaccctgaggtgggcacgggg
aagcgcttcctcaattctggtggattcatcggttttgccaccaccatccaccaaatcgtg
cgccagtggaagtacaaggatgatgacgacgaccagctgttctacacacggctctacctg
gacccaggactgagg

>gi568815591r:101215137_101421709|GENSCAN_predicted_peptide_2|173_aa
MILSHTAVVRVKWFKCCKELSTCLAHIRSQDPGQRRVLDRAARQRRINRQLEALENDNFQ
DDPHAGLPQLGKRLPQFDDDADTGKKKKKTRGDHFKLRFRKNFQALLEEQNLSVAEGPNY
LTACAGPPSRPQRPFCAVCGFPSPYTCVSCGARYCTVRCLGTHQETRCLKWTV

>gi568815591r:101215137_101421709|GENSCAN_predicted_CDS_2|522_bp
atgatactgtcccatacagctgttgtaagggttaaatggttcaagtgttgcaaagagctg
agcacgtgcctggctcatattcgctcccaggaccccgggcagcggcgggtgctggaccgg
gctgcccggcagcgtcgcatcaaccggcagctggaggccctggagaatgacaacttccag
gatgacccccacgcgggactccctcagctcggcaagagactgcctcagtttgatgacgat
gcggacactggaaagaaaaagaagaaaacccgaggtgatcattttaaacttcgcttccga
aaaaactttcaggccctgttggaggagcagaacttgagtgtggccgagggccctaactac
ctgacggcctgtgcgggacccccatcgcggccccagcgccccttctgtgctgtctgtggc
ttcccatccccctacacctgtgtcagctgcggtgcccggtactgcactgtgcgctgtctg
gggacccaccaggagaccaggtgtctgaagtggactgtgtga

>gi568815591r:101215137_101421709|GENSCAN_predicted_peptide_3|228_aa
MSMAVETFGFFMATVGLLMLGVTLPNSYWRVSTVHGNVITTNTIFENLWFSCATDSLGVY
NCWEFPSMLALSGYIQACRALMITAILLGFLGLLLGIAGLRCTNIGGLELSRKAKLAATA
GALHILAGICGMVAISWYAFNITRDFFDPLYPGTKYELGPALYLGWSASLISILGGLCLC
SACCCGSDEDPAASARRPYQAPVSVMPVATSDQEGDSSFGKYGRNAYV

>gi568815591r:101215137_101421709|GENSCAN_predicted_CDS_3|687_bp
atgtcgatggctgtggaaacctttggcttcttcatggcaactgtggggctgctgatgctg
ggggtgactctgccaaacagctactggcgagtgtccactgtgcacgggaacgtcatcacc
accaacaccatcttcgagaacctctggtttagctgtgccaccgactccctgggcgtctac
aactgctgggagttcccgtccatgctggccctctctgggtatattcaggcctgccgggca
ctcatgatcaccgccatcctcctgggcttcctcggcctcttgctaggcatagcgggcctg
cgctgcaccaacattgggggcctggagctctccaggaaagccaagctggcggccaccgca
ggggccctccacattctggccggtatctgcgggatggtggccatctcctggtacgccttc
aacatcacccgggacttcttcgaccccttgtaccccggaaccaagtacgagctgggcccc
gccctctacctggggtggagcgcctcactgatctccatcctgggtggcctctgcctctgc
tccgcctgctgctgcggctctgacgaggacccagccgccagcgcccggcggccctaccag
gctccagtgtccgtgatgcccgtcgccacctcggaccaagaaggcgacagcagctttggc
aaatacggcagaaacgcctacgtgtag

>gi568815591r:101215137_101421709|GENSCAN_predicted_peptide_4|152_aa
MEAVLNELVSVEDLLKFEKKFQSEKAAGSVSKSTQFEYAWCLVRSKYNDDIRKGIVLLEE
LLPKGSKEEQRDYVFYLAVGNYRLKEYEKALKYVRGLLQTEPQNNQAKELERLIDKAMKK
DGLVGMAIVGGMALGVAGLAGLIGLAVSKSKS

>gi568815591r:101215137_101421709|GENSCAN_predicted_CDS_4|459_bp
atggaggccgtgctgaacgagctggtgtctgtggaggacctgctgaagtttgaaaagaaa
tttcagtctgagaaggcagcaggctcggtgtccaagagcacgcagtttgagtacgcctgg
tgcctggtgcggagcaagtacaatgatgacatccgtaaaggcatcgtgctgctcgaggag
ctgctgcccaaagggagcaaggaggaacagcgggattacgtcttctacctggccgtgggg
aactaccggctcaaggaatacgagaaggccttaaagtacgtccgcgggttgctgcagaca
gagccccagaacaaccaggccaaggaactggagcggctcattgacaaggccatgaagaaa
gatggactcgtgggcatggccatcgtgggaggcatggccctgggtgtggcgggactggcc
ggactcatcggacttgctgtgtccaagtccaaatcctga

>gi568815591r:101215137_101421709|GENSCAN_predicted_peptide_5|146_aa
MVRMVPVLLSLLHLLGPAIPQETQDGHYSLTYLYTGLSRPGKGTHRLQGTVFLNGRAFFH
YNSEDRKPEPLGPWRHVEGVEDWEKQSQVQKAREDIFMETLNNIMEYYNDSNASCVICDR
STPRASSSVTSTRPLSSGNHQLMNKG

>gi568815591r:101215137_101421709|GENSCAN_predicted_CDS_5|441_bp
atggtaagaatggtgcctgtcctgctgtctctgctgcaccttctgggtcctgctatcccc
caggagacccaagatggtcattactctctgacctatctctacactgggctgtccaggcct
ggcaaaggcacccacaggctgcagggtactgtcttcctcaatggccgtgccttcttccac
tacaacagtgaagacaggaagcctgagcccctgggaccatggagacacgtggaaggagta
gaggactgggagaagcagagccaagttcagaaggccagggaggacatctttatggagacc
ctgaacaacatcatggagtattacaatgacagtaacgcctcttgtgttatatgtgatcgg
agcacccccagagcttcgtcatccgtgacgagcacccgccctctctccagtgggaatcat
cagctgatgaataagggctga

>gi568815591r:101215137_101421709|GENSCAN_predicted_peptide_6|282_aa
MLKAKILFVGPCESGKTVLANFLTESSDITEYSPTQGVRILEFENPHVTSNNKGTGCEFE
LWDCGGDAKFESCWPALMKDAHGVVIVFNADIPSHRKEMEMWYSCFVQQPSLQDTQCMLI
AHHKPGSGDDKGSLSLSGGGGNRRQGRSCPRGWTLGVAGAAAAAAQGKEARGLHVGSTWA
PATPLGNLSPAAPYSGAGRRRLQLQLLRAGKPVGSTWAPATPLGNLSPAAPYSGGPRGLV
RKLRVHTGVFGEAVPMPETQGHVGNESCFCVIIHVLPRVPQP

>gi568815591r:101215137_101421709|GENSCAN_predicted_CDS_6|849_bp
atgctgaaagccaagatcctcttcgtggggccttgcgagagtggaaaaactgttttggcc
aactttctgacagaatcttctgacatcactgaatacagcccaacccaaggagtgaggatc
ctagaatttgagaacccgcatgttaccagcaacaacaaaggcacgggctgtgaattcgag
ctatgggactgtggtggcgatgctaagtttgagtcctgctggccggccctgatgaaggat
gctcatggagtggtgatcgtcttcaatgctgacatcccaagccaccggaaggaaatggag
atgtggtattcctgctttgtccaacagccgtccttacaggacacacagtgtatgctaatt
gcacaccacaaaccaggctctggagatgataaaggaagcctgtctttgtctggcggcggc
ggtaaccgcaggcaaggcaggagctgccctcggggctggactctcggggtggcaggggct
gcagctgcagctgctcagggcaaggaagcccgtgggctccacgtgggctccacgtgggct
cctgcaacgccgctggggaatctctcgcccgctgccccctattctggagctgggcggcgg
cggctgcagctgcagctgctcagggcagggaagcccgtgggctccacgtgggctcctgca
acgccgctggggaatctctcgcccgctgccccctactctggaggtccacgaggactggtc
cgcaagctccgtgtgcacacaggggtcttcggagaagcagtacccatgcctgaaactcaa
ggacatgtaggaaatgagagttgcttttgtgtcatcatccatgtccttcctcgggtgccc
cagccttga

>gi568815591r:101215137_101421709|GENSCAN_predicted_peptide_7|285_aa
MKLALLLPWACCCLCGSALATGFLYPFSAAALQQHGYPEPGAGSPGSGYASRRMSAAVRP
LEATRPAKHSLESHLGFPSPRSYEAFERRAFAQIVAPAFVRAALAADEMVPTQIEDIYLS
PTQCQMLPQRSKLINTTDVGPALNPNEGWGDAETGVQGPVVSLQAAVTRVRWTAARLHSG
KLQDLVVSKPRFLSACWGLRELPKIPTLKAAPSTWKTQSISGPASPSRQCAGKEPMLPVG
LALRHWCHHTVTRTVSCQVQNGSETVVQRVYQSCRWPGPCANLVS

>gi568815591r:101215137_101421709|GENSCAN_predicted_CDS_7|855_bp
atgaagctggccctgctcctgccctgggcgtgttgctgcctctgcgggtcggcgctggcc
accggcttcctctatcccttctcggccgcagctctgcagcagcacggctaccccgagccc
ggcgccggctcccctggcagcggctacgcgagccgccggatgtctgcggctgtgcggccc
ctggaggcgaccaggcctgccaaacactcactggagtctcacttgggcttccccagcccc
cgtagttacgaagcctttgagagaagggcctttgcacagattgttgcccctgcctttgtc
agagctgcactggcagctgatgagatggtgcccacccagattgaggatatttacttatca
cccactcaatgccagatgctgccgcagagatccaagttgataaataccactgatgtgggc
cctgcgctaaacccgaacgaggggtggggagatgcggagacgggagttcaagggcctgtg
gtctctctgcaggctgcagttacacgggtcaggtggacagcagcacgtctgcacagcgga
aagctgcaggatctggtggtgtcgaagccacgtttcctctctgcctgctgggggttacgt
gaactgccaaaaattcctaccttgaaagctgccccctccacatggaaaactcaatccatc
agtggccccgccagcccttcccgccagtgcgcaggcaaggagccgatgctgcctgtgggg
ctggctctcaggcactggtgccatcacacagtgacacggacggtgtcctgccaggtgcag
aatggctcggagacggtggtccagcgcgtgtaccagagctgccggtggccggggccctgc
gccaacctcgtaagn