GENSCAN 1.0 Date run: 8-Nov-116 Time: 00:03:38 Sequence gi568815594r:174391959_174622451 : 230493 bp : 37.30% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 1349 1344 6 1.05 1.02 Term - 2009 1821 189 2 0 58 42 93 0.116 -1.83 1.01 Init - 9670 9482 189 1 0 73 98 171 0.657 15.66 1.00 Prom - 20423 20384 40 -3.65 2.05 PlyA - 21187 21182 6 1.05 2.04 Term - 21973 21363 611 2 2 48 38 219 0.803 6.57 2.03 Intr - 22766 22122 645 0 0 44 40 228 0.157 4.62 2.02 Intr - 24594 24347 248 2 2 -41 58 178 0.086 -1.92 2.01 Init - 25058 24673 386 2 2 88 44 336 0.459 25.56 2.00 Prom - 36786 36747 40 -5.55 3.03 PlyA - 37084 37079 6 1.05 3.02 Term - 41049 40917 133 2 1 92 52 94 0.932 2.78 3.01 Init - 42777 42614 164 0 2 46 91 78 0.453 3.15 3.00 Prom - 45918 45879 40 -5.35 4.03 PlyA - 46724 46719 6 1.05 4.02 Term - 50235 49634 602 1 2 25 47 256 0.848 8.90 4.01 Init - 50600 50390 211 2 1 49 57 175 0.896 9.49 4.00 Prom - 51135 51096 40 -7.05 5.05 PlyA - 51258 51253 6 1.05 5.04 Term - 53146 53048 99 1 0 94 49 89 0.239 2.75 5.03 Intr - 82819 82684 136 0 1 92 68 136 0.583 11.65 5.02 Intr - 91287 91181 107 0 2 47 115 46 0.515 1.29 5.01 Init - 95517 95380 138 0 0 91 75 109 0.659 10.09 5.00 Prom - 98649 98610 40 -7.95 6.08 PlyA - 99079 99074 6 1.05 6.07 Term - 100136 99998 139 1 1 86 47 102 0.932 2.35 6.06 Intr - 101356 101193 164 1 2 52 83 96 0.925 3.35 6.05 Intr - 103666 103590 77 2 2 93 77 46 0.046 2.32 6.04 Intr - 116834 116738 97 2 1 43 110 92 0.399 5.56 6.03 Intr - 126119 126013 107 0 2 30 97 55 0.484 -0.39 6.02 Intr - 130109 129986 124 2 1 100 94 174 0.984 18.44 6.01 Init - 130493 130401 93 2 0 87 121 162 0.994 19.86 6.00 Prom - 138548 138509 40 -5.55 7.00 Prom + 138701 138740 40 -6.15 7.01 Init + 147111 147201 91 2 1 70 66 87 0.507 5.40 7.02 Term + 148358 148584 227 0 2 90 39 98 0.831 1.06 7.03 PlyA + 149073 149078 6 1.05 8.05 PlyA - 149695 149690 6 1.05 8.04 Term - 155908 155688 221 2 2 74 47 117 0.090 2.42 8.03 Intr - 163220 163158 63 0 0 60 101 80 0.038 4.27 8.02 Intr - 171855 171696 160 0 1 31 83 97 0.006 2.14 8.01 Init - 179008 178976 33 1 0 77 82 49 0.006 1.24 8.00 Prom - 185279 185240 40 -3.35 9.00 Prom + 212074 212113 40 -3.35 9.01 Sngl + 227214 227519 306 2 0 88 48 201 0.975 11.72 9.02 PlyA + 227595 227600 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 174991 175066 76 0 1 95 89 51 0.944 7.20 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594r:174391959_174622451|GENSCAN_predicted_peptide_1|125_aa MGPGDQRSAYGGPTLTPASEFPYYLLIIIGHFPERGMWQDNRIIVERRSAGKQVNECLCI INKKQKNNCKTHMGPQETPIAKAFITKNNKAKGIIQLDFKIYCKAVVIKTECTGIKTDTL TNRMV >gi568815594r:174391959_174622451|GENSCAN_predicted_CDS_1|378_bp atgggcccaggggaccagcgttcagcatacggaggacccacgctgacaccggcctctgaa ttcccttactatttattgatcattatcgggcatttcccggagagggggatgtggcaggac aataggataatagtggagagaagatcagcaggtaaacaggtgaacgaatgtctctgcatc ataaacaagaaacagaaaaacaattgtaaaactcatatgggtccacaagagaccccaata gccaaagcattcataaccaaaaataacaaagctaaaggcatcatacaactggatttcaaa atatattgcaaagctgtagtaatcaaaacagaatgtactggcataaaaacagacacactg accaaccgaatggtatag >gi568815594r:174391959_174622451|GENSCAN_predicted_peptide_2|629_aa MGKKQSRKAENSKNQSTSPTPKERSSSPAMEQSWTENDFDELREEGFRRSNFSQLKEEVQ THHKEAKNLEKRLDKWLTRITSVEKSLNDLMELKTTARELRDEYTSVSSQFDQLEERVSV IEDQMNAVKPNLHPIGVPESDGENGTKLENTLQDIIQENFPNLARQANIQIQEIQRTPQS YSSRRATPRHIIVRFTKFEMKEKMLRAARKKEIQTTIREYYKHLYANKLENLEEMDKFLD TYILPRLNQEEVESLNRRITGSEVKAIINSLPTKKSPGPEVFTAKFYQRYKEELVPFLLK LFQSIEKEGILPNSFYEASIILIPKPGRDTTKKENFRTISLMNNDAKILNKILANQIQQH IKKLIHHDQVGFIPGMQGWFNISKSINIIQHINRTKDKNHMIISIDAKKAFDKIQQPFML KTLNKLVLEVLARAIRQDKEIKGIQLGKEEVKLSLFADDMTVYLENPIVSAQNLLKLISN FSKVSGYKINVQISQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDMKDLFKEN YKPLLNEIKEDTNKWKNIPFSWIGRISIMKMTILPKVIYRFNAILHQATNDFLHRIGKNY FKVHMEPKKSPHCQNNPKPKEQSWRHHAT >gi568815594r:174391959_174622451|GENSCAN_predicted_CDS_2|1890_bp atggggaaaaaacagagcagaaaagctgaaaattctaaaaatcagagcacctctcccact ccaaaggaacgcagctcctcgccagcaatggaacaaagctggacggagaatgactttgac gagttgagagaagaaggcttcagacgatcaaacttctcccagctaaaggaggaagttcaa acccatcacaaagaagctaaaaaccttgaaaaaagattagacaaatggctaactagaata accagtgtagagaagtccttaaatgacctgatggagctgaaaaccacggcacgagaacta cgtgatgaatacacaagcgtcagtagccaatttgatcaactggaagaaagggtatcagtg attgaagatcaaatgaatgcagtgaaaccaaatctacatccgattggtgtacctgaaagt gatggagagaatggaaccaagttggaaaacactctgcaggatattatccaggagaacttc cccaacctagcaaggcaggcaaacattcagattcaggaaatacagagaacgccacaaagt tactcctcgagaagagcaactccaagacacataattgtcagattcaccaaatttgaaatg aaggaaaaaatgttaagggcagccagaaagaaagaaatacaaactaccatcagagaatac tacaaacacctctatgcaaataaactagaaaatctagaagaaatggataaattcctggac acatacatcctcccaagattaaaccaggaagaagttgaatccctgaatagacgaataaca ggctctgaagttaaggcaataattaatagcctaccaaccaaaaaaagtccaggaccagaa gtattcacagccaaattctaccagaggtacaaggaggagctggtaccattccttctgaaa ctattccaatcaatagaaaaagagggaatcctccctaactcattttatgaggccagcatc attctgataccaaagccgggcagagacacaacaaaaaaagagaattttagaacaatatcc ctgatgaacaacgatgcaaaaatcctcaataaaatactggcaaaccaaatccagcagcac atcaaaaagcttatccaccatgaccaagtgggcttcatccctgggatgcaaggctggttc aacataagcaaatcaataaacataatccagcatataaacagaaccaaagacaaaaaccac atgattatctcaatagatgcaaaaaaggcgtttgacaaaattcaacaacccttcatgcta aaaactctcaataaactagtgttggaagttctggccagggcaatcaggcaggacaaagaa ataaagggtattcaattaggaaaagaggaagtcaaattgtccctgtttgcagatgacatg actgtatatttagaaaaccccatcgtctcagcccaaaatctccttaagctgataagcaac ttcagcaaagtctcaggatacaaaatcaatgtgcaaatatcacaagcattcttatacacc aataacagacaaacagagagtcaaatcatgagtgaactcccattcacaattgcttcaaag agaataaaatacctaggaatccaacttacaagggatatgaaggacctcttcaaggagaac tacaaaccactgctcaacgaaataaaagaggatacaaacaaatggaagaacattccattc tcatggataggaagaatcagtatcatgaaaatgaccatactgcccaaggtaatttataga ttcaatgccatcctccatcaagctaccaatgactttcttcacagaattggaaaaaactac tttaaagttcatatggaaccaaaaaagagcccgcattgccaaaacaatcctaagccaaaa gaacaaagctggaggcatcatgctacctga >gi568815594r:174391959_174622451|GENSCAN_predicted_peptide_3|98_aa MNWMLMADPVCSHYGVQAMWILTTFPSEGLSSEFMCLKALESLFAHPAATGNHFRRISHG GYAEGMQSPTFHYRIQRQKTHDPSSVNETVPLLLMWTK >gi568815594r:174391959_174622451|GENSCAN_predicted_CDS_3|297_bp atgaattggatgttaatggctgacccagtgtgctcccattatggagttcaagccatgtgg attctaactacattccccagcgaaggattgagctctgaattcatgtgcctaaaggcctta gagtctctttttgcacacccagcagccacaggaaaccatttcaggagaatctcccatggg ggctatgcagaagggatgcagagccccaccttccactacagaatccaaaggcagaaaaca catgacccaagttcagtcaatgagactgtcccactacttttgatgtggaccaaatga >gi568815594r:174391959_174622451|GENSCAN_predicted_peptide_4|270_aa MWKQHWNWVTGRDWNSLKGLEEDEKIWESFDLPRDLLNGFAQNADTHMDNKIQAEVFSDG HEELVGNWSKGKQIIKVWKICTLTVGLKKKKFSGEKFKPAVEICISNKEPNVNFQDHGEN VSRPCQRTSWQALPSQAQRPRRKKWFRGLGPGSSCCVQSRDLVPCVQAAPAVAERSQRTA QAVASEGRSSKPWQLPCGVESVDAQKSRIEVWEPPPRFQKMYGNGCMPRQKFAEGVGFSW RTSARTVQKENVGSEPPHRIPAGALPSKKI >gi568815594r:174391959_174622451|GENSCAN_predicted_CDS_4|813_bp atgtggaagcaacattggaactgggtaacaggcagagattggaacagtttgaagggcttg gaagaagacgagaaaatatgggaaagttttgatcttcctagagacttgttgaatggcttt gcccaaaatgctgatacacatatggacaataaaatccaggctgaggtcttctcagatgga catgaggaacttgttgggaactggagcaaagggaagcagatcataaaagtttggaaaatt tgcaccctgactgtgggattaaaaaaaaaaaaattttctggggagaaattcaagccagct gtagaaatttgcataagtaacaaggagcctaatgttaatttccaagaccatggggaaaat gtctccaggccatgtcagagaacttcatggcaggccctcccatctcaggcccagaggccc agaaggaaaaagtggtttcgaggactgggcccagggtcctcatgctgtgtgcagtctagg gacttagtgccctgtgtccaagccgctccagccgtggctgaaaggagccaacgtacagct caggctgtggcttcagagggtagaagctccaagccttggcagcttccatgtggtgttgag tctgtggatgcacagaagtcaagaattgaggtttgggaacctccacctagatttcagaag atgtatggaaatggctgcatgcctaggcaaaagtttgctgaaggggtgggattctcatgg agaacctctgctaggacagtgcaaaaggaaaacgtggggtcagagcctccacacagaatc cctgctggtgcactgcctagtaaaaaaatttga >gi568815594r:174391959_174622451|GENSCAN_predicted_peptide_5|159_aa MGNEATQQQQEGSITSRTGAIQGTTSAKEPMGGCQSKLGLQSRGCQSCLNGPTWVQGWAP VLGNVNSAWKPTFQQQRYPLERCCKRLEAVGFQPRRGTCKTAPVAVAVKFVLALCYRKEV FQCLGQWNHCPVLPVNQCVKKGVYDILPAGGHDQDRFIN >gi568815594r:174391959_174622451|GENSCAN_predicted_CDS_5|480_bp atgggtaatgaggcaacccagcaacagcaggaaggctctatcacttcaagaactggtgca attcagggtactaccagtgccaaggaaccaatgggaggctgccaatcaaagctggggcta cagagcaggggctgtcagagctgcttaaacggccccacctgggtgcaaggctgggctcca gtgctgggaaatgtaaattctgcctggaaacccactttccagcaacagcgctaccctttg gaaagatgttgtaaaagactggaggcagttggcttccagccacgacgtggcacatgcaaa acagcaccagtggcggtagctgtgaaatttgtgcttgccttatgttaccgaaaggaggta ttccagtgccttgggcaatggaatcactgtcctgtgctgcctgttaaccaatgtgtgaaa aaaggcgtttacgatattttgccagcagggggtcatgatcaggaccgatttatcaattag >gi568815594r:174391959_174622451|GENSCAN_predicted_peptide_6|266_aa MHVNGKVALVTGAAQGIGRAFAEALLLKGAKVALVDWNLEAGVQCKAALDEQFEPQKTLF IQCDVADQQQLRDTFRKVVDHFGRLDILVNNAGVNNEKNWEKTLQINLVSVISGTYLGLD YMSKQNGGEGGIIINMSSLAGLMPVAQQPVYCASKHGIVGFTRSAALAANLMNSGVRLNA ICPGFVNTAILESIEKEENMGQYIEYKDHIKDMIKYYGILDPPLIANGLITLIEDDALNG AIMKITTSKGIHFQDYDTTPFQAKTQ >gi568815594r:174391959_174622451|GENSCAN_predicted_CDS_6|801_bp atgcacgtgaacggcaaagtggcgctggtgaccggcgcggctcagggcataggcagagcc tttgcagaggcgctgctgcttaagggcgccaaggtagcgctggtggattggaatcttgaa gcaggtgtacagtgtaaagctgccctggatgagcagtttgaacctcagaagactctgttc atccagtgcgatgtggctgaccagcaacaactgagagacacttttagaaaagttgtagac cactttggaagactggacattttggtcaataatgctggagtgaataatgagaaaaactgg gaaaaaactctgcaaattaatttggtttctgttatcagtggaacctatcttggtttggat tacatgagtaagcaaaatggaggtgaaggcggcatcattatcaatatgtcatctttagca ggactcatgcccgttgcacagcagccggtttattgtgcttcaaagcatggcatagttgga ttcacacgctcagcagcgttggctgctaatcttatgaacagtggtgtgagactgaatgcc atttgtccaggctttgttaacacagccatccttgaatcaattgaaaaagaagaaaacatg ggacaatatatagaatataaggatcatatcaaggatatgattaaatactatggaattttg gacccaccattgattgccaatggattgataacactcattgaagatgatgctttaaatggt gctattatgaagatcacaacttctaagggaattcattttcaagactatgatacaactcca tttcaagcaaaaacccaatga >gi568815594r:174391959_174622451|GENSCAN_predicted_peptide_7|105_aa MGNLSGGRGEMAYATETGSDLNLARVLACAGGASWDLQVTKKGDPGTGIVSLGVPEVEAE SLLGGASPQTCLFAVGRRPPHIPSHTHHLKPDSVSSEYDHLLPIF >gi568815594r:174391959_174622451|GENSCAN_predicted_CDS_7|318_bp atggggaacctgtcaggtggtagaggtgaaatggcttatgccacagaaactggcagtgac ctaaatctggcaagggtcctggcctgcgcaggtggagcgagctgggacttgcaggtgacc aagaagggggatcccggaaccggaattgtctcccttggcgttccggaagtagaagcagag tcattgctgggtggcgccagccctcagacttgcctctttgcagtaggaagaaggcctccc cacataccttcccacactcatcaccttaagccagactcggtgtccagtgaatatgaccat ctcttgcccattttctaa >gi568815594r:174391959_174622451|GENSCAN_predicted_peptide_8|158_aa MAPVSMVLQGLLWMTAKGTEGNLHGDERFDKKEQWLEAQAITILYLQKQREPATLTKGTS GTHRAGSFWWVRGLADFKNGAADLRGTRCKLSVDLPFQGLQDGGPLLTAPLGSAPVGTLC GCSDPIFPFHTLLAEVLHQGFTPAADFYLDSQAFPYIL >gi568815594r:174391959_174622451|GENSCAN_predicted_CDS_8|477_bp atggcgcccgtgagcatggtactgcaaggtctgctgtggatgactgcaaaaggcacagaa ggaaatcttcatggtgacgaaaggtttgataaaaaagaacaatggctagaagctcaagca attacaattctttacctacagaaacagagagaaccagccaccttgaccaaaggaacatct ggcacacatagagctggttccttctggtgggttcgtggtcttgctgacttcaagaatgga gccgcggaccttcgcggcacacggtgcaagctgtcggtggatctaccattccagggtctg caggatggtggccctcttctcacagctccactaggcagtgccccagtggggacactgtgt gggtgctctgaccccatatttcccttccacactctcctagcagaggttctccatcagggc ttcacccctgcagcagatttctacctggacagccaggcctttccatacatcctttga >gi568815594r:174391959_174622451|GENSCAN_predicted_peptide_9|101_aa MGRSQHKKAENSKRQNAFSLPQDNNSLPAREQSWKKNEFDELTELGFRRWVITNSSELKE QVLTQCKEAKNLEKTLDELLARITSVEKNINDLTELKNFTS >gi568815594r:174391959_174622451|GENSCAN_predicted_CDS_9|306_bp atggggagaagccagcacaaaaaggctgaaaattccaaaagacagaatgccttttctctt ccacaggataacaactccttgccagcaagggaacaaagctggaagaaaaatgagtttgat gaattgacagaactcggcttcagaaggtgggtaataacaaactcctccgagctaaaggaa caggttctaacccaatgcaaggaagctaagaaccttgaaaaaacgttagatgaactgcta gctagaataaccagtgtagagaagaacataaatgacctgacggagctgaaaaacttcact tcatga