GENSCAN 1.0  Date run: 4-Nov-116  Time: 11:50:51

Sequence gi568815596f:171590156_171847886 : 257731 bp : 39.00% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

1.01  Init +    652    741   90  0  0   36   56   112 0.231   3.34
1.02  Intr +   9716   9833  118  1  1   74   75    88 0.833   5.22
1.03  Intr +  17365  17471  107  2  2   93   82    70 0.274   5.91
1.04  Term +  37049  37216  168  1  0   40   50   155 0.306   3.80
1.05  PlyA +  37951  37956    6                               1.05

2.02  PlyA -  38312  38307    6                               1.05
2.01  Sngl -  52908  51943  966  0  0   60   47   306 0.994  20.14
2.00  Prom -  52968  52929   40                             -10.15

3.04  PlyA -  53115  53110    6                               1.05
3.03  Term -  54578  53559 1020  2  0   44   45   543 0.480  36.72
3.02  Intr -  56419  55893  527  2  2   68   31   424 0.282  26.43
3.01  Init -  62257  62101  157  1  1   47   87    82 0.641   4.12
3.00  Prom -  69808  69769   40                              -5.65

4.00  Prom +  73965  74004   40                              -0.95
4.01  Init +  79525  79551   27  0  0   65   97    13 0.189  -0.45
4.02  Intr +  94841  94932   92  1  2  114   72    61 0.362   4.97
4.03  Intr +  97312  97472  161  1  2   -9  111   154 0.491   6.71
4.04  Intr +  97566  97718  153  1  0   46   94    64 0.767   1.92
4.05  Term +  97911  98125  215  1  2   77   42   141 0.952   4.81
4.06  PlyA +  98325  98330    6                              -3.64

5.00  Prom +  98475  98514   40                              -4.95
5.01  Init + 100001 100108  108  1  0   68   47   126 0.711   6.77
5.02  Intr + 102622 102739  118  0  1   26   87   107 0.598   3.52
5.03  Intr + 135463 135558   96  2  0  124   65   177 0.989  18.06
5.04  Intr + 135764 135926  163  0  1   67   57   124 0.999   5.31
5.05  Intr + 136039 136138  100  1  1   48   98    49 0.687   1.09
5.06  Intr + 136636 136761  126  0  0   19   80   110 0.619   3.26
5.07  Intr + 137666 137812  147  1  0   38   98    75 0.870   3.01
5.08  Intr + 138562 138695  134  0  2   51   97   148 0.990  10.52
5.09  Intr + 139554 139698  145  0  1   25   65   105 0.956   1.26
5.10  Intr + 153894 154034  141  2  0   89   83    88 0.991   8.03
5.11  Intr + 155647 155772  126  0  0   56   64   138 0.979   8.16
5.12  Term + 157621 157734  114  0  0   72   32   243 0.919  14.69
5.13  PlyA + 158235 158240    6                               1.05

6.03  PlyA - 158839 158834    6                               1.05
6.02  Term - 167099 166969  131  0  2   56   38   118 0.693   0.96
6.01  Init - 167928 167661  268  0  1   64   72   220 0.893  15.19
6.00  Prom - 176625 176586   40                              -5.25

7.15  PlyA - 179397 179392    6                               1.05
7.14  Term - 195320 195119  202  1  1  101   45   239 0.670  16.78
7.13  Intr - 197506 197416   91  2  1  105   95    86 0.982   9.13
7.12  Intr - 197792 197634  159  0  0   59   96   162 0.991  13.14
7.11  Intr - 201434 201296  139  2  1   64   93    91 0.995   6.32
7.10  Intr - 203612 203472  141  2  0  109   98   175 0.997  20.23
7.09  Intr - 219531 219451   81  0  0   76   75   103 0.516   6.72
7.08  Intr - 220121 220069   53  0  2   99   93    24 0.959   1.71
7.07  Intr - 223342 223184  159  2  0   76   84   112 0.877   8.64
7.06  Intr - 225047 224966   82  2  1   92  105     9 0.796   1.29
7.05  Intr - 236727 236643   85  2  1   51   80    60 0.410   0.40
7.04  Intr - 243901 243808   94  2  1  100  110    69 0.953   8.40
7.03  Intr - 244710 244572  139  0  1   95   90    84 0.993   8.42
7.02  Intr - 247112 246966  147  2  0   95   89   102 0.996  10.51
7.01  Intr - 254353 254214  140  2  2   96   99    96 0.999  10.76

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:171590156_171847886|GENSCAN_predicted_peptide_1|160_aa MIAAPAAHSSLVRAPEDPAKPCPDSSPTETRTWTNTSGNGNAYPKERLLQFKAKVLLYWI AAATQSKNKGREVKSFPVVEATPVFALDGGKLKLLPENDGESGPNLILLVRGTRTQEPPN GRTERAVTPTGLKHVPHLPVCHTVGNEKGKKAVALQGSQT >gi568815596f:171590156_171847886|GENSCAN_predicted_CDS_1|483_bp atgatagcagcacctgctgctcatagcagccttgtgagagccccagaagacccagccaag ccatgtccagattcttcacccacagaaactaggacctggactaacacatcaggcaatggc aatgcctaccctaaagaacgacttttacagttcaaggccaaagtccttctgtattggata gcagcagcaacccaaagcaagaacaaagggagggaggttaagagcttccctgttgtagag gctactccagtttttgcccttgatgggggtaagctaaagctgcttcctgagaacgatggt gagagtggcccaaatctcattcttcttgtacgtgggacaagaactcaggaaccgccgaat ggcaggactgaaagagctgtaacaccaacagggctgaaacatgtgccccacctccctgtt tgccacaccgtgggcaatgagaaggggaaaaaagctgtggcccttcagggatcccagacc tag >gi568815596f:171590156_171847886|GENSCAN_predicted_peptide_2|321_aa MSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLKEIKADTNKWKNIPCSWVGRINVV KMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFMWNQKRARIAKSILSKKNKAGGITLP DFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEIMPHIYNYLIFDKPEKNKQWGKDSLFN KWCWENWLAIGRKLNLDPFLTPYIKINSRWIKDLHVRSKTIKTLEENLGNTIQDIGMGKD FMSKTQKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTELEKIFAIYSSDKGLISRI YNELKQIYKKKTTPSTSGRRI >gi568815596f:171590156_171847886|GENSCAN_predicted_CDS_2|966_bp atgagtgaactcccattcacaattgcttcaaagagaataaaatacttaggaatccaactt acacgggatgtgaaggacctcttcaaggagaactacaaaccactgctcaaggaaataaaa gcggatacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatgtcgtg aaaatggccatactgcccaaggtaatttatagattcaatgccatccccatcaagctacca atgactttcttcacagaattggaaaaaactactttaaagttcatgtggaaccaaaaaaga gcccgcattgccaagtcaatcctaagcaaaaagaacaaagctggaggcatcacactacct gacttcaaactatactataaggctacagtaaccaaaacagcatggtactggtaccaaaac agagatatagaccaatggaacagaacagagccctcagaaataatgccacatatctacaac tatctgatctttgacaaacctgagaagaacaagcaatggggaaaggattccctatttaat aaatggtgctgggaaaactggctagccataggtagaaagctgaacctggatcccttcctt 
acaccttatataaaaattaattcaagatggattaaagacttacatgttagatctaaaacc ataaaaaccctagaagaaaacctaggcaataccattcaggacataggcatgggcaaggac ttcatgtctaaaacacaaaaagcaatggcaacaaaagccaaaattgacaaatgggatcta attaaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacct acagaattggagaaaatttttgcaatctactcatctgacaaagggctaatatccagaatc tacaatgaactcaaacaaatttacaagaaaaaaacaaccccatcaacaagtgggcgaagg atatga >gi568815596f:171590156_171847886|GENSCAN_predicted_peptide_3|567_aa MTTKVVPLWVCKSHNITRLGVLPNGDMATVTKELDHNTQDPLNTWKVFPRRTASAADTQT NRIWSGPPANSTDLQLRVLTVRRKTNRKDIHTKTPSVRHHHQGPKVDKTTKMGKKQSRKT ENSKKQSASLPPKECSSSPATEQSWTENDFDELREEGFRRSNYSKLKEEVRTHCKEVKNL EKRLDEWLTRITNAGKSLKDLMELKTMARELHDECTSLSSRFHQLEERHHTTPIHKIDHI VGSKALLSKCKRTEIITNCLSDHSAIKLELRIKKLTQNRSTAWKLNNLFLNDYWVRNEMK AEIKMLFETNENKNTTYQNLWDTFKAVCRGKFIALNAHKIKQERSKIDTLTSQLKELEKQ EQTHAKASRRQEITKIRAELKEIETQKTLQKINESKSWLFEKINKIDRLLARLIKKKREK NQTDAIKNGKGDITTNPTEIQTTIREYYKHLYANKLENLEEMDKFLDAYTLPRLNQEEVE SLNRTITGSEIEAIINSLPTKKSPGPDGFTTEFYQRYKEELVPFLLKLSQSTEKEGILPD SFYEASIILITKPVRDTTKKENLDQYL >gi568815596f:171590156_171847886|GENSCAN_predicted_CDS_3|1704_bp atgaccaccaaggtagtacctctatgggtctgcaagagccacaatattactaggcttggg gtgctccctaatggagatatggctacagtgaccaaagagttagatcacaatacacaggac cctttgaatacttggaaagtcttcccaagaaggacagcctctgctgctgatacccagaca aacaggatctggagtggacctccagcaaactccacagacctgcaactgagggtcctgact gttagaaggaaaactaacagaaaggacatccacacaaaaaccccatctgtacgtcaccat catcaaggaccaaaggtagataaaaccacaaagatggggaaaaaacagagcagaaaaact gaaaattctaaaaagcagagcgcctctcttcctccaaaggaatgcagctcctcaccagca acagaacaaagctggacagagaatgactttgacgagttgagagaagaaggcttcagacga tcaaactactccaagctaaaggaggaagttcgaacccattgcaaagaagttaaaaacctt gaaaaaagattagacgaatggctaactagaataaccaatgcagggaagtccttaaaggac ctgatggagctgaaaaccatggcacgagaactacatgatgaatgcacaagcctcagtagc cgattccatcaactggaagaaaggcaccacactacacctattcacaaaattgaccacata gttggaagtaaagcactcctcagcaaatgcaaaagaacagaaattataacaaactgtctc tcagaccacagtgcaatcaaactagaactcaggattaagaaactcactcaaaaccgctca actgcatggaaactgaacaacctgttcctgaatgactactgggtacgtaatgaaatgaag 
gcagaaataaagatgctctttgaaaccaacgagaacaaaaacacaacataccagaatctc tgggacacattcaaagcagtgtgtagagggaaatttatagcactaaatgcccacaagata aagcaggaaagatctaaaattgacactctaacatcacaattaaaagaactagagaagcaa gagcaaacacatgcaaaagctagcagaaggcaagaaataactaagatcagagcagaactg aaggagatagagacacaaaaaacccttcaaaaaatcaatgaatccaagagctggcttttt gaaaagatcaacaaaattgatagactgctagcaagactaataaagaagaaaagagagaag aatcaaacagatgcaataaaaaatggtaaaggggatatcaccaccaatcccacagaaata caaactaccatcagagaatactataaacacctctatgcaaataaactagaaaatctagaa gaaatggataaattccttgatgcatacaccctcccaagactaaaccaggaagaagttgaa tctctgaatagaacaataacaggctcggaaattgaggcaataattaatagcttaccaacc aaaaaaagtccaggaccagatggattcacaaccgaattctaccagaggtacaaggaggag ctggtaccattccttctgaaactatcccaatcaacagaaaaagaaggaatcctccctgat tcattttatgaggccagcatcatcctgataacaaagcctgtcagagacacaaccaaaaaa gagaatttagaccaatatctctga >gi568815596f:171590156_171847886|GENSCAN_predicted_peptide_4|215_aa MCSSQLDWKASCINKGLNKKQNSQDSSHENTLMKGQVTEGSSRSCQFVRRGRKLDSWPEN IRRSFPVFGKAEILKAPGLTGFFVLSTGFCFTTALAVAAAAHPSLCALESRGAGGGYSWC WGLRRGTLPGSCRREDSSPSSHLTEAAWGALRLVRRPLASSGRPAVDRALLAVTPGEKVI RFRLAPFYKLRPSLLALIVSPLTLCVMESPRTRRF >gi568815596f:171590156_171847886|GENSCAN_predicted_CDS_4|648_bp atgtgctcatcacagttagactggaaggcaagctgtatcaacaagggtctcaacaagaaa cagaattcacaagatagttcacatgaaaatactttaatgaaaggacaagttacagagggt tcttctcgatcgtgtcagtttgtaaggcgagggcggaagttggattcctggcctgagaat attaggcgtagttttccagtttttggcaaagcggaaatacttaaggcccctgggttgact gggttctttgttttatctaccggcttctgctttacgacagctctcgccgtagcagccgcc gcccatccctctttgtgtgctttggaaagccgcggagctggtggtggctacagttggtgt tgggggcttaggcgagggacgttaccgggaagttgcaggcgggaggactcttccccatcc agtcacctgacagaggcggcctggggagccttgaggcttgtccggcgcccactggcttct tccggacgccctgccgtggaccgcgccctcttggccgtgaccccaggggagaaagtgatt cgttttcgccttgccccattttacaaattaaggccatctctacttgccttgatagtgtct cccctcaccttatgtgtgatggaaagcccccgaacacggaggttctag >gi568815596f:171590156_171847886|GENSCAN_predicted_peptide_5|505_aa MSDKSELKAELERKKQRLAQIREEKKRKEEERKKKETDQKKEAVAPVQEESDLEKKRREA 
EALLQSMGLTPESPIDEEEDDDVVAPKPPIEPEEEKTLKKDEENDSKAPPHELTEEEKQQ ILHSEEFLSFFDHSTRIVERALSEQINIFFDYSGRDLEDKEGEIQAGAKLSLNRQFFDER WSKHRVVSCLDWSSQYPELLVASYNNNEDAPHEPDGVALVWNMKYKKTTPEYVFHCQSAV MSATFAKFHPNLVVGGTYSGQIVLWDNRSNKRTPVQRTPLSAAAHTDSMELVHKQSKAVA VTSMSFPVGDVNNFVVGSEEGSVYTACRHGSKAGISEMFEGHQGPITGIHCHAAVGAVDF SHLFVTSSFDWTVKLWTTKNNKPLYSFEDNADYVYDVMWSPTHPALFACVDGMGRLDLWN LNNDTEVPTASISVEGNPALNRVRWTHSGREIAVGDSEGQIVIYDVGEQIAVPRNDEWAR FGRTLAEINANRADAEEEAATRIPA >gi568815596f:171590156_171847886|GENSCAN_predicted_CDS_5|1518_bp atgtcagacaaaagtgaattaaaggctgagttggaacgtaagaagcagcgactggcccaa atcagagaggaaaagaagagaaaagaagaagaaaggaaaaaaaaagaaacagaccagaag aaggaagctgttgctcctgtgcaagaagaatcagatcttgaaaaaaaaaggagagaagct gaagcattgcttcaaagcatggggctaactccagaatcccccattgatgaagaggaagat gatgatgtagtggctcctaaaccacctattgaacctgaagaagagaaaactttaaagaaa gatgaggaaaatgatagtaaagctccccctcatgagctgactgaagaagaaaagcaacaa atcttgcactctgaggaatttttaagtttctttgaccattctacaagaattgtagaaaga gctctttctgagcagattaacatcttctttgactatagtgggagagatttggaagacaaa gaaggagagattcaagcaggtgctaaactgtcattaaatcgacaattttttgacgaacgt tggtcaaagcatcgggtggttagttgtttggattggtcatctcagtatccggagttactc gtggcttcctataacaacaatgaagatgcccctcatgagcctgatggtgtggcccttgta tggaatatgaaatacaaaaaaactaccccagagtatgtgtttcactgccagtcagctgtg atgtctgccacatttgcaaaatttcatccaaatcttgttgttggtggtacatattcaggc caaattgtgctttgggataaccgtagcaataaaagaactccagtgcaaagaactccactg tcagcagctgcacacacagatagcatggagttggttcataaacagtcaaaagcagtagct gtgacatctatgtccttccctgttggagatgtcaacaactttgttgttgggagtgaagaa ggttctgtgtacacagcatgccgccatggcagcaaagctggaatcagtgagatgtttgag gggcatcaaggaccaatcactggcatccattgtcatgcagctgttggagcagtagacttc tcacatctttttgtcacttcatcgtttgactggacagtaaagctttggacaactaagaat aacaagcctttgtattcatttgaagataatgcagactatgtttatgatgttatgtggtca cctacccacccagccctgtttgcctgtgtggatggcatggggagattggatttgtggaat ctcaataatgacacagaggtaccaactgccagcatttctgtggagggtaatcctgctctt aatcgtgtgagatggacccattctggcagagagattgctgtgggtgattctgaaggacag attgttatatacgatgtgggagagcagattgctgttccccgcaatgatgaatgggcacgg 
tttggccgaacacttgcagaaattaatgcaaaccgagctgatgcagaggaggaagcagct acccgaatacctgcttag >gi568815596f:171590156_171847886|GENSCAN_predicted_peptide_6|132_aa MRNDKGDTATDPTEIQTTIREYYKYLYANKLENLEEMAEFLDTYTLPSLNQEEGESLNRP ITSSEMKAVIVYRPKNVQDQTDSQPNSTRVLEVVARAIRQEKEIKGIQTGKEEELSLFAD DMIVYLENPISA >gi568815596f:171590156_171847886|GENSCAN_predicted_CDS_6|399_bp atgagaaatgacaaaggggatactgccactgatcccacagaaatacaaactaccatcaga gaatactataaatacctctatgcaaataaactagaaaatctagaagaaatggctgaattc ctggacacatacaccctcccaagtctaaaccaggaagaaggtgaatccctgaatagacca ataacaagttctgaaatgaaggcagtaatagtctaccgaccaaaaaacgtccaggatcag acagattcacagccgaattctaccagagtgttggaagttgtggccagggcaatcaggcaa gagaaagaaataaagggtattcagacaggaaaagaggaagaactgtctctgtttgcagat gacatgattgtatatttagaaaaccccatctcagcctaa >gi568815596f:171590156_171847886|GENSCAN_predicted_peptide_7|570_aa XNVKEIFGQTIIHHHIPFNWDCEFIRLHFGHNRKKHLNYTEFTQFLQELQLEHARQAFAL KDKSKSGMISGLDFSDIMVTIRSHMLTPFVEENLVSAAGGSISHQVSFSYFNAFNSLLNN MELVRKIYSTLAGTRKDVEVTKEEFAQSAIRYGQVTPLEIDILYQLADLYNASGRLTLAD IERIAPLAEGALPYNLAELQRQQSPGLGRPIWLQIAESAYRFTLGSVAGAVGATAVYPID LVKTRMQNQRGSGSVVGELMYKNSFDCFKKVLRYEGFFGLYRGLIPQLIGVAPEKAIKLT VNDFVRDKFTRRDGSVPLPAEVLAGGCAGGSQVIFTNPLEIVKIRLQVAGEITTGPRVSA LNVLRDLGIFGLYKGAKACFLRDIPFSAIYFPVYAHCKLLLADENGHVGGLNLLAAGAMA GVPAASLVTPADVIKTRLQVAARAGQTTYSGVIDCFRKILREEGPSAFWKGTAARVFRSS PQFGVTLVTYELLQRWFYIDFGGLKPAGSEPTPKSRIADLPPANPDHIGGYRLATATFAG IENKFGLYLPKFKSPSVAVVQPKAAVAATQ >gi568815596f:171590156_171847886|GENSCAN_predicted_CDS_7|1713_bp naaaatgtcaaagaaatttttggacagactattattcatcatcatatcccttttaactgg gattgtgaatttatccgactgcattttgggcataaccggaagaagcatcttaactacaca gaattcacgcagtttctccaggagctgcaattggaacatgcaagacaagcctttgcactc aaagacaaaagcaaaagtggcatgatttctggtctggatttcagtgacatcatggttacc attagatctcacatgcttactccttttgtggaggagaacttagtttcagcagctggagga agtatctcacaccaggttagcttctcctacttcaatgcatttaactcgttactgaataac atggagcttgttcgtaagatatatagcactctagctggcacaaggaaagatgttgaagtc acaaaggaggaatttgcccagagtgccatacgctatggacaagtcacaccactagaaatt 
gatattctatatcagcttgcagacttatataatgcttcagggcgcttgactttggcagat attgagagaatagccccattggctgagggggccttaccttacaacctggcagaacttcag agacagcagtctcctgggttaggcaggcctatctggctccagattgccgagtctgcttac agattcactctgggctcagttgctggagctgtgggagccactgcagtgtatcctatagat ctggtgaagacccgaatgcaaaaccagcgtggctctggctctgttgttggggagctaatg tacaaaaacagctttgactgttttaagaaagtcttgcgttatgagggcttctttggactc tacaggggtctgataccacaacttataggggttgctccagaaaaggccattaaactgact gttaatgattttgttcgggacaaatttaccagaagagatggctctgttccacttccagca gaagttcttgctggaggctgtgctggaggctctcaggtcatttttaccaacccattggag atagtgaagattcgtctgcaagtagctggagagatcaccacgggacccagagtcagcgcc ctgaatgtgctccgggacttgggaatttttggtctgtataagggtgccaaagcgtgtttc ctccgagacattcccttctctgcaatctattttcctgtttatgctcattgcaaactactt ctggctgatgaaaatggacacgtgggaggtttaaatcttcttgcagctggagccatggca ggtgtcccagctgcatctctggtgacccctgctgatgtcatcaagacaagactgcaggtg gctgcccgcgctggccagacgacatacagtggtgtcatcgactgtttcaggaagattctc cgggaagaagggccctcagcattttggaaagggactgcagctcgagtgtttcgatcctct ccccagtttggtgttaccttggtcacttatgaacttctccagcggtggttttacattgat tttggaggcctcaaacccgctggttcagaaccaacacctaagtcacgcattgcagacctt cctcctgccaaccctgatcacatcggtggatacagactcgccacagccacgtttgcaggc atcgaaaacaaatttggcctttatctcccgaaatttaagtctcctagtgttgctgtggtt cagccaaaggcagcagtggcagccactcagtga