GENSCAN 1.0 Date run: 24-Oct-119 Time: 21:40:29 Sequence gi568815592f:25626909_25827289 : 200381 bp : 38.99% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 2490 2556 67 2 1 71 96 35 0.371 3.89 1.02 Term + 4631 4857 227 0 2 69 54 122 0.867 2.86 1.03 PlyA + 6486 6491 6 1.05 2.04 PlyA - 7079 7074 6 1.05 2.03 Term - 8107 7997 111 1 0 90 40 107 0.652 3.68 2.02 Intr - 14191 14063 129 1 0 74 85 70 0.646 5.27 2.01 Init - 14755 14723 33 1 0 80 86 17 0.579 0.63 2.00 Prom - 15316 15277 40 -6.25 3.00 Prom + 21221 21260 40 -8.05 3.01 Init + 25496 25577 82 1 1 94 101 136 0.999 14.90 3.02 Intr + 27922 28043 122 2 2 76 80 80 0.972 5.29 3.03 Intr + 34644 34736 93 2 0 87 78 143 0.992 12.44 3.04 Intr + 38035 38124 90 0 0 117 109 53 0.929 9.57 3.05 Intr + 42603 42659 57 2 0 92 77 78 0.901 5.26 3.06 Intr + 43091 43168 78 1 0 83 115 46 0.894 5.63 3.07 Term + 45907 46023 117 0 0 47 53 77 0.182 -2.34 3.08 PlyA + 48127 48132 6 1.05 4.03 PlyA - 48297 48292 6 1.05 4.02 Term - 48711 48691 21 0 0 91 43 53 0.517 -1.57 4.01 Init - 52056 51910 147 0 0 71 49 176 0.807 12.14 4.00 Prom - 52717 52678 40 -8.65 5.00 Prom + 52752 52791 40 -2.95 5.01 Init + 58133 58273 141 1 0 36 44 175 0.777 8.08 5.02 Intr + 62565 62624 60 2 0 111 81 60 0.815 5.61 5.03 Intr + 64148 64216 69 1 0 82 109 100 0.821 9.96 5.04 Term + 74299 74427 129 0 0 114 36 148 0.993 9.40 5.05 PlyA + 74850 74855 6 1.05 6.00 Prom + 96553 96592 40 -3.65 6.01 Init + 100001 100380 380 1 2 70 82 442 0.851 38.22 6.02 Intr + 104946 105225 280 0 1 17 27 351 0.886 18.46 6.03 Term + 105550 105837 288 0 0 15 46 442 0.885 27.09 6.04 PlyA + 106612 106617 6 1.05 7.02 PlyA - 108705 108700 6 1.05 7.01 Sngl - 132434 131877 558 2 0 81 47 211 0.965 12.18 7.00 Prom - 132938 132899 40 -8.25 8.00 Prom + 133476 133515 40 -6.95 8.01 Init + 135055 135145 91 0 1 67 93 104 0.958 9.50 8.02 Intr + 142077 142282 206 1 2 123 115 155 0.896 19.70 8.03 Intr + 143159 143392 234 1 0 60 81 193 0.965 12.76 8.04 Intr + 143476 143563 88 0 1 79 93 34 0.800 1.62 8.05 Intr + 144018 144104 87 1 0 84 97 30 0.621 2.52 8.06 Intr + 146675 146766 92 0 2 45 95 53 0.392 0.49 8.07 Intr + 149687 149819 133 1 1 57 87 52 0.726 1.30 8.08 Intr + 149904 150051 148 1 1 109 62 106 0.836 8.47 8.09 Intr + 151018 151108 91 1 1 80 94 65 0.989 5.38 8.10 Term + 152146 152280 135 0 0 122 53 100 0.999 6.94 8.11 PlyA + 153916 153921 6 1.05 9.00 Prom + 158634 158673 40 -5.85 9.01 Init + 159421 159751 331 0 1 70 9 213 0.568 9.12 9.02 Intr + 159867 159956 90 1 0 69 68 87 0.535 3.85 9.03 Term + 165610 165824 215 2 2 8 42 129 0.135 -3.29 9.04 PlyA + 166934 166939 6 1.05 10.10 PlyA - 167409 167404 6 1.05 10.09 Term - 172011 171877 135 0 0 71 49 109 0.790 2.34 10.08 Intr - 174072 173982 91 2 1 50 110 28 0.372 0.28 10.07 Intr - 184637 184490 148 0 1 75 91 25 0.470 -0.13 10.06 Intr - 184862 184730 133 2 1 73 115 92 0.926 9.70 10.05 Intr - 186084 185923 162 0 0 101 107 31 0.913 5.55 10.04 Intr - 186305 186187 119 0 2 64 75 75 0.925 3.06 10.03 Intr - 192690 192603 88 0 1 72 103 66 0.950 5.12 10.02 Intr - 193007 192774 234 2 0 99 67 168 0.992 12.76 10.01 Init - 199621 199553 69 1 0 84 83 32 0.449 3.40 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592f:25626909_25827289|GENSCAN_predicted_peptide_1|97_aa MKVRPLTSTQDGCERSTHLQNSGTVQASFWELEEVTWLFFHNMEGVQSKAGHQDRCMCPV ALPPALSMGCVLCSGSRWCRVADLCLVEPLNVGFSPC >gi568815592f:25626909_25827289|GENSCAN_predicted_CDS_1|294_bp atgaaagtccggcccctcacctcaactcaggatggctgtgaaaggtcaacccatcttcag aactcagggactgtgcaggctagcttttgggaactggaagaagtaacttggcttttcttc cacaacatggaaggtgtccagagcaaagctgggcatcaagacaggtgtatgtgtccagtg gctctgccacctgccctcagcatgggctgtgttctgtgctccggcagccggtggtgcaga gtggctgatctttgcctggtggagccattgaatgttggcttttctccttgctga >gi568815592f:25626909_25827289|GENSCAN_predicted_peptide_2|90_aa MAVVGDRALRKVACPASDPNTKEQSLLLLRHPTPPLESSGKIAFNKIYALQCSNMVNRTT GTFSWLHRSKNGVSGARMETLVREWSLVQE >gi568815592f:25626909_25827289|GENSCAN_predicted_CDS_2|273_bp atggctgtagttggagatagggccttgaggaaggtggcgtgtccagcctctgacccaaat accaaggaacaaagtctcctcctccttaggcatcccactccacctttggagagctcaggg aaaattgccttcaacaaaatctacgccctgcagtgctccaacatggtcaacagaactaca gggaccttcagctggctgcataggtcaaagaatggagtctctggtgcaagaatggagact ctggtgcgagaatggagtctggtgcaagaatag >gi568815592f:25626909_25827289|GENSCAN_predicted_peptide_3|212_aa MDSSREPTLGRLDAAGFWQVWQRFDADACEVKGSADEKDRDTSPFFPLVRPLAHTPLYPS FLYLHKLQDTVMKANLHKVKQQFMTTQDASKDGRIRMKELAGMFLSEDENFLLLFRRENP LDSSVEFMQIWRKYDADSSGFISAAELRNFLRDLFLHHKKAISEAKLEEYTGTMVMEDGN LDSGEVAEEVRTKYILKIDLRRFPDESGGSQG >gi568815592f:25626909_25827289|GENSCAN_predicted_CDS_3|639_bp atggacagctcccgggaaccgactctggggcgcttggacgccgctggcttctggcaggtc tggcagcgctttgatgcggatgcctgtgaagtgaaaggatcagcagacgagaaggataga gacacaagtcccttctttcctctggtcagaccacttgcgcacactcctctttatccttca ttcttgtacctacacaagctacaggacacggtcatgaaagcaaatttgcacaaggtgaaa cagcagtttatgactacccaagatgcctctaaagatggtcgcattcggatgaaagagctt gctggtatgttcttatctgaggatgaaaactttcttctgctctttcgccgggaaaaccca ctggacagcagcgtggagtttatgcagatttggcgcaaatatgacgctgacagcagtggc tttatatcagctgctgagctccgcaacttcctccgagacctctttcttcaccacaaaaag gccatttctgaggctaaactggaagaatacactggcaccatggtgatggaagatggcaac ttggactcaggtgaggtggcagaagaggtgagaactaaatatattttgaagatagacttg agaagatttcctgatgaatcaggtgggagtcaaggatga >gi568815592f:25626909_25827289|GENSCAN_predicted_peptide_4|55_aa MIVTVKSFQYILAKLPDDAPSKTPAGTAKEDKGKAKKTALTAAEKAKDLDLGGRE >gi568815592f:25626909_25827289|GENSCAN_predicted_CDS_4|168_bp atgattgtaactgtgaagagttttcaatatatcttggccaagctgccagatgatgcacct tccaaaacacctgctgggacagccaaggaagacaaggggaaggcaaagaagacagcactg acagctgcggagaaggccaaggacctagatttaggaggacgagagtga >gi568815592f:25626909_25827289|GENSCAN_predicted_peptide_5|132_aa MLVAGFKVEEEAKECRQPLEPEKGKGKDSPREPPEGMRPCRQLEFCPACSTEERKRDFEK IFAYYDVSKTGALEGPEVDGFVKDMMELVQPSISGVDLDKFREILLRHCDVNKDGKIQKS ELALCLGLKINP >gi568815592f:25626909_25827289|GENSCAN_predicted_CDS_5|399_bp atgctagttgctggctttaaagtggaagaagaggctaaagaatgtaggcagcctctagaa cctgaaaaaggcaagggaaaggattctccccgagaacctccagaaggaatgaggccgtgt cgacagcttgaattttgcccagcttgttctactgaagaaaggaaaagggactttgagaaa atctttgcctactatgatgttagtaaaacaggagccctggaaggcccagaagtggatggg tttgtcaaagacatgatggagcttgtccagcccagcatcagcggggtggaccttgataag ttccgcgagattctcctgcgtcactgcgacgtgaacaaggatggaaaaattcagaagtct gagctggctttgtgtcttgggctgaaaatcaacccataa >gi568815592f:25626909_25827289|GENSCAN_predicted_peptide_6|315_aa MPEVSSKGATISKKGFKKAVVKTQKKEGKKRKRTRKESYSIYIYKVLKQVHPDTGISSKA MSIMNSFVTDIFERIASEASRLAHYSKRSTISSREIQTAVRLLLPGELAKHAVSEGTKAV TKYTSSKQQQAHGRLDLPGGDGRALVIMRQAGSLTGNALKNVTNKRIHDTHGLGRDSSVR VDLLQQLVDVNRIALFAASLALFVLLRVLATAFLKPFSEMGNSGEGYALFIQRGSAVPRG RCQRAAQQGQRCREGRGLHASVPAGGSGVPGLRHPGVGGQRRRNKKKTRIIPRHLQLAIR NDKELDKLLARVTMA >gi568815592f:25626909_25827289|GENSCAN_predicted_CDS_6|948_bp atgccggaggtgtcatctaaaggtgctaccatttccaagaagggctttaagaaagctgtc gttaagacccagaaaaaggaaggcaaaaagcgcaagaggacccgtaaggagagttattct atttacatctacaaagtgctaaagcaggtccatccggacactggcatctcttcgaaagct atgagcattatgaattccttcgtcactgatatctttgagcgtatagcgagcgaggcatca cgtttggctcactacagcaagcgctccaccatttcttccagagagattcagacagcagtg cgcttgctactgccgggagagctggctaaacatgctgtgtctgagggcaccaaggctgtc actaagtacaccagctccaagcagcagcaggcgcacggccgtctggatctccctggaggt gatggtcgagcgcttgtcataatgcgccaggcgggaagcctcacaggcaatgcgctcaaa aatgtcactaacaaaagaattcatgatactcatggccttggaagagattccagtgtccgc gtggacctgcttcagcagcttgtagatgtaaatagaatagctctctttgcggcatctctt gcgctttttgtcctccttcgagttttagcaactgccttcttaaagcccttttcggaaatg ggaaactcgggggaagggtacgcgctattcattcagagaggttctgcagttccccgtggg cggtgtcaacgggctgctcagcaagggcaacgatgccgagagggtcgggggctgcacgct agcgtacctgccggcggttctggagtacctggcctccgacaccctggagttggcgggcaa cgccgtcggaacaagaagaagacccgcatcatcccgcgccacctgcagctggccatccgc aacgacaaggagctcgacaagctgctggcccgagtgacaatggcttag >gi568815592f:25626909_25827289|GENSCAN_predicted_peptide_7|185_aa MDLNYTLEQIDLTDIYRTLYPTTAEYTFYSSAHGTFSKTDHMIGHKTSLNKFKEIEIISS TPSDHSRIKLEINSKRNSQSCANTWKLNKLLLNDHWLNNEIKMKIKKFFELNDNRDITYQ NLWDTAKAVLRGKFIPLNAYIKKSERAQIDNLSLHLKELEKQEQTKPKPSRRKEITKIRA KLSGN >gi568815592f:25626909_25827289|GENSCAN_predicted_CDS_7|558_bp atggacttaaactataccctagaacaaatagacttaacagatatttacagaacactctac ccaacaactgcagaatatacattctattcatcagcacatgggacattctccaagacagac catatgataggccacaaaacaagtctcaacaaatttaaggaaatcgaaattatatcaagt actccctcagaccacagcagaataaaattggaaatcaactccaaaaggaactctcaaagc tgtgcaaatacatggaaattaaataagctgctcctaaatgatcattggctcaacaatgaa atcaagatgaaaattaaaaaattctttgaactgaacgacaaccgtgacataacctatcaa aacctctgggatacagcaaaagcggtgctaagaggaaagttcataccattaaatgcctac atcaaaaagtctgaaagagcacaaatagacaatctaagtttacacctcaaggaactagag aaacaagaacaaaccaaacccaaacccagcagaagaaaagaaataaccaagatcagagca aaactaagtggaaattga >gi568815592f:25626909_25827289|GENSCAN_predicted_peptide_8|434_aa MSTGPDVKATVGDISSDGNLNVAQEECSRKGFCSVRHGLALILQLCNFSIYTQQMNLSIA IPAMVNNTAPPSQPNASTERPSTDSQGYWNETLKEFKAMAPAYDWSPEIQGIILSSLNYG SFLAPIPSGYVAGIFGAKYVVGAGLFISSFLTLFIPLAANAGVALLIVLRIVQGIAQVMV LTGQYSIWVKWAPPLERSQLTTIAGSGSMLGSFIVLLAGGLLCQTIGWPYVFYIFVSYFC EYWLFYTIMAYTPTYISSVLQANLRDSGILSALPFVVGCICIILGGLLADFLLSRKILRL ITIRKLFTAIGVLFPSVILVSLPWVRSSHSMTMTFLVLSSAISSFCESGALVNFLDIAPR YTGFLKGLLQVFAHIAGAISPTAAGFFISQDSEFGWRNVFLLSAAVNISGLVFYLIFGRA DVQDWAKEQTFTHL >gi568815592f:25626909_25827289|GENSCAN_predicted_CDS_8|1305_bp atgtctaccggaccagatgtcaaggctacagtgggggacatttccagtgatggcaattta aacgtggctcaagaggaatgctccaggaaaggtttttgttcagtccgacatgggctggcc ctcatcttgcagctctgtaatttttcaatttacacccaacaaatgaacttgagcattgcc atcccagctatggtgaacaacacagccccacctagccagcccaatgcttccacagaacgg ccctccactgactcccagggctactggaatgaaactctaaaagaatttaaagcaatggcc cctgcatatgactggagtcctgaaatccagggaatcatcctcagctccctcaactatggc tcattcttggctccaatccccagtggctatgtggctggaatatttggagccaagtatgtg gttggtgctggcttgtttatttcctcattcctgaccctcttcattccactggcagctaat gcgggagtggccttgctcattgtcctccggattgtacaaggcatagcccaggttatggta ttaactggtcagtattcaatttgggtcaaatgggctcccccactggaaaggagtcaactc accaccattgctggatcagggtcaatgctggggtccttcattgttctacttgctggtggt ctcctctgccagaccataggatggccttacgtcttctatatctttgtctcttatttctgt gaatactggcttttttataccattatggcgtacacaccaacgtacatcagctcggtactt caagccaacctcagagatagtgggatcctgtctgccttgccgtttgttgttggatgtatc tgcattatccttggaggtctactggcagactttcttctctccagaaaaatcctcagactc atcaccatcaggaaactcttcactgccattggggttctcttcccatccgtgatcctcgtg tccctgccctgggtcagatccagccacagcatgaccatgaccttcttggtgctgtcttct gccatcagcagcttctgtgaatcaggagcccttgttaacttcttggatattgctcctcgg tacactggctttctcaaaggactattgcaagtctttgcacacatagctggagccatctct cctactgctgctggatttttcatcagtcaggattcagagtttggttggagaaatgtcttc ttgctttcagctgctgttaacatatcgggcctggttttctacctcatctttggccgagca gatgtgcaggactgggctaaagagcagacattcacccacctctga >gi568815592f:25626909_25827289|GENSCAN_predicted_peptide_9|211_aa MGQLLAFLRNLGVRGSDFPPEEQGRKTLTTAFEVDKTSPFCWTRGYMRVSHGLDAGHRGL IARPKGPPRREKGYPQREICVEWICDAKEMGSWKNQVHFLPCQPLMHEEKGGVRGGVSIL ADEEKSQTEESAHTPLPEGEDPHQDHADPTSFICIKELNEDLSRNLCYKTRAFLLISETF QFNVDKILPHPPPCQAFQSRNKHSLVAKAKI >gi568815592f:25626909_25827289|GENSCAN_predicted_CDS_9|636_bp atgggtcagctgttggctttcctgaggaatctgggagtgaggggctctgattttccaccc gaggaacaaggcagaaagacactgaccactgcatttgaagtggacaagacctctccattt tgttggaccagaggctacatgcgtgtttcccatggattagatgcaggtcacagaggactg atagccaggcccaagggaccacctagaagggagaagggatatccacaaagggagatttgt gtggagtggatctgtgatgccaaggagatgggcagctggaaaaaccaagtgcatttcctg ccttgtcagcctcttatgcatgaagaaaagggaggggtcagaggtggggtcagcatccta gcggatgaggagaaaagccagacagaagaaagtgctcacacccctctccctgaaggagaa gacccccatcaggatcatgctgacccaacatccttcatttgcataaaggaactgaatgag gacctgagcaggaacctttgctataaaaccagagccttccttttgatctctgaaacgttt caatttaatgtggacaaaatcctaccccatcctcctccttgtcaagctttccaatctcgt aataaacactctttagttgctaaagccaaaatctag >gi568815592f:25626909_25827289|GENSCAN_predicted_peptide_10|392_aa MVNSTDPHGLPNTSTKKLLDNIKNPMYNWSPDIQGIILSSTSYGVIIIQVPVGYFSGIYS TKKMIGFALCLSSVLSLLIPPAAGIGVAWVVVCRAVQGAAQGIVATAQFEIYVKWAPPLE RGRLTSMSTSGACGCAVCLLWFVLFYDDPKDHPCISISEKEYITSSLVQQVSSSRQSLPI KAILKSLPVWAISTGSFTFFWSHNIMTLYTPMFINSMLHVNIKENGFLSSLPYLFAWICG NLAGQLSDFFLTRNILSVIAVRKLFTAAGFLLPAIFGVCLPYLSSTFYSIVIFLILAGAT GSFCLGGVFINGLDIAPRYFGFIKACSTLTGMIGGLIASTLTGLILKQDPESAWFKTFIL MAAINVTGLIFYLIVATAEIQDWAKEKQHTRL >gi568815592f:25626909_25827289|GENSCAN_predicted_CDS_10|1179_bp atggtgaatagcacagatccacatggtttgcccaacacctccacaaagaagctcctggat aatataaagaaccctatgtataattggagcccagatatccagggaatcatcttgagttcc acctcctatggtgtcatcatcatccaagttcctgttggatacttctctggaatatattct acaaagaaaatgattggctttgcattatgcctcagctctgtgttaagcctgctcatccca ccagcagctggaattggagtagcttgggtcgttgtatgtcgagcagttcagggagcagcc caggggatagttgcaacagcccagtttgaaatatatgtcaaatgggctcctcccctggaa cgaggccgacttacttctatgagtacatcaggtgcttgtggctgtgccgtatgtcttctc tggttcgttctgttttatgatgaccccaaagaccacccatgtataagcatcagtgaaaag gaatacatcacatcctccctggtccagcaggtcagttcaagtagacaatctctgcctatc aaggctatacttaagtcgcttccagtctgggctatttccactggtagttttacgtttttc tggtcacataacatcatgacactatacactccaatgtttatcaactccatgcttcatgtt aatataaaagagaatgggttcttgtcttcccttccctatttgtttgcctggatctgtggt aacctagcaggtcagttatcagacttcttcctgaccaggaatattctcagcgtaattgct gtccggaaactcttcacagcagcaggatttctccttcctgcaatctttggtgtctgcctg ccttacctgagttccaccttctacagcattgtcattttcctaatacttgctggtgcaaca ggcagcttttgcttgggtggagtgtttataaatggcttggatattgctcccagatatttt ggatttattaaagcatgttcaactttaactggaatgataggaggactaattgcttccact ttgactggattgatccttaagcaggatccggaatccgcctggtttaaaaccttcatcctg atggcagccattaatgtgactggcctaattttctaccttatagttgctacagcagaaatt caggactgggctaaagaaaaacaacacacacgtctctga