GENSCAN 1.0 Date run: 8-Feb-118 Time: 18:24:19

Sequence gi568815592f:25661963_25879185 : 217223 bp : 38.56% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   2981   3070   90  1  0  117  109    53 0.924   9.57
 1.02 Intr +   7549   7605   57  0  0   92   77    78 0.900   5.26
 1.03 Intr +   8037   8114   78  2  0   83  115    46 0.893   5.63
 1.04 Term +  10853  10969  117  1  0   47   53    77 0.182  -2.34
 1.05 PlyA +  13073  13078    6                               1.05

 2.03 PlyA -  13243  13238    6                               1.05
 2.02 Term -  13657  13637   21  1  0   91   43    53 0.517  -1.57
 2.01 Init -  17002  16856  147  1  0   71   49   176 0.807  12.14
 2.00 Prom -  17663  17624   40                              -8.65

 3.00 Prom +  17698  17737   40                              -2.95
 3.01 Init +  23079  23219  141  2  0   36   44   175 0.777   8.08
 3.02 Intr +  27511  27570   60  0  0  111   81    60 0.815   5.61
 3.03 Intr +  29094  29162   69  2  0   82  109   100 0.821   9.96
 3.04 Term +  39245  39373  129  1  0  114   36   148 0.993   9.40
 3.05 PlyA +  39796  39801    6                               1.05

 4.00 Prom +  61499  61538   40                              -3.65
 4.01 Init +  64947  65326  380  2  2   70   82   442 0.851  38.22
 4.02 Intr +  69892  70171  280  1  1   17   27   351 0.886  18.46
 4.03 Term +  70496  70783  288  1  0   15   46   442 0.885  27.09
 4.04 PlyA +  71558  71563    6                               1.05

 5.02 PlyA -  73651  73646    6                               1.05
 5.01 Sngl -  97380  96823  558  0  0   81   47   211 0.965  12.18
 5.00 Prom -  97884  97845   40                              -8.25

 6.00 Prom +  98422  98461   40                              -6.95
 6.01 Init + 100001 100091   91  1  1   67   93   104 0.958   9.50
 6.02 Intr + 107023 107228  206  2  2  123  115   155 0.896  19.70
 6.03 Intr + 108105 108338  234  2  0   60   81   193 0.965  12.76
 6.04 Intr + 108422 108509   88  1  1   79   93    34 0.800   1.62
 6.05 Intr + 108964 109050   87  2  0   84   97    30 0.621   2.52
 6.06 Intr + 111621 111712   92  1  2   45   95    53 0.392   0.49
 6.07 Intr + 114633 114765  133  2  1   57   87    52 0.726   1.30
 6.08 Intr + 114850 114997  148  2  1  109   62   106 0.836   8.47
 6.09 Intr + 115964 116054   91  2  1   80   94    65 0.989   5.38
 6.10 Term + 117092 117226  135  1  0  122   53   100 0.999   6.94
 6.11 PlyA + 118862 118867    6                               1.05

 7.00 Prom + 123580 123619   40                              -5.85
 7.01 Init + 124367 124697  331  1  1   70    9   213 0.568   9.12
 7.02 Intr + 124813 124902   90  2  0   69   68    87 0.535   3.85
 7.03 Term + 130556 130770  215  0  2    8   42   129 0.135  -3.29
 7.04 PlyA + 131880 131885    6                               1.05

 8.19 PlyA - 132355 132350    6                               1.05
 8.18 Term - 136957 136823  135  1  0   71   49   109 0.790   2.34
 8.17 Intr - 139018 138928   91  0  1   50  110    28 0.372   0.28
 8.16 Intr - 149583 149436  148  1  1   75   91    25 0.470  -0.13
 8.15 Intr - 149808 149676  133  0  1   73  115    92 0.925   9.70
 8.14 Intr - 151030 150869  162  1  0  101  107    31 0.913   5.55
 8.13 Intr - 151251 151133  119  1  2   64   75    75 0.925   3.06
 8.12 Intr - 157636 157549   88  1  1   72  103    66 0.950   5.12
 8.11 Intr - 157953 157720  234  0  0   99   67   168 0.989  12.76
 8.10 Intr - 164665 164499  167  2  2   66   83    53 0.079   1.36
 8.09 Intr - 175242 174968  275  2  2   79   15   186 0.341   6.66
 8.08 Intr - 180959 180765  195  1  0   17   75   122 0.012   1.41
 8.07 Intr - 187990 187843  148  2  1  110   97   114 0.953  12.87
 8.06 Intr - 192954 192902   53  2  2   82   82    36 0.412   0.03
 8.05 Intr - 198507 198423   85  1  1   61   58   117 0.508   3.96
 8.04 Intr - 199749 199662   88  0  1   31  110    12 0.832  -3.68
 8.03 Intr - 200067 199834  234  0  0   41   80   169 0.962   8.36
 8.02 Intr - 200482 200271  212  2  2   47  121   153 0.616  12.21
 8.01 Init - 206425 206335   91  1  1   84   94    91 0.520  10.00
 8.00 Prom - 207586 207547   40                              -5.35

 9.00 Prom + 209205 209244   40                              -3.65
 9.01 Init + 209835 209915   81  2  0   72  115    22 0.789   4.32
 9.02 Term + 211584 211871  288  2  0  102   48   108 0.579   2.59
 9.03 PlyA + 212331 212336    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term - 183554 183420  135  2  0   71   49    87 0.860   0.14

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592f:25661963_25879185|GENSCAN_predicted_peptide_1|113_aa
LAGMFLSEDENFLLLFRRENPLDSSVEFMQIWRKYDADSSGFISAAELRNFLRDLFLHHK
KAISEAKLEEYTGTMVMEDGNLDSGEVAEEVRTKYILKIDLRRFPDESGGSQG

>gi568815592f:25661963_25879185|GENSCAN_predicted_CDS_1|342_bp
cttgctggtatgttcttatctgaggatgaaaactttcttctgctctttcgccgggaaaac
ccactggacagcagcgtggagtttatgcagatttggcgcaaatatgacgctgacagcagt
ggctttatatcagctgctgagctccgcaacttcctccgagacctctttcttcaccacaaa
aaggccatttctgaggctaaactggaagaatacactggcaccatggtgatggaagatggc
aacttggactcaggtgaggtggcagaagaggtgagaactaaatatattttgaagatagac
ttgagaagatttcctgatgaatcaggtgggagtcaaggatga

>gi568815592f:25661963_25879185|GENSCAN_predicted_peptide_2|55_aa
MIVTVKSFQYILAKLPDDAPSKTPAGTAKEDKGKAKKTALTAAEKAKDLDLGGRE

>gi568815592f:25661963_25879185|GENSCAN_predicted_CDS_2|168_bp
atgattgtaactgtgaagagttttcaatatatcttggccaagctgccagatgatgcacct
tccaaaacacctgctgggacagccaaggaagacaaggggaaggcaaagaagacagcactg
acagctgcggagaaggccaaggacctagatttaggaggacgagagtga

>gi568815592f:25661963_25879185|GENSCAN_predicted_peptide_3|132_aa
MLVAGFKVEEEAKECRQPLEPEKGKGKDSPREPPEGMRPCRQLEFCPACSTEERKRDFEK
IFAYYDVSKTGALEGPEVDGFVKDMMELVQPSISGVDLDKFREILLRHCDVNKDGKIQKS
ELALCLGLKINP

>gi568815592f:25661963_25879185|GENSCAN_predicted_CDS_3|399_bp
atgctagttgctggctttaaagtggaagaagaggctaaagaatgtaggcagcctctagaa
cctgaaaaaggcaagggaaaggattctccccgagaacctccagaaggaatgaggccgtgt
cgacagcttgaattttgcccagcttgttctactgaagaaaggaaaagggactttgagaaa
atctttgcctactatgatgttagtaaaacaggagccctggaaggcccagaagtggatggg
tttgtcaaagacatgatggagcttgtccagcccagcatcagcggggtggaccttgataag
ttccgcgagattctcctgcgtcactgcgacgtgaacaaggatggaaaaattcagaagtct
gagctggctttgtgtcttgggctgaaaatcaacccataa

>gi568815592f:25661963_25879185|GENSCAN_predicted_peptide_4|315_aa
MPEVSSKGATISKKGFKKAVVKTQKKEGKKRKRTRKESYSIYIYKVLKQVHPDTGISSKA
MSIMNSFVTDIFERIASEASRLAHYSKRSTISSREIQTAVRLLLPGELAKHAVSEGTKAV
TKYTSSKQQQAHGRLDLPGGDGRALVIMRQAGSLTGNALKNVTNKRIHDTHGLGRDSSVR
VDLLQQLVDVNRIALFAASLALFVLLRVLATAFLKPFSEMGNSGEGYALFIQRGSAVPRG
RCQRAAQQGQRCREGRGLHASVPAGGSGVPGLRHPGVGGQRRRNKKKTRIIPRHLQLAIR
NDKELDKLLARVTMA

>gi568815592f:25661963_25879185|GENSCAN_predicted_CDS_4|948_bp
atgccggaggtgtcatctaaaggtgctaccatttccaagaagggctttaagaaagctgtc
gttaagacccagaaaaaggaaggcaaaaagcgcaagaggacccgtaaggagagttattct
atttacatctacaaagtgctaaagcaggtccatccggacactggcatctcttcgaaagct
atgagcattatgaattccttcgtcactgatatctttgagcgtatagcgagcgaggcatca
cgtttggctcactacagcaagcgctccaccatttcttccagagagattcagacagcagtg
cgcttgctactgccgggagagctggctaaacatgctgtgtctgagggcaccaaggctgtc
actaagtacaccagctccaagcagcagcaggcgcacggccgtctggatctccctggaggt
gatggtcgagcgcttgtcataatgcgccaggcgggaagcctcacaggcaatgcgctcaaa
aatgtcactaacaaaagaattcatgatactcatggccttggaagagattccagtgtccgc
gtggacctgcttcagcagcttgtagatgtaaatagaatagctctctttgcggcatctctt
gcgctttttgtcctccttcgagttttagcaactgccttcttaaagcccttttcggaaatg
ggaaactcgggggaagggtacgcgctattcattcagagaggttctgcagttccccgtggg
cggtgtcaacgggctgctcagcaagggcaacgatgccgagagggtcgggggctgcacgct
agcgtacctgccggcggttctggagtacctggcctccgacaccctggagttggcgggcaa
cgccgtcggaacaagaagaagacccgcatcatcccgcgccacctgcagctggccatccgc
aacgacaaggagctcgacaagctgctggcccgagtgacaatggcttag

>gi568815592f:25661963_25879185|GENSCAN_predicted_peptide_5|185_aa
MDLNYTLEQIDLTDIYRTLYPTTAEYTFYSSAHGTFSKTDHMIGHKTSLNKFKEIEIISS
TPSDHSRIKLEINSKRNSQSCANTWKLNKLLLNDHWLNNEIKMKIKKFFELNDNRDITYQ
NLWDTAKAVLRGKFIPLNAYIKKSERAQIDNLSLHLKELEKQEQTKPKPSRRKEITKIRA
KLSGN

>gi568815592f:25661963_25879185|GENSCAN_predicted_CDS_5|558_bp
atggacttaaactataccctagaacaaatagacttaacagatatttacagaacactctac
ccaacaactgcagaatatacattctattcatcagcacatgggacattctccaagacagac
catatgataggccacaaaacaagtctcaacaaatttaaggaaatcgaaattatatcaagt
actccctcagaccacagcagaataaaattggaaatcaactccaaaaggaactctcaaagc
tgtgcaaatacatggaaattaaataagctgctcctaaatgatcattggctcaacaatgaa
atcaagatgaaaattaaaaaattctttgaactgaacgacaaccgtgacataacctatcaa
aacctctgggatacagcaaaagcggtgctaagaggaaagttcataccattaaatgcctac
atcaaaaagtctgaaagagcacaaatagacaatctaagtttacacctcaaggaactagag
aaacaagaacaaaccaaacccaaacccagcagaagaaaagaaataaccaagatcagagca
aaactaagtggaaattga

>gi568815592f:25661963_25879185|GENSCAN_predicted_peptide_6|434_aa
MSTGPDVKATVGDISSDGNLNVAQEECSRKGFCSVRHGLALILQLCNFSIYTQQMNLSIA
IPAMVNNTAPPSQPNASTERPSTDSQGYWNETLKEFKAMAPAYDWSPEIQGIILSSLNYG
SFLAPIPSGYVAGIFGAKYVVGAGLFISSFLTLFIPLAANAGVALLIVLRIVQGIAQVMV
LTGQYSIWVKWAPPLERSQLTTIAGSGSMLGSFIVLLAGGLLCQTIGWPYVFYIFVSYFC
EYWLFYTIMAYTPTYISSVLQANLRDSGILSALPFVVGCICIILGGLLADFLLSRKILRL
ITIRKLFTAIGVLFPSVILVSLPWVRSSHSMTMTFLVLSSAISSFCESGALVNFLDIAPR
YTGFLKGLLQVFAHIAGAISPTAAGFFISQDSEFGWRNVFLLSAAVNISGLVFYLIFGRA
DVQDWAKEQTFTHL

>gi568815592f:25661963_25879185|GENSCAN_predicted_CDS_6|1305_bp
atgtctaccggaccagatgtcaaggctacagtgggggacatttccagtgatggcaattta
aacgtggctcaagaggaatgctccaggaaaggtttttgttcagtccgacatgggctggcc
ctcatcttgcagctctgtaatttttcaatttacacccaacaaatgaacttgagcattgcc
atcccagctatggtgaacaacacagccccacctagccagcccaatgcttccacagaacgg
ccctccactgactcccagggctactggaatgaaactctaaaagaatttaaagcaatggcc
cctgcatatgactggagtcctgaaatccagggaatcatcctcagctccctcaactatggc
tcattcttggctccaatccccagtggctatgtggctggaatatttggagccaagtatgtg
gttggtgctggcttgtttatttcctcattcctgaccctcttcattccactggcagctaat
gcgggagtggccttgctcattgtcctccggattgtacaaggcatagcccaggttatggta
ttaactggtcagtattcaatttgggtcaaatgggctcccccactggaaaggagtcaactc
accaccattgctggatcagggtcaatgctggggtccttcattgttctacttgctggtggt
ctcctctgccagaccataggatggccttacgtcttctatatctttgtctcttatttctgt
gaatactggcttttttataccattatggcgtacacaccaacgtacatcagctcggtactt
caagccaacctcagagatagtgggatcctgtctgccttgccgtttgttgttggatgtatc
tgcattatccttggaggtctactggcagactttcttctctccagaaaaatcctcagactc
atcaccatcaggaaactcttcactgccattggggttctcttcccatccgtgatcctcgtg
tccctgccctgggtcagatccagccacagcatgaccatgaccttcttggtgctgtcttct
gccatcagcagcttctgtgaatcaggagcccttgttaacttcttggatattgctcctcgg
tacactggctttctcaaaggactattgcaagtctttgcacacatagctggagccatctct
cctactgctgctggatttttcatcagtcaggattcagagtttggttggagaaatgtcttc
ttgctttcagctgctgttaacatatcgggcctggttttctacctcatctttggccgagca
gatgtgcaggactgggctaaagagcagacattcacccacctctga

>gi568815592f:25661963_25879185|GENSCAN_predicted_peptide_7|211_aa
MGQLLAFLRNLGVRGSDFPPEEQGRKTLTTAFEVDKTSPFCWTRGYMRVSHGLDAGHRGL
IARPKGPPRREKGYPQREICVEWICDAKEMGSWKNQVHFLPCQPLMHEEKGGVRGGVSIL
ADEEKSQTEESAHTPLPEGEDPHQDHADPTSFICIKELNEDLSRNLCYKTRAFLLISETF
QFNVDKILPHPPPCQAFQSRNKHSLVAKAKI

>gi568815592f:25661963_25879185|GENSCAN_predicted_CDS_7|636_bp
atgggtcagctgttggctttcctgaggaatctgggagtgaggggctctgattttccaccc
gaggaacaaggcagaaagacactgaccactgcatttgaagtggacaagacctctccattt
tgttggaccagaggctacatgcgtgtttcccatggattagatgcaggtcacagaggactg
atagccaggcccaagggaccacctagaagggagaagggatatccacaaagggagatttgt
gtggagtggatctgtgatgccaaggagatgggcagctggaaaaaccaagtgcatttcctg
ccttgtcagcctcttatgcatgaagaaaagggaggggtcagaggtggggtcagcatccta
gcggatgaggagaaaagccagacagaagaaagtgctcacacccctctccctgaaggagaa
gacccccatcaggatcatgctgacccaacatccttcatttgcataaaggaactgaatgag
gacctgagcaggaacctttgctataaaaccagagccttccttttgatctctgaaacgttt
caatttaatgtggacaaaatcctaccccatcctcctccttgtcaagctttccaatctcgt
aataaacactctttagttgctaaagccaaaatctag

>gi568815592f:25661963_25879185|GENSCAN_predicted_peptide_8|885_aa
MATKTELSPTARESKNAQDMQVDETLIPRKVPSLCSARYGIALVLHFCNFTTIAQNVIMN
ITMVAMVNSTSPQSQLNDSSEVLPVDSFGGLSKAPKSLPAKAPVYDWSPQIQGIIFGAVG
YGGILTMAPSGYLAGRVGTKRVVGISLFATSFLTLCIPLATDFGIVLLIVTRIVQGLSQS
SILGGQFAIWEKWGPPQERSRLCSIALSVDLIEIAISDDPGAPLQTVLVDGIAQLIRVIS
HTLLPLKAVGDRLAGSLPSSALIVSLPYLNSGYITATALLTLSCGLSTLCQSGIYINVLD
IAPRNYKSEMEKKIKEKNANVQYRRPEFRPNASVRACGWKTTSQDRMPTSKKGQKKKCPH
ITEMKESGVSLHSLTTFASQSPVFSFCMAQTLIFACRQFGQERTPWFQSSNISILLNNQT
IQVPLVASVSSSSTRTKRALHLIPLVTGLSISDALGTGIASFCSFRYGLSFLVHCCNVII
TAQRACLNLTMVVMVNSTDPHGLPNTSTKKLLDNIKNPMYNWSPDIQGIILSSTSYGVII
IQVPVGYFSGIYSTKKMIGFALCLSSVLSLLIPPAAGIGVAWVVVCRAVQGAAQGIVATA
QFEIYVKWAPPLERGRLTSMSTSGACGCAVCLLWFVLFYDDPKDHPCISISEKEYITSSL
VQQVSSSRQSLPIKAILKSLPVWAISTGSFTFFWSHNIMTLYTPMFINSMLHVNIKENGF
LSSLPYLFAWICGNLAGQLSDFFLTRNILSVIAVRKLFTAAGFLLPAIFGVCLPYLSSTF
YSIVIFLILAGATGSFCLGGVFINGLDIAPRYFGFIKACSTLTGMIGGLIASTLTGLILK
QDPESAWFKTFILMAAINVTGLIFYLIVATAEIQDWAKEKQHTRL

>gi568815592f:25661963_25879185|GENSCAN_predicted_CDS_8|2658_bp
atggccaccaagacagagttgagtcccacagcaagggagagcaagaacgcacaagatatg
caagtggatgagacactgatccccaggaaagttccaagtttatgttctgctcgctatgga
atagccctcgtcttacatttctgcaatttcacaacgatagcacaaaatgtcatcatgaac
atcaccatggtagccatggtcaacagcacaagccctcaatcccagctcaatgattcctct
gaggtgctgcctgttgactcatttggtggcctaagtaaagccccaaagagtcttcctgca
aaggctcctgtgtatgactggtctcctcaaatccaaggcatcatctttggtgctgttggc
tatggtggcatactgacaatggctcccagtggatacctggctggaagagtaggaacaaag
cgagtggttggcatttctttgtttgcaacttcatttctcactctatgcatccctctggcc
actgactttggaatagtcttgctcattgtaactcgaatagtccagggcctaagccagtcc
tcaatacttgggggtcagtttgcaatttgggaaaagtggggccctccacaagaacgaagc
agactctgcagcattgctttatcagtagatcttatagaaattgccatttctgatgatcct
ggggctcctctgcagactgtactggtggatggaattgctcagcttataagggtaatcagt
catactctgctgccattgaaggcagtaggggataggcttgcaggaagtctcccctcttca
gcactcattgtgtctctgccttacctcaattccggctatatcacagcaactgccttgctg
acgctctcttgcggattaagcacattgtgtcagtcagggatttatatcaatgtcttagat
attgctccaagaaactataagagtgaaatggagaaaaagataaaggagaaaaatgcaaat
gttcagtaccgcagacctgagtttagaccaaatgcctcagtaagagcctgtggatggaaa
accacaagccaagacagaatgccaacatctaagaagggccaaaaaaagaaatgcccacac
ataacagagatgaaagagtcaggtgtctcgctacattcacttacaacttttgcctctcaa
tccccggtcttttctttctgtatggcacaaactcttatctttgcttgccggcaatttggt
caagaacgtacaccctggtttcaatcttcaaacattagcattttgctaaacaaccagacc
atccaggttcctttggtagcttctgtttcatcttcctccacacgcactaagcgggctcta
catcttattcctctggtaacaggattaagcatctcagatgcacttggcactggaatagca
agtttctgttcctttcgctatggattgtctttccttgtgcactgttgtaatgttataata
acagcacagcgtgcgtgcctgaacctcacaatggtagtcatggtgaatagcacagatcca
catggtttgcccaacacctccacaaagaagctcctggataatataaagaaccctatgtat
aattggagcccagatatccagggaatcatcttgagttccacctcctatggtgtcatcatc
atccaagttcctgttggatacttctctggaatatattctacaaagaaaatgattggcttt
gcattatgcctcagctctgtgttaagcctgctcatcccaccagcagctggaattggagta
gcttgggtcgttgtatgtcgagcagttcagggagcagcccaggggatagttgcaacagcc
cagtttgaaatatatgtcaaatgggctcctcccctggaacgaggccgacttacttctatg
agtacatcaggtgcttgtggctgtgccgtatgtcttctctggttcgttctgttttatgat
gaccccaaagaccacccatgtataagcatcagtgaaaaggaatacatcacatcctccctg
gtccagcaggtcagttcaagtagacaatctctgcctatcaaggctatacttaagtcgctt
ccagtctgggctatttccactggtagttttacgtttttctggtcacataacatcatgaca
ctatacactccaatgtttatcaactccatgcttcatgttaatataaaagagaatgggttc
ttgtcttcccttccctatttgtttgcctggatctgtggtaacctagcaggtcagttatca
gacttcttcctgaccaggaatattctcagcgtaattgctgtccggaaactcttcacagca
gcaggatttctccttcctgcaatctttggtgtctgcctgccttacctgagttccaccttc
tacagcattgtcattttcctaatacttgctggtgcaacaggcagcttttgcttgggtgga
gtgtttataaatggcttggatattgctcccagatattttggatttattaaagcatgttca
actttaactggaatgataggaggactaattgcttccactttgactggattgatccttaag
caggatccggaatccgcctggtttaaaaccttcatcctgatggcagccattaatgtgact
ggcctaattttctaccttatagttgctacagcagaaattcaggactgggctaaagaaaaa
caacacacacgtctctga

>gi568815592f:25661963_25879185|GENSCAN_predicted_peptide_9|122_aa
MVGFLFLRLNEDPKKRNETTKQTYPSMPTYHLLLTLVTVPLPHLSISSSIPSHFSLVKFH
TNSLDLDAAFLQMLSWTRATLAIGIIEIIIIATLINYCTSQSTSIPDGDITDPLPPGHPV
GK

>gi568815592f:25661963_25879185|GENSCAN_predicted_CDS_9|369_bp
atggttggctttttgtttctaaggttaaatgaagatcctaaaaagagaaatgaaacaacc
aaacagacatatccaagtatgcccacctaccatcttctgctgactctggtgactgtccca
ttgcctcatctctccatttcctcttctatccctagccattttagtttggttaaatttcac
accaactcactggatcttgatgcagcatttctccagatgctttcttggaccagagccacc
ctagccattggtataatagaaataattattattgccactcttataaattactgcaccagc
cagtctacatcaatcccagatggagatattactgaccctcttcctccaggtcacccagtt
ggtaaatga