GENSCAN 1.0    Date run: 4-Nov-116    Time: 02:27:33

Sequence gi568815597f:92733388_92941862 : 208475 bp : 40.14% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +   3028   3151  124  1  1   84   42   105 0.245   2.38
 1.02 PlyA +   3252   3257    6                               1.05

 2.00 Prom +   9413   9452   40                              -2.45
 2.01 Init +  22444  22569  126  0  0   96   42    97 0.788   6.11
 2.02 Intr +  44208  44316  109  2  1   78  100    52 0.044   4.34
 2.03 Intr +  44883  45017  135  1  0   37   61   111 0.280   2.92
 2.04 Term +  45156  45325  170  1  2   44   42    99 0.823  -2.04
 2.05 PlyA +  46373  46378    6                               1.05

 3.04 PlyA -  46576  46571    6                               1.05
 3.03 Term -  51296  50917  380  0  2   31   41   228 0.566   6.37
 3.02 Intr -  52216  52077  140  0  2   76   -4   166 0.881   5.39
 3.01 Init -  59900  59863   38  2  2   39  115    65 0.407   3.84
 3.00 Prom -  60015  59976   40                             -11.34

 4.00 Prom +  60135  60174   40                              -5.85
 4.01 Init +  61443  61700  258  2  0   90   33   191 0.785  10.98
 4.02 Intr +  85118  85166   49  1  1   95  105    51 0.027   4.83
 4.03 Intr + 100158 100273  116  1  2   76  111    86 0.956   8.95
 4.04 Intr + 101392 101526  135  0  0   67   75    97 0.973   6.14
 4.05 Intr + 102803 103005  203  1  2   40  110   265 0.990  21.06
 4.06 Intr + 104069 104246  178  2  1   17   53   226 0.980  11.00
 4.07 Intr + 107164 107252   89  0  2   91   87   168 0.743  14.85
 4.08 Intr + 107283 107384  102  0  0   57   41   122 0.634   2.77
 4.09 Term + 108379 108478  100  1  1   87   40   174 0.981   9.12
 4.10 PlyA + 108515 108520    6                               1.05

 5.05 PlyA - 108800 108795    6                               1.05
 5.04 Term - 110808 109996  813  0  0   39   32   504 0.989  32.07
 5.03 Intr - 113972 113796  177  2  0   81  121    17 0.847   3.49
 5.02 Intr - 117568 117461  108  1  0   85   80    68 0.644   5.26
 5.01 Init - 120084 120043   42  0  0   74   46    10 0.144  -4.13
 5.00 Prom - 124256 124217   40                              -2.95

 6.00 Prom + 125495 125534   40                              -6.85
 6.01 Init + 137407 137450   44  0  2   57   42    65 0.481  -1.26
 6.02 Intr + 139525 139699  175  1  1   64   60   141 0.724   7.92
 6.03 Intr + 142952 143045   94  1  1   61   76    54 0.207   0.12
 6.04 Term + 159447 160174  728  1  2  -19   43   416 0.006  19.05
 6.05 PlyA + 160440 160445    6                               1.05

 7.00 Prom + 160571 160610   40                              -6.15
 7.01 Sngl + 160664 161374  711  1  0   49   42   287 0.878  16.17
 7.02 PlyA + 161377 161382    6                               1.05

 8.00 Prom + 163796 163835   40                              -3.65
 8.01 Init + 168404 168478   75  1  0   74   41    57 0.452   0.74
 8.02 Intr + 169452 169593  142  2  1   61   89    83 0.233   4.71
 8.03 Intr + 187866 188019  154  1  1   67   47   108 0.541   2.81
 8.04 Intr + 188150 188289  140  2  2   11   82   109 0.818   1.79
 8.05 Term + 191043 191182  140  1  2   63   49    96 0.696   0.34
 8.06 PlyA + 196592 196597    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  84229  84117  113  2  2   83   43   105 0.865   3.24
S.002 Intr -  85060  84932  129  2  0   46  109    91 0.853   6.75
S.003 Sngl + 159518 160174  657  1  0   59   43   328 0.832  21.32

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597f:92733388_92941862|GENSCAN_predicted_peptide_1|41_aa
XLARSSKSSGLNCDGEDGERAGVDSVVDEDVVCNEVDGLAT

>gi568815597f:92733388_92941862|GENSCAN_predicted_CDS_1|126_bp
nntttagccaggagttctaagtcgtctggactcaactgtgatggggaagatggtgaaagg
gctggtgttgatagtgtggtagatgaggatgtggtatgtaatgaagtagatggacttgcc
acctga

>gi568815597f:92733388_92941862|GENSCAN_predicted_peptide_2|179_aa
MARRAGCQYSPSTWYKELKLPSYKGQSPQLSLSQYFADLTAITTMKKFKRYNPIRRRQQW
RTGGNKFLYNGKQTVDFQDSSKGSGTARTKSGRGTKRKEQVNAEQLESQISSKLLSGPTL
NVADTDNQTSTLQAENWKILSGKYDWDKRKDKYSGDSLKQSTQINYFNLGYTLELSWQL

>gi568815597f:92733388_92941862|GENSCAN_predicted_CDS_2|540_bp
atggcgaggagagctggctgccaatattcaccgagcacttggtacaaagagctgaagttg
ccctcatacaaaggccagtcccctcaactgagtctcagccagtattttgctgacttgact
gccattacaactatgaagaaatttaaaaggtataaccccataaggaggagacaacaatgg
aggacaggtggcaacaagtttctgtataatggaaagcagacagtggacttccaagattcc
tcaaaaggctcaggaactgccagaacaaagagtggaagggggactaaaagaaaggagcaa
gtgaatgctgaacagttagaatcccagatttcttccaagctgctgagtggccccaccctc
aatgtggcagatactgacaaccagacctcaactctccaagcagaaaactggaagatactc
tctggaaaatatgactgggacaaaaggaaagataaatacagtggagacagcctcaaacaa
tccacccagatcaattatttcaatcttggctacacactggaattatcttggcagctttaa

>gi568815597f:92733388_92941862|GENSCAN_predicted_peptide_3|185_aa
MIDEKRFHSAQLCSLHKPNAKTRVRVHYNYKCHLRVDYCFCECPKATSGGARGCLPKHAR
PKFPALASSPVAAAAARSPPSRGRWGRNSSAGLTVASQGGGGTEQLRGHVLVNPVEDTDP
APHPRGQKSVPFGDLRGESTRPPDLQSWRLYLASGILRMACLQAADGQDRCARSRMNLRR
VCSHS

>gi568815597f:92733388_92941862|GENSCAN_predicted_CDS_3|558_bp
atgatagatgaaaagcgcttccacagtgcccagctatgtagtctgcataaacccaatgcg
aagacacgtgtaagagtacattacaattacaagtgtcatttacgcgttgattattgtttt
tgcgagtgccctaaggccacctctgggggagctcggggctgccttccgaagcatgcgcgg
cctaagttccctgcactcgcttcctcgcctgtcgccgccgccgccgcccgcagccctcct
tctcgtgggcgctggggaagaaactcgtcggcgggtctaactgtggcgtcccagggcggt
ggagggacggagcagcttcgggggcacgtcctcgtaaatcccgtggaggacactgaccct
gcaccccaccctcgaggccagaagtcggttccctttggggacctgaggggcgagagcact
cgcccccctgacttgcaaagttggcgtctttacttggcctccgggattctgcgcatggcg
tgtctccaggctgctgatgggcaagacagatgtgccaggtccagaatgaacttgagaaga
gtttgtagccattcctga

>gi568815597f:92733388_92941862|GENSCAN_predicted_peptide_4|409_aa
MCEQLCEGKELRPPINIQDQLDSHANEPPQKQIFQPQSSLQITSVPAHIRLNLMKNQKSE
PVSKLLLNSYYIETVNAPLHSSLGNRKYSFIFWQLAGHTPPSEGKTDYYARKRLVIQDKN
KYNTPKYRMIVRVTNRDIICQIAYARIEGDMIVCAAYAHELPKYGVKVGLTNYAAAYCTG
LLLARRLLNRFGMDKIYEGQVEVTGDEYNVESIDGQPGAFTCYLDAGLARTTTGNKVFGA
LKGAVDGGLSIPHSTKRFPGYDSESKEFNAEVHRKHIMGQNVADYMRYLMEEDEDAYKKQ
FSQYIKNSVTPDMMEEMYKKAHAAIRENPVYEKKPKKEVKKKRKQVGNGSHVGQTVGVIV
QTRSLALRDVAEAKGTRWNRPKMSLAQKKDRVAQKKASFLRAQERAAES

>gi568815597f:92733388_92941862|GENSCAN_predicted_CDS_4|1230_bp
atgtgtgaacagctctgtgaaggaaaagaactaaggcctcccatcaacattcaggaccaa
cttgacagccatgccaatgagccacctcaaaagcaaatctttcagccccagtcaagcctt
cagattacttcagtaccagctcacatcagactgaatctcatgaaaaaccaaaagtcagaa
ccagtcagcaagctgctgctaaattcctattacatagaaactgtgaatgcaccattgcat
tctagcctaggcaacagaaaatacagcttcatcttctggcagcttgctggccacactcct
ccttctgagggtaaaactgattattatgctcggaaacgcttggtgatacaagataaaaat
aaatacaacacacccaaatacaggatgatagttcgtgtgacaaacagagatatcatttgt
cagattgcttatgcccgtatagagggggatatgatagtctgcgcagcgtatgcacacgaa
ctgccaaaatatggtgtgaaggttggcctgacaaattatgctgcagcatattgtactggc
ctgctgctggcccgcaggcttctcaataggtttggcatggacaagatctatgaaggccaa
gtggaggtgactggtgatgaatacaatgtggaaagcattgatggtcagccaggtgccttc
acctgctatttggatgcaggccttgccagaactaccactggcaataaagtttttggtgcc
ctgaagggagctgtggatggaggcttgtctatccctcacagtaccaaacgattccctggt
tatgattctgaaagcaaggaatttaatgcagaagtacatcggaagcacatcatgggccag
aatgttgcagattacatgcgctacttaatggaagaagatgaagatgcttacaagaaacag
ttctctcaatacataaagaacagcgtaactccagacatgatggaggagatgtataagaaa
gctcatgctgctatacgagagaatccagtctatgaaaagaagcccaagaaagaagttaaa
aagaagagaaaacaggttgggaatggttcccacgtggggcagactgttggtgtaattgtg
caaactcgatcactagctctgcgtgatgtggcagaagcgaagggaaccaggtggaaccgt
cccaaaatgtcccttgctcagaagaaggatcgggtagctcaaaagaaggcaagcttcctc
agagctcaggagcgggctgctgagagctaa

>gi568815597f:92733388_92941862|GENSCAN_predicted_peptide_5|379_aa
MEKSGANLIGFPLKCDKYKTGVIDGPACNSLCVTETLYFGKCLSTKPNNQMYLGIWDNLP
GVVKCQMEQALHLDFGTELEPRKEIVLFDKPTRGTTVQKFKEMVYSLFKAKLGDQGNLSE
LVNLILTVADGDKDGQVSLGEAKSAWALLQLNEFLLMVILQDKEHTPKLMGFCGDLYVME
SVEYTSLYGISLPWVIELFIPSGFRRSMDQLFTPSWPRKAKIAIGLLEFVEDVFHGPYGN
FLMCDTSAKNLGYNDKYDLKMVDMRKIVPETNLKELIKDRHCESDLDCVYGTDCRTSCDQ
STMKCTSEVIQPNLAKACQLLKDYLLRGAPSEIREELEKQLYSCIALKVTANQMEMEHSL
ILNNLKTLLWKKISYTNDS

>gi568815597f:92733388_92941862|GENSCAN_predicted_CDS_5|1140_bp
atggagaaatctggggccaatctgattggtttccccttaaagtgtgacaagtacaagact
ggagttattgatgggcctgcatgtaacagcctttgtgttacagaaactctttactttgga
aaatgtttatccaccaagcccaacaatcagatgtatttagggatttgggataatctacca
ggtgttgtgaaatgtcaaatggaacaagcgcttcatcttgattttggaactgaattggaa
ccaagaaaagaaatagtgctatttgataagccaactagaggaactactgtacaaaaattt
aaagaaatggtctatagtctctttaaggcaaaattgggtgaccaaggaaacctctctgaa
ctggttaatctcatcttgacggtggctgatggagacaaagatggccaggtttccttggga
gaagcaaagtcggcatgggcacttcttcaactgaatgaatttcttctcatggtgatactt
caagataaagaacatacccccaaattaatgggattctgtggtgacctctatgtgatggaa
agtgttgaatatacctctctttatggaataagccttccttgggtcattgaactttttatt
ccatctgggttcagaagaagcatggatcagctgttcacaccatcatggccaagaaaggcc
aaaatagccataggacttctagaatttgtggaagatgttttccatggcccctacggaaat
ttcctcatgtgcgatactagtgccaaaaacctaggatataatgataagtatgatttgaaa
atggtggatatgagaaaaattgtgccagagacaaacctgaaagaacttattaaggatcgt
cactgtgagtctgatttggactgtgtctatggcacagattgtagaactagctgtgatcag
agtacaatgaagtgtacttcagaagtgatacaaccaaacttggcaaaagcttgtcagtta
ctcaaagactacctactgcgtggtgctccaagtgaaattcgtgaagaattagaaaagcag
ctttattcttgtattgctctcaaagtcacagcaaatcaaatggaaatggaacattctttg
atactaaataacctaaaaacattattgtggaagaaaatttcctacactaatgactcttag

>gi568815597f:92733388_92941862|GENSCAN_predicted_peptide_6|346_aa
MDRGKGGNLLPLTGQAPGIGKYPNRITGCMHEVPLVFHKASADFLSPVGAFRQHKVSASQ
IPAHAQNQQIPPRVEYCTYIIQLPTKTTNQEKKRYFIRTYEKRASRELCDECTSLSSRFD
QLEERVSVMEDQMNEMKPGETFREKRIERNEQSLQEICDYVKRPNLHLIGVTESDGENGT
KLENTLQDIIEENFPKLARQANIQIQEIQRTPQRYSSRRATPRHIIVRVTKVEMKEKMLR
AAREKGQVTHKGKPIRLTADLSAETLQARREWGPIFNMLKEKNFQPRISYPAKLSFISEG
EIKSFTDKQMLRDFVTIRPALKELLKEALNMERNNRYQPLQKHAKL

>gi568815597f:92733388_92941862|GENSCAN_predicted_CDS_6|1041_bp
atggatcggggtaaaggtgggaatctgcttccactgactggtcaagccccgggaattggc
aaatatcccaacaggataactggctgtatgcatgaggttcccctggttttccacaaagcc
tctgctgatttcttgtccccagtaggggctttccgccaacataaggtttctgcttctcag
atccctgcccatgcccagaatcagcaaattccccccagggtagaatactgcacatatata
atccagcttccaacaaaaaccactaaccaggaaaagaaaagatatttcatccgcacatat
gagaagcgagcctcacgagaactatgtgatgaatgcacaagtctcagtagccgattcgat
caactggaagaaagggtatcagtgatggaagatcaaatgaatgaaatgaagccaggagag
acgtttagagaaaaaagaatagaaagaaatgaacaaagcctccaagaaatatgtgactat
gtgaaaagaccaaatctacatctgattggtgtaactgaaagtgacggggagaatggaacc
aagttggaaaacactctgcaggatattatcgaggagaacttccccaagctagcaaggcag
gccaacattcagattcaggaaatacagagaacgccacaaagatactcctcgagaagagca
actccaagacacataattgtcagagtgaccaaagttgaaatgaaggaaaaaatgttaagg
gcagccagagagaaaggtcaggttacccacaaagggaagcccatcagactaacagcggat
ctctcggcagaaactctacaagccagaagagagtgggggccaatattcaacatgcttaaa
gaaaagaattttcaacccagaatttcatatccagccaaactaagcttcataagtgaagga
gaaataaaatcctttacagacaagcaaatgctgagagattttgtcaccatcaggcctgcc
ctaaaagagctcctgaaggaagcactaaacatggaaaggaacaaccggtaccagccactg
caaaaacatgccaaactgtaa

>gi568815597f:92733388_92941862|GENSCAN_predicted_peptide_7|236_aa
MGDFNTPLSTLDRSMRQKVNKDIQELNSALHQADLIGIYRTLHPKSTEYTFFSAPHHTYS
KTDHVVGSKALLIKWKTTEIITNCLSDHSAIKLELRIKKLTQNRSTTWKLNNLFLNDYWV
HNEMKAEIKMFFETNKNKDTTYQNLWNTFKAVCRGKFIALNAHKRKQERSKIDTLTSQLK
ELEKLEQTHSKASRRQEITKITAELKEIETQKTLQKFNESRSWFFEKINKIDRPLA

>gi568815597f:92733388_92941862|GENSCAN_predicted_CDS_7|711_bp
atgggagactttaacaccccactgtcaacattagacagatcaatgagacagaaagtcaac
aaggatatccaggaattgaactcagctctgcaccaagcggacctaataggcatctacaga
actctccaccccaaatcaacagaatatacattcttttcagcaccacaccacacctattcc
aaaactgaccacgtagttggaagtaaagcactcctcatcaaatggaaaacaacagaaatt
ataacaaactgtctctcagaccacagtgcaatcaaactagaactcaggattaagaaactc
actcaaaaccgctcaactacatggaaactgaacaatctgttcctgaatgactactgggta
cataatgaaatgaaggcagaaataaagatgttctttgaaaccaacaagaacaaagataca
acataccagaatctctggaacacattcaaagcagtgtgtagagggaaatttatagcacta
aatgcccacaagagaaagcaggaaagatctaaaattgacaccctaacatcacaattaaaa
gaactagagaagctagagcaaacacattcaaaagctagcagaaggcaagaaataactaag
atcacagcagagctgaaggaaatagagacacaaaaaacccttcaaaaattcaatgaatcc
aggagctggttttttgaaaagatcaacaaaatagatagaccgctagcttga

>gi568815597f:92733388_92941862|GENSCAN_predicted_peptide_8|216_aa
MTIYPKERGVSSSDSRELVQFQGSRLFSLKCGLAIVLVLCEDDDHEGLLVGHLADVTWSV
PECFIISSGETFRVYNCIEKAIVFWFLIAIETHSGWTNCERCPRKGEPKGENTKMTISDQ
IQRNAFEDRRTVGPEEPGSVGEQLCVSLGQDISGVEVLSGERPAPRSWAVGRHYITRTDQ
GELKKEGVNAGIEDKDKTVYLEEGVRGHLASSGQGP

>gi568815597f:92733388_92941862|GENSCAN_predicted_CDS_8|651_bp
atgactatataccccaaggagaggggtgtcagcagctcagactctagggagctagtccag
ttccagggaagcaggttattttcattaaaatgtggtttagctattgttttggttctttgt
gaggatgatgaccacgaggggcttctagtcggccatcttgctgatgtcacttggagcgtg
cctgagtgttttattatttcctcaggagagacattcagagtttataactgtattgagaaa
gcaattgtgttttggtttttaattgctatagaaacacactcaggatggacaaattgtgaa
cgatgtcccagaaaaggagagcccaaaggagagaacacaaagatgaccatatcagaccaa
atccagaggaatgcttttgaggacaggaggacggtgggacctgaggaacctggctcagtg
ggagagcaactgtgcgtcagtcttgggcaggacatctctggagtggaagtactgtctgga
gaaagaccagctcctcggagttgggcagtagggcgccactatataacccgcacggatcaa
ggggaactgaaaaaagagggagtgaatgcgggaatagaagacaaagacaaaacagtatat
ttggaagaaggagtcagggggcaccttgcctctagtggacaagggccctga