GENSCAN 1.0 Date run: 4-Nov-116 Time: 22:53:46 Sequence gi568815592f:142047288_142318640 : 271353 bp : 36.37% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 2407 2444 38 0 2 95 72 31 0.298 1.76 1.02 Term + 15448 15619 172 1 1 23 45 231 0.870 8.62 1.03 PlyA + 16411 16416 6 1.05 2.05 PlyA - 16525 16520 6 1.05 2.04 Term - 25021 24875 147 1 0 103 36 71 0.402 0.32 2.03 Intr - 28762 28457 306 1 0 75 74 95 0.497 2.52 2.02 Intr - 31616 31268 349 1 1 64 115 231 0.277 17.83 2.01 Init - 41371 40950 422 1 2 79 110 421 0.099 39.31 2.00 Prom - 46350 46311 40 -1.45 3.03 PlyA - 46989 46984 6 1.05 3.02 Term - 48369 47287 1083 0 0 -33 38 456 0.524 19.61 3.01 Init - 61882 61754 129 1 0 73 64 125 0.337 8.80 3.00 Prom - 64028 63989 40 -5.05 4.00 Prom + 65941 65980 40 -4.95 4.01 Init + 68309 68375 67 1 1 73 74 89 0.682 7.29 4.02 Intr + 81046 81245 200 2 2 66 107 41 0.034 1.95 4.03 Intr + 86563 86745 183 0 0 39 108 51 0.003 1.26 4.04 Intr + 97217 97313 97 1 1 116 51 63 0.005 4.06 4.05 Intr + 99945 100112 168 1 0 49 74 80 0.003 1.70 4.06 Intr + 118941 119035 95 1 2 106 87 48 0.290 5.26 4.07 Intr + 122263 122390 128 0 2 57 108 144 0.995 11.86 4.08 Intr + 123059 123134 76 2 1 107 81 50 0.956 4.80 4.09 Intr + 142139 142247 109 1 1 54 80 71 0.417 1.84 4.10 Intr + 151152 151328 177 1 0 45 75 181 0.967 11.47 4.11 Intr + 156698 156778 81 0 0 111 110 49 0.986 8.09 4.12 Intr + 159756 159863 108 1 0 25 13 153 0.550 0.84 4.13 Term + 160040 160179 140 0 2 83 49 102 0.882 2.94 4.14 PlyA + 160375 160380 6 1.05 5.04 PlyA - 163938 163933 6 1.05 5.03 Term - 166571 165834 738 2 0 54 32 326 0.154 16.07 5.02 Intr - 173963 173709 255 2 0 75 75 88 0.005 2.92 5.01 Init - 184015 183347 669 1 0 58 38 198 0.163 7.04 5.00 Prom - 219265 219226 40 -4.85 6.03 PlyA - 220380 220375 6 1.05 6.02 Term - 221996 221809 188 0 2 80 41 150 0.849 6.17 6.01 Init - 222796 222709 88 1 1 98 33 37 0.380 0.05 6.00 Prom - 229625 229586 40 -5.05 7.00 Prom + 230876 230915 40 -6.15 7.01 Init + 231094 231260 167 0 2 90 94 96 0.881 9.55 7.02 Intr + 232882 232919 38 1 2 56 83 19 0.367 -4.81 7.03 Term + 234335 234987 653 0 2 68 43 222 0.765 8.91 7.04 PlyA + 236355 236360 6 1.05 8.03 PlyA - 236548 236543 6 1.05 8.02 Term - 254622 254117 506 1 2 38 32 257 0.474 8.72 8.01 Intr - 254958 254778 181 0 1 72 72 109 0.477 6.22 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 100081 99757 325 1 1 46 62 249 0.869 13.70 S.002 Term + 171211 171356 146 2 2 110 48 202 0.964 15.49 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592f:142047288_142318640|GENSCAN_predicted_peptide_1|69_aa MGINQGVVTAGIGEDLGKDVEFEVVGDAPEKVGPKQAEDAAKSITKGSDDGAQPSTSTAQ EQDDVLIVD >gi568815592f:142047288_142318640|GENSCAN_predicted_CDS_1|210_bp atgggcattaaccagggagtggttacagcaggcattggtgaagacctaggaaaggatgtt gaatttgaagttgttggtgatgccccagaaaaagtggggcccaaacaagctgaagatgct gccaaaagcataaccaaaggcagtgatgatggagctcagccctccacctccacagctcaa gagcaagatgatgttctcatagttgattag >gi568815592f:142047288_142318640|GENSCAN_predicted_peptide_2|407_aa MPSKSLSNLSVTTGANESGSVPEGWERDFLPASDGTTTELVIRCVIPSLYLLIITVGLLG NIMLVKIFITNSAMRSVPNIFISNLAAGDLLLLLTCVPVDASRYFFDEWMFGKVGCKLIP VIQLTSVGVSVFTLTALSADRYRAIVNPMDMQTSGALLRTCVKAMGIWVVSVLLAVPEAV FSEVARISSLDNSSFTACIPYPQTDELHPKIHSVLIFLVYFLIPLAIISIYYYHIAKTLI KSAHNLPGEYNEHTKKQMETRKRLAKIVLVFVGCFIFCWFPNHILYMYRSFNYNEIDPSL GHMIVTLVARVLSFGNSCVNPFALYLLSESFRRHFNSQLCCGRKSYQERGTSYLLSSSAV MPIPEVCLHSLGQLCVCGFASIALYWLLSWAGVKYLWLFQVHVASYL >gi568815592f:142047288_142318640|GENSCAN_predicted_CDS_2|1224_bp atgccctctaagtctctttccaacctctcggtgaccaccggcgcgaatgagagcggttcc gttcccgaggggtgggaaagggatttcctgccggcctcggacgggaccaccacggagttg gtgatccgctgtgtgatcccgtccctctacctgctcatcatcaccgtgggcttgctgggc aacatcatgctggtgaagatcttcatcaccaacagcgccatgaggagcgtccccaacatc ttcatctctaacctggcggccggggacttgctgctgctgctcacctgcgtcccggtggac gcctcgcgctacttcttcgacgagtggatgtttggcaaggtgggctgcaaactgatccct gtcatccagctcacttccgtgggggtttccgtgttcactctcactgccctcagcgccgac aggtacagagccatcgttaaccccatggacatgcagacgtcaggggcattgctgcggacc tgtgtgaaggccatgggtatctgggtggtctccgtgttgctggcagttcccgaagcggtg ttttcagaagtggctcgcatcagtagcttggataatagcagcttcacagcatgtatccca taccctcaaacagatgaattacatccaaagattcattcagtgctcattttcttggtctat ttcctcataccacttgctattattagcatttattattatcatattgcaaagaccttaatt aaaagcgcacacaatcttcctggagaatacaatgaacataccaaaaaacagatggaaaca cggaaacgcctggctaaaattgtgcttgtctttgtgggctgtttcatcttctgttggttt ccaaaccacatcctttacatgtatcggtctttcaactataatgagattgatccatctcta ggccacatgattgtcaccttagttgcccgggttctcagttttggcaattcttgtgtcaac ccatttgctctttacctactcagtgaaagcttcaggaggcatttcaacagccaactctgc tgtgggaggaagtcctatcaagagagaggaaccagctacctactcagctcttcagcggtc atgccgataccagaggtgtgcttacacagccttgggcagctctgcgtctgtggctttgcg agtatagccctctactggctgctttcatgggctggtgttaagtatctgtggcttttccag gtgcatgttgcaagctatttgtag >gi568815592f:142047288_142318640|GENSCAN_predicted_peptide_3|403_aa MCLKSLRLAMQAMKTAEAGSCTLQSHRVGAAQGCGSPPHASAEHHPDTKAWQRHKKKENF RPISLRNINAKILNEILANRIQQHIKKLIHHDQVGFIPGMQGWFNICKSINIIQYINRTN DKNHMIISIDAEKALDKIQQHFMLKSLNKLGIDGTYLQIIRAIYDKPTANIILNGQKPEA FPLKTGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLEN PIVSAQNLLKRIGNSSKFSGYKINVQKSQAFLYTNNRQTENQIMSEFPFTIASKRIKYLG IQLTRDVKDLFKENYKPLLNEIKEDTKKWKNIPCSWVGRINIVEMDILPKVIYRFNAIPI KLPMTFFKELEKNSFKVHMEPKKSPHSQVNPKPKEQSCRHHLT >gi568815592f:142047288_142318640|GENSCAN_predicted_CDS_3|1212_bp atgtgcctgaaaagcctcagactcgcaatgcaagctatgaaaacagctgaagcagggagc tgtaccctgcaaagccacagggttggagctgcccaaggctgtggaagcccacctcatgca tcagctgagcatcatcctgataccaaagcctggcagagacacaagaaaaaagagaatttt agaccaatatccctgaggaacatcaatgcaaaaatcctcaatgaaatactagcaaaccga atccagcagcacatcaaaaagcttatccaccatgatcaagtgggcttcatccctgggatg caaggctggttcaacatatgcaaatcaataaacataatccagtatataaacagaaccaat gacaaaaaccacatgattatctcaatagatgcagaaaaggccttggacaaaattcaacaa cacttcatgctaaaatctctcaataaattaggtattgatgggacgtatctccaaataatc agagctatctatgacaaacccacagccaatatcatactgaatgggcaaaaaccggaagca ttccctttaaaaactggcacaagacagggatgccctctctcaccactcctattcaacata gtgttggaagttctggccagggcaattaggcaggagaaggaaataaagggtattcaatta ggaaaagaggaagtcaaattgtccctgtttgcagatgacatgattgtatatctagaaaac cccattgtctcagcccaaaatctccttaagcggataggcaactccagcaaattctcagga tacaaaatcaatgtgcaaaaatcacaagcattcttatacaccaataacagacaaacagag aaccaaatcatgagtgaattcccattcacaattgcttcaaagagaataaaatacctagga atccagcttacaagggacgtgaaggacctcttcaaggagaactacaaaccactgctcaat gaaataaaagaggatacaaagaaatggaagaacattccatgctcatgggtaggaagaatc aatatcgtggaaatggacatactgcccaaggtaatttatagattcaatgccatccccatc aagctaccaatgactttcttcaaagaattggaaaaaaactcctttaaagttcatatggaa ccaaaaaagagcccgcatagccaagtcaatcctaagccaaaagaacaaagctgcaggcat caccttacctga >gi568815592f:142047288_142318640|GENSCAN_predicted_peptide_4|542_aa MEKLLRLDLNYTSVEVVEMEKRAMHEQASTDIFLGPIALVMTSSTMLNISGKCGHPYFVL DLRGKACSFFSIDYDVSSGLFIYGLYYVEVKPPTVIGQFHTLFFGSIRIFFLGVLGFAVY GNEALHFICDPDKREVNLFCYNQFRPITPQLCPPLTCQGSSPVLEGDVRSFRNWSEMNGE EEYGSGSAGGARVGSGEFGVEMAALAPLPPLPAQFKSIQHHLRTAQEHDKRDPVVAYYCR LYAMQTGMKIDSKTPECRKFLSKLMDQLEALKKQLGDNEAITQEIVGCAHLENYALKMFL YADNEDRAGRFHKNMIKSFYTASLLIDVITVFGELTDENVKHRKYARWKATYIHNCLKNG ETPQAGPVGIEEDNDIEENEDAGAASLPTQPTQPSSSSTYDPSNMPSGNYTGIQIPPGAH APANTPAEVPHSTGVASNTIQPTPQTIPAIDPALFNTISQENEKQIYSTEIDTSNNPEPK YEDEIVPGATEKWKNSSSIMTACRENITEDQQRLLFPAPETSLCNLAKGDTKSEWLFSST ML >gi568815592f:142047288_142318640|GENSCAN_predicted_CDS_4|1629_bp atggagaaacttttaaggctagacctgaactacactagtgtagaagtggtggaaatggag aagagagcaatgcatgaacaggctagtacagatatctttttgggtccaattgctctggtt atgacttctagtactatgttgaatataagtggcaaatgtgggcatccttactttgtactg gatcttagaggaaaagcttgcagttttttttccattgattatgatgttagcagtgggctt ttcatatatggcctttattatgttgaggttaaacctccaactgtgattggtcaattccac acccttttctttggatcgatccgaatattcttcctcggggtgctaggctttgcagtttat gggaatgaggccttgcacttcatttgcgatccagacaaaagagaagtaaacctcttctgt tacaatcagttcaggccaatcactccacaactctgccctccccttacttgccaaggttct agcccagtcctagagggtgatgtcagatccttcagaaactggtctgagatgaatggagaa gaagaatacggaagcggaagtgccggtggagcgcgagtaggaagtggtgagttcggagta gagatggccgcgcttgcaccgctgcccccgctccccgcacagttcaagagcatacagcat catctgaggacggctcaggagcatgacaagcgagaccctgtggtggcttattactgtcgt ttatacgcaatgcagactggaatgaagatcgatagtaaaactcctgaatgtcgcaaattt ttatcaaagttaatggatcagttagaagctctaaagaagcagttgggtgataatgaagct attactcaagaaatagtgggctgtgcccatttggagaattatgctttgaaaatgtttttg tatgcagacaatgaagatcgtgctggacgatttcacaaaaacatgatcaagtccttctat actgcaagtcttttgatagatgtcataacagtatttggagaactcactgatgaaaatgtg aaacacaggaagtatgccagatggaaggcaacatacatccataattgtttaaagaatggg gagactcctcaagcaggccctgttggaattgaagaagataatgatattgaagaaaatgaa gatgctggagcagcctctctgcccactcagccaactcagccatcatcatcttcaacttat gacccaagcaacatgccatcaggcaactatactggaatacagattcctccgggtgcacac gctccagctaatacaccagcagaagtgcctcacagcacaggtgtagcaagtaatactatc caacctactccacagactatacctgccattgatcccgcacttttcaatacaatttcccag gaaaacgaaaaacaaatatacagcactgagattgacaccagcaacaacccagagcccaag tatgaggatgaaatagttcctggggccacagagaagtggaaaaactccagtagcatcatg actgcctgcagggaaaatatcactgaagaccagcagagactattattcccagccccagaa acttcactgtgcaacttggccaaaggagacaccaaatcagaatggctgttcagcagtacc atgctgtag >gi568815592f:142047288_142318640|GENSCAN_predicted_peptide_5|553_aa MIVYLENPIVSAPNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSGCPFTITA KRIKYLGIQLTRDVKDLFKENYKPVLNKIKEDTNKWKSISCSWIGRINIMKRAILPKVTY TFNAIPIKLPLTSFTELEKTTLKFIWNQKRARIAKTILSKKNKAGGITLPDFKLYYKCTV TKTAWCQYQNRYIDQWNRTEASEITPHIYNHLIFDKPDKNKQWHLSLFNTEVPIVWSPEL SSKFAGDADAGRPETPLRTGLRRISFLRLAIFLIWVELNHSNFRDHLKCLLKLRRLGPIP RVSDWVGQVGDLEGKNGSMSQIQGQLLCVALGHGTPAAPAPAIAKRDQGVAWAMATPRCG VGPMGAQKARGEVWESLPRFQRMYGNAQMSRQKCAALVEPSWNTSTRAVQRRYGGLEPPC SMPTGALPSGAVRRGPPSSRPQNGRPTDSLHHAPGKVTGTQHQPVKAAKGAVPCRATGVE LPKALGAHPLHWHDLEVRHGVKGDYFGALQLNDFPGGLQTCMGPVAPLFWPISPIKNGSI YPVPAPSLYLRSN >gi568815592f:142047288_142318640|GENSCAN_predicted_CDS_5|1662_bp atgattgtatatttggaaaaccccatcgtctcagccccaaatctccttaagctgataagc aacttcagcaaagtctcaggatacaaaatcaacgtgcaaaaatcacaagcattcctatac accaataatagacaaacagagagccaaatcatgagtgggtgcccattcacaattactgca aagagaataaaatacctaggaatacaacttacaagggatgtgaaggacctcttcaaggag aactacaaaccagtgctcaacaaaataaaagaggacacaaacaaatggaagagcatttca tgctcatggataggaagaatcaatatcatgaaaagggccatactgcccaaggtaacttat acattcaatgctatccccatcaagctaccactgacttccttcacagaattggaaaaaact actttaaagttcatatggaaccaaaaaagagcacgcatagccaagacaattctaagcaaa aagaacaaagctggaggcatcacactacctgacttcaaactatactacaagtgtacagta accaaaacagcatggtgccagtaccaaaacagatatatagaccaatggaacagaacagag gcctcagaaataacaccacacatctacaaccatctgatctttgacaaacctgacaaaaac aaacaatggcatttatcactatttaatacagaagttcccatagtgtggtccccagaactt tctagcaagttcgcaggtgatgctgatgccggtcgtccagagacacctttgagaaccggt ttgaggagaatctcttttctcagacttgcaatatttctgatttgggtagagctaaaccac tcgaactttagggaccaccttaagtgtctcttaaaactcagaaggctgggacccatccct agagtttctgattgggtaggacaggttggagatctagaaggaaaaaatggttccatgagc cagatacagggccagctgctctgtgtagccttgggacatggcaccccagctgctccagct ccagccatagctaaaagggaccaaggtgtggcttgggccatggccacaccaaggtgtggt gttgggcctatgggtgcgcagaaggcaagaggtgaggtttgggaatctctgcctagattt caaagaatgtatggaaatgcccagatgtctaggcagaagtgtgctgcactggtggagccc tcatggaacacctctacaagagcagtgcagagaagatatggggggttggagcccccatgc agcatgcccactggggcactgcctagtggagctgtgagaagagggccaccatcctctaga cctcagaatggtagacctactgacagcttacatcatgcacctggaaaagtcacaggcact caacatcagcctgtgaaagcagccaagggggctgtaccctgcagagccacaggggtggag ctgcccaaggccttgggagcccaccccttgcattggcatgacctggaagtgagacatgga gtcaaaggagattattttggagctttacaattgaatgactttcctggtgggcttcagact tgcatggggcctgtagcccctttgttttggccaatttctcccattaagaatgggagcatt tatccagtgcctgcaccctcactgtatctcagaagtaactaa >gi568815592f:142047288_142318640|GENSCAN_predicted_peptide_6|91_aa MVEGERQVLHSSRQERMRAKQKSYKTIRSCDKHAEPHLALPLLSEVTGEAKWMIHHLDNF LSNWSQRMPFYPSCSDTRGATGRVCGQGKGP >gi568815592f:142047288_142318640|GENSCAN_predicted_CDS_6|276_bp atggtggaaggtgaaagacaagtcttgcatagcagcagacaagagagaatgagagccaag caaaaatcttataaaaccatcagatcttgtgacaagcatgctgagccacatctagcacta cctctcctcagtgaagtgactggggaagccaaatggatgatccaccacctggacaacttc ttgagcaactggtctcagagaatgccattttacccatcgtgttccgacactaggggagcc acgggaagagtttgtggccaaggaaaaggcccatag >gi568815592f:142047288_142318640|GENSCAN_predicted_peptide_7|285_aa MIQLPPIRSLPRYVGIMGATIQDEIWVGIQPNPIRGCIFSSSGVIHDFDFSQELQSRKQE CFLVPINCVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAPNLLKLI SNFSKVSGYKIDVQKSQAFLYTNNRQTESQIMSELPFTITTNRIKYLGIQLTRDVKELFK EHYKPLLNEIKEDTNKWKNIPCSWIGRINIMKMAILPKVTYTFNAIPIKLPQTFFTELEK TTLKFIWNQKRACIAKTILSKKNKVGGIKLPDFKLYYKATVTKTP >gi568815592f:142047288_142318640|GENSCAN_predicted_CDS_7|858_bp atgattcaattacctcccatccggtctctcccacgatatgtgggaattatgggagctaca attcaagatgagatttgggtggggatccagccaaaccctatcagaggttgtatcttcagt agctctggggtaattcacgactttgatttctctcaggaacttcagagtagaaagcaagaa tgtttcctcgttcctatcaactgtgtgttggaagtcctggctagggcaatcaggcaagag aaagaaataaagggtattcaattaggaaaagaggaagtcaaattgtctctgtttgcagat gacatgattgtatatttagaaaaccccattgtctcagccccaaatctccttaagctgata agcaatttcagcaaagtctcaggatacaaaatcgatgtgcaaaaatcacaagcattccta tacaccaataatagacaaacagagagccaaatcatgagtgaactcccattcacaattact acaaacagaataaaatacctaggaatacaacttacaagggatgtgaaagaactcttcaag gagcactacaaaccactgctcaatgaaataaaagaggacacaaacaaatggaagaacatt ccatgctcatggataggaagaatcaatatcatgaaaatggccatactgcccaaggtaact tatacattcaatgctatccccatcaagctaccacagactttcttcacagaattggaaaaa actactttaaagttcatatggaaccaaaaaagagcctgcatagccaagacaattctaagc aaaaagaacaaagttggaggcatcaagctacctgacttcaaactatactacaaggctaca gtaaccaaaacaccatga >gi568815592f:142047288_142318640|GENSCAN_predicted_peptide_8|228_aa GEPRRPSPAPRRDPPLRRFWRVHGLSRRRKVNPGRGSVQRRACGRGSRLAAGPSSQGSWQ GDSPRTRALRPRFRRDWSWAQVRRPRRRAAYDPTPAALVWALRGWGCPEGAAAPHRIFNP RAEREKSPRGAGQALLERAAKGEWPGSAMGAEPGSAPQLGGDRADPEVAETPGALNPRDP RREKGEAGKAGAPVASPQRLQPTRVPARVPPPRSRAVALLHFSATFFT >gi568815592f:142047288_142318640|GENSCAN_predicted_CDS_8|687_bp ggggaacccaggcgccccagccctgcgccgcggcgggaccctcctttacgccgtttctgg agagttcatgggctttcccgacggcgcaaggtgaaccctgggcgcggcagcgttcagcgc agggcctgcggacgaggaagcaggctggcggcaggtccctcctcgcagggaagttggcag ggcgactctccccggacccgtgcgctccgcccgcgcttccggcgtgactggagctgggct caggtgcggcggccccggcggcgtgcggcgtatgacccgactccggctgccctggtctgg gcgctccgcggctggggctgtcccgaaggcgcagctgccccacacaggatcttcaacccc agggccgaaagggagaagtcgccccgcggggcgggacaggcgcttttggagcgagcagcg aagggagagtggccggggtcggcgatgggcgcagagccgggctcagctccccagctggga ggggacagggcggacccggaggtcgccgaaacgccgggagcgctgaaccccagagaccct cggcgagagaaaggagaagctgggaaggcaggcgctcctgtggcatctccccagcgtttg cagcctacccgagtccccgcgcgggtccccccgccacgctcccgggctgttgctttgctt cacttttctgccactttctttacttaa