GENSCAN 1.0  Date run: 4-Nov-116  Time: 14:34:09

Sequence gi568815594r:4167892_4383742 : 215851 bp : 45.70% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.11 PlyA -    207    202    6                              -3.24
 1.10 Term -   2851   2436  416  2  2  104   37   342 0.821  26.12
 1.09 Intr -   3348   3247  102  1  0  -10   94   114 0.821   2.35
 1.08 Intr -   7380   7093  288  1  0   87  -80   253 0.531   5.62
 1.07 Intr -  13475  13455   21  0  0   83   89    40 0.285   1.12
 1.06 Intr -  21082  20923  160  1  1  118   50   175 0.587  16.26
 1.05 Intr -  30212  29275  938  0  2   89   14  1186 0.501 102.21
 1.04 Intr -  34687  34557  131  0  2   96  109   171 0.998  20.34
 1.03 Intr -  38239  38181   59  1  2  102   93   -28 0.164  -3.22
 1.02 Intr -  45113  44977  137  0  2   73   97    22 0.123   1.89
 1.01 Init -  58973  58571  403  2  1   63  101   698 0.994  63.39
 1.00 Prom -  63106  63067   40                              -2.06

 2.05 PlyA -  63862  63857    6                               1.05
 2.04 Term -  68596  68578   19  0  1  117   38    11 0.314  -3.21
 2.03 Intr -  72588  72430  159  2  0  114  111    40 0.897   8.00
 2.02 Intr -  78452  78311  142  0  1   94  110    72 0.997   9.41
 2.01 Init -  80311  80215   97  1  1   94   76   194 0.806  17.33
 2.00 Prom -  87535  87496   40                              -3.86

 3.10 PlyA -  88964  88959    6                               1.05
 3.09 Term - 100132  99998  135  1  0   49   47   141 0.830   4.12
 3.08 Intr - 100724 100639   86  0  2   58   86    52 0.697   1.54
 3.07 Intr - 105778 105692   87  2  0    8   71   100 0.531   0.24
 3.06 Intr - 106878 106476  403  0  1   16  115   410 0.821  30.50
 3.05 Intr - 111639 111556   84  0  0   98   94    50 0.892   6.62
 3.04 Intr - 111858 111751  108  0  0  102   80    32 0.940   4.28
 3.03 Intr - 114006 113892  115  2  1   23   97   134 0.682   8.25
 3.02 Intr - 118681 118628   54  0  0   77   78    45 0.067   0.49
 3.01 Init - 122562 122288  275  0  2   65   62   135 0.018   3.06
 3.00 Prom - 126341 126302   40                              -3.16

 4.00 Prom + 128557 128596   40                              -8.76
 4.01 Init + 132055 132206  152  0  2   70   80    69 0.508   3.83
 4.02 Intr + 134098 135200 1103  1  2   70   93   451 0.199  33.34
 4.03 Intr + 138247 138293   47  2  2  105   75    11 0.177  -0.37
 4.04 Intr + 145150 145223   74  0  2  119   78    94 0.397   9.70
 4.05 Intr + 145825 145948  124  1  1   77   20   111 0.374   3.79
 4.06 Intr + 147446 147518   73  1  1   85   75    35 0.953   0.88
 4.07 Intr + 147918 148079  162  1  0  106  101    90 0.996  12.05
 4.08 Term + 152749 153425  677  2  2  123   41   715 0.993  64.18
 4.09 PlyA + 153870 153875    6                               1.05

 5.00 Prom + 154480 154519   40                              -8.26
 5.01 Init + 154645 154654   10  0  1   84   94    16 0.145   2.25
 5.02 Intr + 161955 162022   68  1  2  123   88     0 0.248   2.12
 5.03 Intr + 166895 167048  154  1  1   72   86   137 0.319  11.55
 5.04 Intr + 171700 171808  109  2  1  -14  103   109 0.143   1.54
 5.05 Intr + 180501 180580   80  0  2   69  101    76 0.221   6.19
 5.06 Intr + 180793 180943  151  2  1   45   37    75 0.034  -2.68
 5.07 Intr + 196058 196188  131  2  2   93   81    31 0.405   3.24
 5.08 Intr + 197575 197705  131  2  2   57   82    67 0.186   3.41
 5.09 Intr + 204753 204892  140  2  2   74   -3    84 0.306  -2.84
 5.10 Term + 209408 209618  211  2  1   45   37   219 0.362   9.27
 5.11 PlyA + 213747 213752    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 142495 142576   82  0  1   73   92    75 0.907   7.53
S.002 Intr + 144485 144621  137  0  2   47   93    29 0.925  -0.41

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815594r:4167892_4383742|GENSCAN_predicted_peptide_1|884_aa
MLEGLGSPASPRAAASASVAGSSGPAACSPPSSSAPRSPESPAPRRGGVRASVPQKLAEM
LSSQYGLIVFVAGLLLLLAWAVHAAGVSKSDLLCFLTALMLLQLLWMLWYVGRSSAHRRL
FRLKDTHAGAGWLRGSITLFAVITVILGCLKIGYFIGFSECLSATEGVFPVTHSVHTLLQ
VYFLWGHAKDIIQSFKTLERFGVIHSVFTNLLLWANGVLNESKHQLNEHKERLITLGFGN
ITTVLDDHTPQCNCTPPTLCTAISHGIYYLYPFNIEYQILASTMLYVLWKNIGRKVDSHQ
HQKMQFKSDGVMVGAVLGLTVLAATIAVVVVYLIHIGRSKTKSESALIMFYLYAITLLML
MGAAGLAGIRIYRIDEKSLDESKNPARKLDSDLLVGTASGSWLISWGSILAILCAEGHPR
YTWYNLPYSILAIVEKYIQNLFIFESIHREPEKLSEDIQTLRVVTVCNGNTMPLASSCPK
SGGVARDVAPQGKDMPPAANGNVCMRESHDKEEEKQEESSWGGSPSPVRLPRFLQGNAKR
KVLRNIAAFLFLCNISLWIPPAFGCRPEYDNGLEEIVFGFEPWIIVVNLAMPFSIFYRMH
AAASLFEVYYKMDEIAGKRCPNSTDPQNVTDVSRFLLLKLSEDPELQPVLAGLFLSMCLV
TVLGNLLIILAVSPDSHLHTSMYFFLSNLSLPDIGFPSPTVPKMVVDIQSHSNYILINIT
GWVYTYCDIERNIMLSPSLDIRSNNTARLRPKTGTQMPLREQRYTGIDKDGHLVERRAFG
YQPITCVNLLNWKNNTQNYTEKPQALIDFLQAVIQTHNPTLADWHQLLMFLFNSEERRRV
LQAVTKWLEEHAPADYQNPQEYGRTQVSGTDPIWTHMKERRFKG

>gi568815594r:4167892_4383742|GENSCAN_predicted_CDS_1|2655_bp
atgctcgagggcctggggtcgcccgcctcgccccgggcagctgcaagcgcctcggtcgca
gggtcgtcggggccagcggcctgctcgcctccctcgtcctcggccccgaggtccccggaa
tccccggccccccggcggggcggtgtgcgcgccagcgtcccacagaaactggccgagatg
ctgagcagccagtatgggctgatcgtgttcgtggcggggctgctgctgctgctggcctgg
gccgtgcacgccgcgggcgtgagcaagagcgacctgctgtgcttcctgacggcgctcatg
ctgctgcagctgctgtggatgctgtggtacgtgggccgcagctccgcgcaccgccgcctc
ttccgcctcaaggacacgcacgcgggtgccggctggctgcgcggtagtatcacattgttt
gcagtcattaccgtcatcctgggatgccttaaaattggatacttcattggattttcagaa
tgtttatcagccactgaaggagttttcccagtcacccattcagtgcatactttgttgcag
gtatattttctttgggggcatgcaaaggatattatccagtctttcaaaacactggaaagg
tttggagtgatccactcggtgttcaccaacctgcttctgtgggccaatggcgtcctcaat
gagtcaaagcaccaactcaatgagcacaaggaacggctcatcactctgggttttgggaac
ataacaacagttttagatgaccacacaccgcagtgtaactgcacgcccccaactctgtgc
actgccatctcccacgggatctactacctctaccccttcaacatagagtatcagatcctg
gcctccacaatgctctacgtcctgtggaagaacatcgggcgcaaagttgacagccatcag
caccagaagatgcagttcaagtctgatggggtcatggtgggcgcagtcctgggcctgacc
gtgctggccgccaccattgctgtggtggtggtatacctgattcatattgggcgctccaag
accaagagcgagtcggcactcatcatgttctacctgtatgccatcaccctgctgatgctt
atgggggctgcggggctggctggaatccggatttacaggatagacgagaagtcactggat
gagtccaaaaatccggcccgcaaactggactcggacctcttggtgggcactgcctcgggc
tcctggcttatctcctggggctcaatcttggccatcctctgtgctgagggccacccccgc
tacacctggtacaacctgccctactccatcctggcgatcgtggagaagtacatccagaac
ctcttcatctttgaatccattcaccgagagcctgaaaaactctctgaggacatccaaacc
cttcgggtggtcacagtctgcaatggcaacaccatgccccttgcttcttcctgccccaag
agtggaggtgtggccagagatgtggctccccagggcaaggacatgccaccagcagccaat
ggaaatgtgtgcatgagagaaagccatgacaaggaggaggagaagcaggaggagagcagc
tggggagggagcccaagcccagtccgccttccccgtttcttacagggcaacgccaagaga
aaagtcctgaggaatattgcagccttcttgttcctctgcaatatttcgctttggatacct
cccgcctttggctgtcgacctgagtatgacaatggattggaggagattgtctttggcttt
gaaccctggataattgtggtcaacctggccatgcctttttctattttctatcgaatgcac
gcagctgcctccctctttgaggtctattacaaaatggatgagattgcaggcaaaaggtgt
ccaaactctacagacccacagaatgtaacggatgtctctcgattcctcctcctcaaactc
tcagaggatccagaactgcagccggtccttgctgggctgttcctgtccatgtgcctggtc
acggtgctggggaacctgctcatcatcctggccgtcagccctgactcccacctccacact
tccatgtacttcttcctctccaacctgtccttgcctgacatcggtttcccctcccccacg
gtccccaagatggttgtggacatccaatctcacagcaattatattctgatcaatatcacc
ggctgggtgtacacctactgcgatattgaacgtaatatcatgctctctccctccctggac
attagaagcaataacacagctcgtttacgacccaaaacggggacacaaatgcccctgaga
gagcagcggtatactgggatagataaggatggtcacttggtggagaggcgtgcttttggc
taccagcccatcacctgcgtcaaccttctcaactggaaaaacaatacacagaactatacc
gaaaagccacaagccctaattgattttctccaagctgttatccagacccacaaccccacc
ttggctgattggcaccagttgctcatgttcctctttaacagcgaagaaaggcggagagtc
ctccaagcagtaactaagtggctagaagaacatgcaccagctgattatcaaaacccccaa
gagtatggaaggacccaggtgtcaggaaccgaccccatttggacccacatgaaagagagg
agattcaaaggctaa

>gi568815594r:4167892_4383742|GENSCAN_predicted_peptide_2|138_aa
MDSSRARQQLRRRFLLLPDAEAQLDREGDAGPETSTAVEKKEKPLPRLNIHSGFWILASI
VVTYYVDFFKTLKENFHTSSWFLCGSALLLVSLSIAFYCIVYLEWYCGIGEYDVKYPALI
PITTASFIAAGICPFFDS

>gi568815594r:4167892_4383742|GENSCAN_predicted_CDS_2|417_bp
atggactcctcgcgggcccggcagcagctccggcggcgattcctcctcctgccggacgcc
gaggcccagctggaccgcgagggtgacgccgggccggaaacctccacagctgttgagaaa
aaggagaaacctcttccaagacttaatatccattctggattctggattttggcatccatt
gttgtgacctattatgttgacttctttaaaacccttaaagaaaacttccacactagcagc
tggtttctctgtggcagtgccttgttgcttgtcagtttatcaattgcattttactgcata
gtctacctggaatggtattgtggaattggagaatatgatgtcaagtatccagccttgata
cccattaccactgcctcctttattgcagcaggaatttgccccttctttgactcttga

>gi568815594r:4167892_4383742|GENSCAN_predicted_peptide_3|448_aa
MPFWATFGRPASLPFRAIPTSASPARLRKAGHLTLGGRSLPRRLLFTCSSCLQGHSQRSA
GRGIITTTPPWLAGTAFLLPLASLPLGGAWPRQCLLSSIGVTEFLHDGFLGDDYKNHVKC
ISEDQKYGGKGYEGKTHKGDIKQQAWIQKISELIKRPNVSPKVRELLEQISAFDNVPRKK
AKFQNWMKNSLKVHNESILDQVWNIFSEASNSEPVNKEQDQRPLHPVANPHAEISTKVPA
SKVKDAVEQQGEVKKNKRERKEERQKKRKREKKELKLENHQENSRNQKPKKRKKGQEADL
EAGGEEVPEANGSAGKRSKKKKQRKDSASEEEARVGAGKRKRRHSEVETDSKKKKMKLPE
HPEGGEPEDDEAPAKGKFNWKGTIKAILKQAPDNEITIKKLRKKVLAQYYTVTDEHHRSE
EELLVIFNKKISKNPTFKLLKDKVKLVK

>gi568815594r:4167892_4383742|GENSCAN_predicted_CDS_3|1347_bp
atgcccttctgggccaccttcgggcgcccggcctcccttcccttccgggccattcccacc
agcgccagccccgcacgccttcgcaaggccggccaccttaccctcggaggccgctctctg
ccccgccgcctactcttcacctgcagcagctgccttcaaggccactcgcagcgatccgct
ggccgcggaatcatcacgacaacgcccccttggcttgccggaaccgccttcctgcttccg
ttggcgtcacttccactggggggggcgtggccgcggcaatgtttgctgtcttccattgga
gtgactgaatttctacatgacggctttttgggcgatgactataaaaaccacgtgaaatgc
ataagtgaagatcagaagtatggtggcaaaggctatgaaggtaaaacccacaaaggcgac
atcaaacagcaggcgtggattcagaaaattagtgaattaataaagagacccaatgtcagc
cccaaagtgagagaacttttagagcaaattagtgcttttgacaacgttcccaggaaaaag
gcaaaatttcagaattggatgaagaacagtttaaaagttcataatgaatccattctggac
caggtgtggaatatcttttctgaagcttccaacagcgaaccagtcaataaggaacaggat
caacggccactccacccagtggcaaatccacatgcagaaatctccaccaaggttccagcc
tccaaagtgaaagacgccgtggaacagcaaggggaggtgaagaagaataaaagagaaaga
aaggaagaacggcagaagaaaaggaaaagagaaaagaaagaactaaagttagaaaaccac
caggaaaactcaaggaatcagaagcctaagaagcgcaaaaagggacaggaggctgacctt
gaggctggtggggaggaagtccctgaggccaatggctctgcagggaagaggagcaagaag
aagaagcagcgcaaggacagcgccagtgaggaagaggcacgcgtgggcgcagggaagagg
aagcggaggcactcggaagttgaaacagattctaagaagaaaaagatgaagctcccagag
catcctgagggcggagaaccagaagacgatgaggctcctgcaaaaggtaaattcaactgg
aagggaactattaaagcaattctgaaacaggccccagacaatgaaataaccatcaaaaag
ctaaggaaaaaggttttagctcagtactacacagtgacagatgagcatcacagatccgaa
gaggaactcctggtcatctttaacaagaaaatcagcaagaaccctacctttaagttatta
aaggacaaagtcaagcttgtgaaatga

>gi568815594r:4167892_4383742|GENSCAN_predicted_peptide_4|803_aa
MDPVATHSCHLLQQLHEQRIQGLLCDCMLVVKGVCFKAHKNVLAAFSQYFRSLFQNSSSQ
KNDVFHLDVKNVSGIGQILDFMYTSHLDLNQDNIQVMLDTAQCLQVQNVLSLCHTFLKSA
TVVQPPGMPCNSTLSLQSTLTPDATCVISENYPPHLLQECSADAQQNKTLDESHPHASPS
VNRHHSAGEISKQAPDTSDGSCTELPFKQPNYYYKLRNFYSKQYHKHAAGPSQERVVEQP
FAFSTSTDLTTVESQPCAVSHSECILESPEHLPSNFLAQPVNDSAPHPESDATCQQPVKQ
MRLKKAIHLKKLNFLKSQKYAEQVSEPKSDDGLTKRLESASKNTLEKASSQSAEEKESEE
VVSCENFNCISETERPEDPAALEDQSQTLQSQRQYACELCGKPFKHPSNLELHKRSHTGE
KPFECNICGKHFSQAGNLQTHLRRHSGEKPYICEICGKRLAPSTLLPVLGLYVNQATYSI
IWWLCQDNRHQPGLCQSRSSVVMLNRTVSDLVLCKTEEESVMKSGFSNFSNLKEHKKTHT
ADKVFTCDECGKSFNMQRKLVKHRIRHTGERPYSCSACGKCFGGSGDLRRHVRTHTGEKP
YTCEICNKCFTRSAVLRRHKKMHCKAGDESPDVLEELSQAIETSDLEKSQSSDSFSQDTS
VTLMPVSVKLPVHPVENSVAEFDSHSGGSYCKLRSMIQPHGVSDQEKLSLDPGKLAKPQM
QQTQPQAYAYSDVDTPAGGEPLQADGMAMIRSSLAALDNHGGDPLGSRASSTTYRNSEGQ
FFSSMTLWGLAMKTLQNENELDQ

>gi568815594r:4167892_4383742|GENSCAN_predicted_CDS_4|2412_bp
atggaccctgttgctacccacagctgccatctgctccagcaactgcatgagcagcgaatc
caaggcctgctttgtgactgtatgttggtggtaaaaggagtctgctttaaagcgcataag
aatgtcctggcagcattcagccagtattttaggagcctctttcagaattcttcaagccag
aagaatgatgtttttcacttggatgttaaaaatgtcagtggcatagggcagatcctggac
ttcatgtacacttctcatctagatcttaaccaggacaatatacaagtaatgctggacaca
gcacagtgtttgcaagttcaaaatgttctgagtctgtgtcacacatttttaaaatcagcc
actgtagtacagccacctggcatgccttgtaatagtacattgtctctacaaagcaccctg
accccagatgccacttgtgttatcagtgaaaactacccccctcatttactgcaggaatgt
tcagcagatgcacagcagaacaaaacgttggatgaatcgcatccgcatgcttcaccatca
gttaatcgtcatcactccgcaggtgaaatctcaaaacaagctcctgatacttcagatggc
agctgcacagaactgcctttcaaacagccaaattactattacaaactcagaaacttttac
agtaagcagtaccataaacacgcagctggtcccagtcaggagagagttgttgagcagcct
tttgctttcagcacctctacagaccttaccacggtagagagccagccttgtgccgtcagt
cattctgaatgcatcctggagtctcccgagcacttaccttccaacttcctggcccagcct
gtgaatgactctgccccacaccctgagtcagacgccacatgccaacaacctgtcaagcag
atgaggctcaaaaaggccattcatctgaagaagctcaatttcctgaagtcacagaaatac
gcagagcaagtatctgaacccaagtcagatgatggtttgacaaagaggttggaatctgct
agtaaaaataccctagagaaagctagcagccaaagtgctgaagaaaaagaaagtgaagaa
gtcgtcagttgtgagaattttaattgcattagtgagacggagaggcctgaagacccggct
gccctggaagaccagtcccagacacttcagtcccagagacaatacgcgtgtgaattatgc
gggaaaccttttaaacacccaagcaacttggagcttcacaaacggtctcatacaggtgag
aaaccttttgaatgtaacatttgtgggaaacatttctctcaggcaggtaacttgcagact
cacttacgacggcattctggtgaaaaaccatacatctgcgagatctgtggaaagaggctg
gcaccctcgaccctccttcctgtccttgggctgtacgtcaatcaggccacgtactccatc
atctggtggttgtgccaggacaacaggcatcagccaggactatgccagtcacgaagctct
gtagtgatgctgaaccgtacggtatctgatcttgtgctctgtaaaactgaggaagaaagt
gtgatgaaatcagggtttagtaacttcagtaatttgaaggagcacaaaaagacacacacg
gctgataaagtcttcacctgtgatgagtgtggaaagtcttttaatatgcaaaggaagtta
gtaaagcacagaattcggcacacgggggagcggccttacagctgctctgcctgcgggaaa
tgttttgggggatcaggtgacctccgcaggcatgtccgcactcacactggggagaagccg
tacacatgtgagatctgtaacaagtgctttacccgctctgcggtgctccggcggcacaag
aagatgcactgcaaagctggtgacgagagcccagatgtgctggaggagctcagccaagcc
atcgagacctccgacctcgagaaatctcagagctcagactctttctcccaagacacgtct
gtgacgctgatgccagtgtcggttaaactccctgtccacccagtggaaaattctgtggca
gaatttgatagccactctggcggctcctattgtaagttacggtccatgatccaacctcat
ggagttagtgaccaggagaagctgagtttggatcctggtaaacttgccaagccccagatg
cagcagacacagcctcaggcctatgcttactcggatgtggacaccccagccggtggcgaa
ccactgcaggccgatggcatggccatgatccgttcctctctggctgctttggacaaccac
ggcggtgaccccctgggcagtcgagcatcttccaccacttataggaactcagagggtcag
tttttctccagcatgactctctgggggctagcgatgaagacgctgcagaatgaaaacgag
ttagaccagtga

>gi568815594r:4167892_4383742|GENSCAN_predicted_peptide_5|394_aa
MAARAKGTGFPSFSFTASPPIPAKHQRREIVEIKTQDKEVEEKTAGPEGHYHLDVETSSG
PECLAAQLFIGYKTRGQGRSQSCCQCFGGDASAGRMFFRERLICVAAVLQNVECAGVSGI
GYFWCVLRLADFKNEATDPRGVKPQTFTVSVTIKLVGTQRVSSSKIYCEEPKIKASTSCK
GTPTVCSCWLRGVLFWVEALFKYKPTNPESTLQPPVFLGSTAPATSTCPHQLRAGTPRAS
CLHTSPLMQNLNPGPVQDSHQKGHIDEEAHVLPGVTLKHQHIFTNQKYGADVNMFEAPYS
LRSQESGTPPSNSLDSGVRVICVTARLETVHWKERGVHYEDRVCNRCGDGGHRGSDKGRG
KPSQAYSLWTASVGHESQPIAPNPVRSMVFMKLT

>gi568815594r:4167892_4383742|GENSCAN_predicted_CDS_5|1185_bp
atggccgcacgtgccaagggaacaggattcccgtccttctcattcactgcatctcctcca
atcccagcaaagcaccagagacgagagattgtagaaataaagacacaagacaaagaggta
gaagaaaagacagctgggcccgagggccactaccacctagacgtggagaccagtagtggc
cccgaatgcctggctgcacagttatttattggatacaagacaagggggcagggtcgaagc
cagagttgttgtcagtgctttggtggagacgccagtgctgggcgaatgttcttccgagag
cgccttatctgcgttgctgcagtcctgcagaatgtcgagtgtgctggtgtgtctggaatt
ggttacttctggtgcgttcttcgtctcgctgacttcaagaatgaagccacggaccctcgc
ggagtgaagccacagaccttcacagtgagtgttacaataaagctagtggggacccaaaga
gtgagcagcagcaagatttattgtgaagagccaaagatcaaagcttccacatcctgcaag
gggaccccaacggtttgcagctgctggctcagaggagtcctcttctgggtagaggcactt
ttcaaatacaaaccaaccaacccagagtccacgctgcagccacctgtttttctaggctct
actgctccagccacatctacctgtcctcatcagctcagggcaggcacccccagagcatcg
tgtctgcacacctcacctctcatgcagaatctgaatcctgggccagttcaggactcacac
cagaaagggcacattgatgaagaagcacatgtcttacctggggtgaccctgaagcaccaa
cacatcttcaccaaccaaaaatatggggccgatgtcaacatgttcgaagcaccttattct
ctaagatctcaggagagtggaactccgccctccaactctctagactctggagtgagagtt
atctgtgtgacagccagattagaaacagtgcactggaaagagcgaggggtgcactatgag
gatcgcgtctgcaaccgctgtggggacggcggccacagaggcagcgacaagggcaggggc
aagcctagccaagcctacagcctgtggacagccagtgtggggcacgagtcccagcccata
gcacccaatccagttcggtctatggtgttcatgaagctgacctga