GENSCAN 1.0 Date run: 8-Nov-116 Time: 08:31:46 Sequence gi568815587r:124383344_124497351 : 114008 bp : 38.58% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 PlyA - 1617 1612 6 1.05 1.01 Sngl - 4850 3927 924 2 0 58 50 342 0.731 23.53 1.00 Prom - 5165 5126 40 -8.55 2.00 Prom + 7076 7115 40 -6.75 2.01 Sngl + 8177 9937 1761 1 0 70 48 546 0.979 43.20 2.02 PlyA + 10074 10079 6 -0.45 3.00 Prom + 10440 10479 40 -3.65 3.01 Init + 13328 13508 181 1 1 33 113 200 0.306 16.39 3.02 Intr + 15362 15429 68 0 2 80 65 55 0.082 0.11 3.03 Term + 31355 31552 198 1 0 129 48 79 0.718 4.42 3.04 PlyA + 32622 32627 6 1.05 4.04 PlyA - 32949 32944 6 1.05 4.03 Term - 41546 40599 948 2 0 117 39 233 0.131 12.38 4.02 Intr - 54242 54107 136 1 1 20 70 130 0.112 4.05 4.01 Init - 57742 57042 701 1 2 75 37 385 0.042 26.60 4.00 Prom - 61233 61194 40 -4.75 5.00 Prom + 62197 62236 40 -7.25 5.01 Sngl + 64712 64954 243 1 0 71 42 178 0.944 6.33 5.02 PlyA + 65653 65658 6 1.05 6.00 Prom + 73950 73989 40 -4.85 6.01 Init + 76261 76369 109 0 1 58 127 20 0.327 3.33 6.02 Intr + 76567 76660 94 2 1 22 62 96 0.401 -1.50 6.03 Term + 77553 77784 232 0 1 84 39 164 0.548 6.06 6.04 PlyA + 78544 78549 6 1.05 7.00 Prom + 79810 79849 40 -5.05 7.01 Init + 82816 82869 54 0 0 77 119 20 0.200 5.34 7.02 Intr + 87050 87206 157 1 1 83 44 85 0.381 2.26 7.03 Intr + 97871 98010 140 0 2 -4 55 170 0.161 3.76 7.04 Term + 99258 99428 171 2 0 49 54 156 0.721 5.14 7.05 PlyA + 100691 100696 6 1.05 8.02 PlyA - 102213 102208 6 1.05 8.01 Sngl - 105201 104554 648 0 0 86 42 277 0.993 18.83 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587r:124383344_124497351|GENSCAN_predicted_peptide_1|307_aa MIVYLENPIVSAQNLLKLINNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIAS KRIKYLGIHLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIY RFNAIPIKLPTTFFTELEKTTLKFTWNQKRARITKSTLSQKNKAGGITLPDFKLYYKATV TKTAWYGYQNRDIDQWNRTEPSEITPHIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAI CRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGITIQDIGMGKDFMSKTPKAMA RKAKIDK >gi568815587r:124383344_124497351|GENSCAN_predicted_CDS_1|924_bp atgattgtatatctagaaaaccccattgtctcagcccaaaatctccttaagctgataaac aacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattcttatac accaataacagacaaacagagagccaaatcatgagtgaactcccattcacaattgcttca aagagaataaaatacctaggaatccaccttacaagggatgtgaaggacctcttcaaggag aactacaaaccactgctcaatgaaataaaagaggatacaaacaaatggaagaacattcca tgctcatgggtaggaagaatcaatatcgtgaaaatggccatactgcccaaggtgatttat agattcaatgccatccccatcaagctaccaacgactttcttcacagaattggaaaaaact actttaaagttcacatggaaccaaaaaagagcccgcatcaccaagtcaaccctaagccaa aagaacaaagctggaggcatcacgctacctgacttcaaactatactacaaggctacagtc accaaaacagcatggtacgggtaccaaaacagagatatagatcaatggaacagaacagag ccctcagaaataacgccacatatctacaactatctgatctttgacaaacctgagaaaaac aagcaatggggaaaggattccctatttaataaatggtgctgggaaaactggctagccata tgtagaaagctgaaactggatcccttccttacaccttatacaaaaattaattcaagatgg attaaagacttaaatgtgagacctaaaaccataaaaaccctagaagaaaacctaggcatt accattcaggacataggcatgggcaaggacttcatgtctaaaacaccaaaagcaatggca agaaaagccaaaattgacaaatga >gi568815587r:124383344_124497351|GENSCAN_predicted_peptide_2|586_aa MDKFLDAYTLPRLNQEEVESLNRPITGSEIEVISSLPTKKSPGPDGFTAEFYQRYKEELV PFLLKLFQSIEKEGILPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDGKILNKILAN QIQQHIKKLIHHDQVGFIPGMQGWFNICKSINVIQHINRTKDKNHMIISIDAEKAFDKIQ QPFMLKSLNKLGIDGTYLKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLFN IVLEVLARAIRQEKEIKGMQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLINNFSKVS GYKINVQKSQTFLYTNNRQTESQIMSELPFTIASKRIKYLGIHLARDVKDLFKENYKPLL NEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFTW NQKRARITKSILSQKNKAGGITLPDFKLYYKATVTKTAWYGYQNRDIDQWNRTEPSEITP HIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLIPYTKINSRWIKDLNI RPKTPKTLEENLGITIQDIGMGKDFMSKTPKAMARKAKIDKWDLIK >gi568815587r:124383344_124497351|GENSCAN_predicted_CDS_2|1761_bp atggataaattcctcgacgcatacactctcccaagactaaaccaggaagaagttgaatct ctgaatagaccaataacaggatctgaaattgaggtaatcagtagcttaccaaccaaaaag agtccaggaccagatggattcacagccgaattctaccagaggtacaaggaggaactggta ccattccttctgaaattattccaatcaatagaaaaagagggaatcctccctaactcattt tatgaggctagcatcatcctgataccaaagcctggcagagacacaaccaaaaaagagaat tttagaccaatatccttgatgaacattgatggaaaaatcctcaataaaatactggcaaac caaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggcttcatccctggg atgcaaggctggttcaatatatgcaaatcaataaatgtaatccagcatataaacagaacc aaagacaaaaaccacatgattatctcaatagatgcagaaaaggcctttgacaaaattcaa caacccttcatgctaaaaagtctcaataaattaggtattgatgggacgtatctcaaaata ataagagctatctatgacaaacccacagccaatatcatactgaatgggcaaaaactggaa gcattccctttgaaaactggcacaagacagggatgccctctctcaccactcctattcaac atagtgttggaagttctggccagggcaatcaggcaggagaaggaaataaagggtatgcaa ttaggaaaagaggaagtcaaattgtccctgtttgcagatgacatgattgtatatctagaa aaccccattgtctcagcccaaaatctccttaagctgataaacaacttcagcaaagtctca ggatacaaaatcaatgtacaaaaatcacaaacattcttatacaccaataacagacaaaca gagagccaaatcatgagtgaactcccattcacaattgcttcaaagagaataaaataccta ggaatccaccttgcaagggatgtgaaggacctcttcaaggagaactacaaaccactgctc aatgaaataaaagaggatacaaacaaatggaagaacattccatgctcatgggtaggaaga atcaatatcgtgaaaatggccatactgcccaaggtgatttatagattcaatgccatcccc atcaagctaccaatgactttcttcacagaattggaaaaaactactttaaagttcacatgg aaccaaaaaagagcccgcatcaccaagtcaatcctaagccagaagaacaaagctggaggc atcacgctacctgacttcaaactatactacaaggctacagtcaccaaaacagcatggtac gggtaccaaaacagagatatagatcaatggaacagaacagagccctcagaaataacgcca catatctacaactatctgatctttgacaaacctgagaaaaacaagcaatggggaaaggat tccctatttaataaatggtgctgggaaaactggctagccatatgtagaaagctgaaactg gatcccttccttataccttatacaaaaattaattcaagatggattaaagacttaaacatt agacctaaaaccccaaaaaccctagaagaaaacctaggcattaccattcaggacataggc atgggcaaggacttcatgtctaaaacaccaaaagcaatggcaagaaaagccaaaattgac aaatgggatctaattaaatga >gi568815587r:124383344_124497351|GENSCAN_predicted_peptide_3|148_aa MKNASDNENIRNEDGTTGYHDINTHNNENNHLVDIGAGAGKLEEGEYVTQVMVDDISTAE ENSVLTIDMCFTSTALEEEMAVKSSLHPLTLGEGGGCRRKSVDAWKTRALPVRQLVPEHN QMFLETRKFLKGHQRSWIPALLSGSTGR >gi568815587r:124383344_124497351|GENSCAN_predicted_CDS_3|447_bp atgaagaatgctagtgacaatgaaaacataagaaatgaggatggtacaactgggtaccat gatattaatacccacaacaatgagaacaaccacctcgttgacataggtgctggtgcagga aagctggaggagggggagtatgtcacacaagtaatggttgatgatattagcactgcagaa gagaactcagtgctgaccattgatatgtgtttcacctccactgccctggaggaagaaatg gctgtgaagagcagtctacatcctctgactttgggggaaggaggaggctgcaggagaaag tcagtggatgcctggaagactagggcactgccggtgaggcaactggttcctgagcacaat cagatgtttttggagactcggaaattcttaaagggacatcagaggagttggattcctgca ttacttagtgggagcacaggtagatga >gi568815587r:124383344_124497351|GENSCAN_predicted_peptide_4|594_aa MAAENSSFVTQFILAGLTDQPGVQIPLFFLFLGFYVVTVVGNLGLITLIRLNSHLHTPMY FFLYNLSFIDFCYSSVITPKMLMSFVLKKNSISYAGCMTQLFFFLFFVVSESFILSAMAY DRYVAICNPLLYMVTMSPQVCFLLLLGVYGMGFAGAMAHTACMMGVTFCANNLVNHYMCD ILPLLECACTSTYVNELVVFVVVGIDIGVPTVTIFISYALILSSIFHIDSTEGRTKQEGS VYEAESGLSPDTESAAALILDFSASGTMRNKFLLFKSHLTARQERMTLRNSSSVTEFILV GLSEQPELQLPLFLLFLGIYVFTVVGNLGLITLIGINPSLHTPMYFFLFNLSFIDLCYSC VFTPKMLNDFVSESIISYVGCMTQLFFFCFFVNSECYVLVSMAYDRYVAICNPLLYMVTM SPRVCFLLMFGSYVVGFAGAMAHTGSMLRLTFCDSNVIDHYLCDVLPLLQLSCTSTHVSE LVFFIVVGVITMLSSISIVISYALILSNILCIPSAEGRSKAFSTWGSHIIAVALFFGSGT FTYLTTSFPGSMNHGRFASVFYTNVVPMLNPSIYSLRNKDDKLALGKTLKRVLF >gi568815587r:124383344_124497351|GENSCAN_predicted_CDS_4|1785_bp atggctgctgagaattcctccttcgtgacacagtttatcctcgcaggcttaactgaccaa ccgggagtccagatccccctcttcttcctgtttctaggcttctacgtggtcactgtggtg gggaacctgggcttgataaccctgataaggctcaactctcacttgcacacccctatgtac ttcttcctctataacttgtccttcatagatttctgctattccagtgttatcactcccaaa atgctgatgagctttgtcttaaagaagaacagcatctcctacgcagggtgtatgactcag ctcttcttctttcttttctttgttgtctctgagtccttcatcctgtcagcaatggcgtat gaccgctatgtggccatctgtaacccactgttgtacatggtcaccatgtctccccaggtg tgttttctccttttgttgggtgtctatgggatggggtttgctggggccatggcccacaca gcgtgcatgatgggtgtgaccttctgtgccaataaccttgtcaaccactacatgtgtgac atccttccccttcttgagtgtgcttgcaccagcacctatgtgaatgagcttgtagtgttt gttgttgtgggcattgatattggtgtgcccacagtcaccatcttcatttcctatgctctc attctctccagcatcttccacattgattccacggagggcagaacaaaacaagaaggctct gtctatgaggcagaaagcgggctctcaccagacacggaatctgctgctgccttaatcttg gacttctcagcctctggaaccatgagaaataaattcctgctgtttaaaagccacctgaca gctcgccaagagagaatgactctgagaaacagctcctcagtgactgagtttatccttgtg ggattatcagaacagccagagctccagctccctcttttccttctattcttagggatctat gtgttcactgtggtgggcaacttgggcttgatcaccttaattgggataaatcctagcctt cacacccccatgtactttttcctcttcaacttgtcctttatagatctctgttattcctgt gtgtttacccccaaaatgctgaatgactttgtttcagaaagtatcatctcttatgtggga tgtatgactcagctatttttcttctgtttctttgtcaattctgagtgctatgtgttggta tcaatggcctatgatcgctatgtggccatctgcaaccccctgctctacatggtcaccatg tccccaagggtctgctttctgctgatgtttggttcctatgtggtagggtttgctggggcc atggcccacactggaagcatgctgcgactgaccttctgtgattccaacgtcattgaccat tatctgtgtgacgttctccccctcttgcagctctcctgcaccagcacccatgtcagtgag ctggtatttttcattgttgttggagtaatcaccatgctatccagcataagcatcgtcatc tcttacgctttgatactctccaacatcctctgtattccttctgcagagggcagatccaaa gcctttagcacatggggctcccacataattgctgttgctctgttttttgggtcagggaca ttcacctacttaacaacatcttttcctggctctatgaaccatggcagatttgcctcagtc ttttacaccaatgtggttcccatgcttaacccttcgatctacagtttgaggaataaggat gataaacttgccctgggcaaaaccctgaagagagtgctcttctaa >gi568815587r:124383344_124497351|GENSCAN_predicted_peptide_5|80_aa MQQGSGDASVMMENPRRQTLHPCLPSPDQTGPETGGTSHYRENTYSQASKAAVYLCAGPE KEPWEPTIGQPSRLVPTSQA >gi568815587r:124383344_124497351|GENSCAN_predicted_CDS_5|243_bp atgcagcaagggagtggagatgcatctgtgatgatggaaaatccaagaaggcagactctg catccatgtctccccagtccagatcagactggcccagaaacaggagggacttctcattac agggaaaatacatactcccaggccagcaaagcagctgtgtacctatgtgctggacctgag aaagagccctgggaaccaaccataggccagccaagcagacttgtgcccacatcacaggcc taa >gi568815587r:124383344_124497351|GENSCAN_predicted_peptide_6|144_aa MWVGPNKGIKPGRLSPSGNPMGLPSMLWKLYSFALHARVCGFIREVNQTKNPPEGTNSGH SSNGHLRLCYMLMVKACDHYVAICCPLLCNVIMSHVTCSLMVAVVYTMGLVVSTIETGLI LKLPYCELLTSRCFCDILPLMKLS >gi568815587r:124383344_124497351|GENSCAN_predicted_CDS_6|435_bp atgtgggtggggccaaataagggaataaaacctggccgcctgagccccagtggcaacccg atggggttaccttccatgctgtggaagctttattcttttgctcttcacgcgagggtctgt ggcttcattcgtgaagtcaaccagaccaagaacccaccagaaggaaccaattctggacac agttccaatgggcatctaaggttgtgttacatgctgatggtgaaggcctgtgaccactat gttgccatctgctgccctttgctttgcaacgtcatcatgtctcatgtcacctgctccctg atggtggctgtggtctacaccatgggactcgttgtctccacaatagagactgggctcata ttaaaactgccctattgtgaactcctcaccagtcgctgcttctgtgacatcctccctctc atgaaactctcctga >gi568815587r:124383344_124497351|GENSCAN_predicted_peptide_7|173_aa MVHVGYFLEEHQFQTYRLPGKVGEYTHTGQTRAWKMDLLVHQLSSLVQNLQNRINPYGLQ CCQWDVGKSQGTEGFRPALCGTNVENAGKNKTIREKNGDNRHPNANGHNNKNHQQIYWLW HPVAQTGTHPLQLGAMVLLSTARCPSLSRKPSGPAEKNVTTWSPGPSASASPP >gi568815587r:124383344_124497351|GENSCAN_predicted_CDS_7|522_bp atggttcatgtgggctattttttggaggaacaccaatttcagacttataggctgcctggt aaagtaggcgagtatacccacactggccagacaagagcttggaaaatggatcttttggtt caccagctcagctctctggtgcagaatctacagaacagaataaacccatatggtttacag tgttgtcagtgggatgtaggaaaatcacagggtactgaaggctttagacctgccctctgt ggaactaatgtggagaatgctggaaagaataaaaccataagagagaaaaatggtgacaat cggcaccccaatgccaacggtcacaataataaaaaccaccagcaaatttattggctgtgg cacccagtggctcagacaggcactcacccccttcagcttggagccatggtgctgctctct actgcccgttgcccatctctttcacggaaacccagtggccctgctgaaaagaatgtgaca acttggagcccaggcccctctgcttctgcttctccaccttga >gi568815587r:124383344_124497351|GENSCAN_predicted_peptide_8|215_aa MAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNRKRARIAKSILSQKNKARGITLPD FKLYYKTTVTKTAWYWYQNRDIDQWNRTQPSEITLHIYNYLIFDKPEKNKQWGKDSLFNK WCWENWLAICKKLKLDPFLTPYTKINSRWIKDLNIRPKTLKTLEENLGITIQDIGMGKDF MSKTPKAMATKAKIDKWDLIKLKSFCTANEHFLMD >gi568815587r:124383344_124497351|GENSCAN_predicted_CDS_8|648_bp atggccatactgcccaaggtgatttatagattcaatgccatccccatcaagctaccaatg actttcttcacagaattggaaaaaactactttaaagttcatatggaaccgaaaaagagcc cgtattgccaagtcaatcctaagccaaaagaacaaagctagaggcatcacactacctgac ttcaaactatactacaagactacagtaaccaaaacagcatggtactggtatcaaaacaga gatatagatcaatggaacagaacacagccctcagaaataacgctgcatatctacaactat ctgatctttgacaaacctgagaaaaacaagcaatggggaaaggattccctatttaataaa tggtgctgggaaaactggctagccatatgtaaaaagctgaaactggatcccttccttaca ccttatacaaaaatcaattcaagatggattaaagacttaaacattagacctaaaacccta aaaaccctagaagaaaacctaggcattaccattcaggacataggcatgggcaaggacttc atgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaatt aaactaaagagcttctgcacagcaaatgaacacttcttaatggattaa