GENSCAN 1.0 Date run: 6-Nov-116 Time: 08:29:59 Sequence gi568815586r:117934199_118152480 : 218282 bp : 46.04% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1277 1331 55 1 1 55 90 53 0.531 1.65 1.02 Term + 3925 4025 101 2 2 90 45 65 0.595 0.69 1.03 PlyA + 4108 4113 6 1.05 2.04 PlyA - 6105 6100 6 1.05 2.03 Term - 10361 10347 15 2 0 112 50 4 0.051 -2.76 2.02 Intr - 23090 22980 111 2 0 104 62 32 0.175 2.78 2.01 Init - 34057 33878 180 1 0 70 92 209 0.597 18.68 2.00 Prom - 34192 34153 40 -16.85 3.04 PlyA - 34251 34246 6 1.05 3.03 Term - 34783 34614 170 2 2 80 42 220 0.147 14.64 3.02 Intr - 44470 44234 237 2 0 50 57 105 0.020 1.19 3.01 Init - 53230 53140 91 1 1 85 84 103 0.980 10.25 3.00 Prom - 58925 58886 40 -2.56 4.00 Prom + 74050 74089 40 -3.76 4.01 Init + 82630 82694 65 0 2 110 62 53 0.982 5.54 4.02 Intr + 82812 82956 145 0 1 125 21 52 0.893 2.48 4.03 Intr + 83797 83855 59 0 2 101 103 25 0.941 2.98 4.04 Intr + 84874 84938 65 1 2 65 100 37 0.976 0.96 4.05 Intr + 85434 85570 137 1 2 71 84 87 0.932 6.89 4.06 Intr + 86708 86787 80 1 2 53 60 86 0.928 0.65 4.07 Intr + 88088 88161 74 2 2 59 99 98 0.920 7.05 4.08 Intr + 89445 89564 120 1 0 36 67 73 0.552 0.47 4.09 Intr + 90653 90812 160 0 1 63 81 59 0.946 1.75 4.10 Intr + 91549 91630 82 1 1 99 80 33 0.990 3.24 4.11 Term + 92691 92858 168 2 0 101 49 214 0.867 16.68 4.12 PlyA + 93588 93593 6 1.05 5.16 PlyA - 94388 94383 6 1.05 5.15 Term - 100160 99998 163 1 1 90 47 87 0.990 2.21 5.14 Intr - 101126 101016 111 1 0 102 97 67 0.995 8.49 5.13 Intr - 102312 102140 173 0 2 111 72 90 0.674 8.44 5.12 Intr - 104190 104090 101 1 2 107 109 36 0.968 7.43 5.11 Intr - 108774 108643 132 1 0 95 68 94 0.989 8.72 5.10 Intr - 109179 108935 245 2 2 45 96 131 0.559 6.74 5.09 Intr - 118280 118112 169 0 1 65 81 116 0.433 7.60 5.08 Intr - 134399 134179 221 1 2 32 121 301 0.177 25.75 5.07 Intr - 136869 136854 16 1 1 83 109 0 0.177 -3.80 5.06 Intr - 137271 137161 111 1 0 50 116 26 0.715 1.95 5.05 Intr - 139794 139501 294 1 0 59 113 263 0.704 22.88 5.04 Intr - 145408 145148 261 2 0 118 70 118 0.613 10.46 5.03 Intr - 148231 147929 303 2 0 116 36 409 0.644 35.06 5.02 Intr - 161616 161335 282 1 0 63 100 222 0.960 18.29 5.01 Init - 169473 169395 79 0 1 103 82 163 0.991 16.45 5.00 Prom - 170927 170888 40 -6.76 6.00 Prom + 180920 180959 40 -2.16 6.01 Init + 189881 189929 49 1 1 96 89 40 0.495 4.01 6.02 Term + 194278 194894 617 2 2 66 48 126 0.417 1.03 6.03 PlyA + 195033 195038 6 -0.45 7.00 Prom + 195301 195340 40 -2.36 7.01 Init + 202012 202146 135 0 0 95 86 278 0.997 28.44 7.02 Intr + 203841 203950 110 2 2 72 94 29 0.951 0.98 7.03 Intr + 205253 205353 101 2 2 91 92 137 0.994 14.15 7.04 Term + 210388 210605 218 2 2 118 50 324 0.998 28.91 7.05 PlyA + 211363 211368 6 1.05 8.03 PlyA - 215621 215616 6 1.05 8.02 Term - 216960 216799 162 0 0 69 50 148 0.994 7.04 8.01 Intr - 218211 218029 183 0 0 106 116 226 0.999 27.18 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 34778 34614 165 2 0 87 42 280 0.847 16.01 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:117934199_118152480|GENSCAN_predicted_peptide_1|51_aa MGRAWWFMPVIPALLGGRANAVILQVFALASRHQIHDVKACILCPSTQHGA >gi568815586r:117934199_118152480|GENSCAN_predicted_CDS_1|156_bp atgggccgggcatggtggttcatgcctgtaatcccagcacttttgggaggccgagctaat gctgtcatccttcaggtcttcgctttagcatcacgtcaccagatccacgacgtgaaggca tgtattctgtgtcccagtacccagcacggtgcctga >gi568815586r:117934199_118152480|GENSCAN_predicted_peptide_2|101_aa MDEENMTKSEEQQPLSLQKALQQCELVQNMIDLSISNLEGLRTKCATSNDLTQKEIRTLE AGLLRFRVTNLTQTDRVRQPQTQEQTEVYCSPVDSPKDFTN >gi568815586r:117934199_118152480|GENSCAN_predicted_CDS_2|306_bp atggatgaggaaaacatgacgaaaagcgaggagcagcagcctctgagtttgcaaaaagcc ttacagcagtgcgaactggtccaaaacatgatagacttgagcatctccaacctggaaggg cttaggaccaaatgtgctacctccaacgacctcacacaaaaagaaatccggaccctggag gcaggtttattgaggtttagagtcaccaatttgacccaaactgaccgagtaaggcagccg cagacacaggaacagacagaggtttactgctctcctgtggactccccaaaggattttaca aactga >gi568815586r:117934199_118152480|GENSCAN_predicted_peptide_3|165_aa MAKKEGEYGKLFEICPNDKKVTMEAFKNDNRQCKLIVQFMLELMGDCIPDRVAQHSAVIL VTSRKKPPKNDANNKIVLRDADRKETYLSRVRVIFKLLDYVSYFLLPKLGKMSVWSEAGA IAATVTPGARRRQQQQQQQRGEETAASRSSRRSRGSNADSTLRTR >gi568815586r:117934199_118152480|GENSCAN_predicted_CDS_3|498_bp atggccaagaaggagggcgaatatgggaagttgtttgaaatatgtccaaatgacaagaaa gtgactatggaagcattcaaaaatgataaccgtcagtgcaagttaatcgtgcaatttatg ctagaactaatgggagactgtatccctgatagggttgctcagcattctgcggtcatcctt gttacatctaggaaaaagccacctaaaaatgatgccaacaacaaaatagtgctgagagat gcagacagaaaagaaacgtacctatccagagtaagagtcatctttaaacttcttgattat gtgagttactttcttttgcctaaactgggcaagatgtcggtgtggagcgaggcaggagcg attgcagcaaccgtcacccccggagcccggcggcggcagcagcagcagcagcagcagcgc ggggaggagacagcagccagccgcagtagccgccgcagccgcgggagcaatgcagacagc accttgagaacgcgctaa >gi568815586r:117934199_118152480|GENSCAN_predicted_peptide_4|384_aa METSALKQQEQPAATKIRNLPWPAAGTDLQRFPPTPPSYPDSSVLFRAFASSHPFTPSSI CRRRRRLSHLPLATTNLLSIYGFAYSGHFMVEKYRPQTLNDLISHQDILSTIQKFINEDR LPHLLLYGPPGTGKTSTILACAKQLYKDKEFGSMVLELNASDDRGIDIIRGPILSFASTR TIFKKGFKLVILDEADAMTQDAQNALRREWNRCHKQLFVRVIAPLSALRPPDSLHGKVCK KPPSLPIEVIEKFTENTRFCLICNYLSKIIPALQSRCTRFRFGPLTPELMVPRLEHVVEE EKVDISEDGMKALVTLSSGDMRRALNILQSTNMAFGKVTEETVYTCTGHPLKSDIANILD WMLNQDFTTAYRSILSHDLLATET >gi568815586r:117934199_118152480|GENSCAN_predicted_CDS_4|1155_bp atggagacctcagcactcaagcagcaggagcagcccgcggcgaccaagatcaggaacctg ccctggcctgcagccgggaccgacctgcagaggtttcccccgacaccccccagctaccca gactcgtccgttctgtttagggcttttgccagttctcaccctttcacccccagctccatt tgccgaagaagaaggcggctttcgcacctgcccctggcaaccactaatctgctttctatc tatggatttgcctattctggacatttcatggttgaaaaataccggccacagaccctgaat gatctcatttctcatcaggacattctgagtaccattcagaagtttatcaatgaagaccga ctgccacacttgcttctctacggtcccccagggacaggcaagacatctaccatcctagcc tgtgcgaaacagctatataaagacaaagaatttggctccatggtcttggagctgaatgct tcagatgaccgaggaatagacatcattcgaggaccgatcctgagctttgctagcacaagg acaatatttaagaaaggctttaagctagtgatcttggatgaagcagacgccatgactcag gacgcccagaatgccttgagaagagaatggaaccggtgtcataagcaactctttgttcgg gttattgctcccttgagtgctctaaggcctccagacagtctccatggaaaagtctgcaaa aagccgccctccctgcccatagaagtaattgagaaattcacagaaaataccagattctgc ctcatctgtaactatctgtcaaagatcatccctgccttgcagtcccgctgcacgaggttt cggttcggtcccctgactcctgaactcatggttccccgcctggaacatgtcgtggaagaa gagaaagttgatataagtgaagatggaatgaaagcactagtcactctttccagtggagac atgcgtagggctctgaacattttgcagagcaccaatatggcctttgggaaggtgacagag gagactgtctacacctgcaccgggcacccgctcaagtcagacattgccaacatcctggac tggatgttgaatcaagatttcaccacagcctacagaagtatcctttctcatgacctcctg gccaccgagacctga >gi568815586r:117934199_118152480|GENSCAN_predicted_peptide_5|886_aa MAAGGSAPEPRVLVCLGALLAGWVAVGLEAVVIGEVHENVTLHCGNISGLRGQVTWYRNN SEPVFLLSSNSSLRPAEPRFSLVDATSLHIESLSLGDEGIYTCQEILNVTQWFQVWLQVA SGPYQIEVHIVATGTLPNGTLYAARGSQVDFSCNSSSRPPPVVEWWFQALNSSSESFGHN LTVNFFSLLLISPNLQGNYTCLALNQLSKRHRKVTTELLVYYPPPSAPQCWAQMASGSFM LQLTCRWDGGYPDPDFLWIEEPGGVIVGKSKLGVEMLSESQLSDGKKFKCVTSHIVGPES GASCMVQIRGPSLLSEPMKTCFTGGNVTLTCQVSGAYPPAKILWLRNLTQPEVIIQPSSR HLITQDGQNSTLTIHNCSQDLDEGYYICRADSPVGVREMEIWLSVKEPLNIGGIVGTIVS LLLLGLAIISGLLLHYSPVFCWKVGNTSRGQNMDDVMVLVDSEEEEEEEEEEEEDAAVGE QEGAREREELPKEIPKQDHIHRVTALVNGNIEQMGNGFQDLQEEPLLLAELKPGRPHQFD WKSSCETWSVAFSPDGSWFAWSQGHCIVKLIPWPLEEQFIPKGFEAKSRSSKNETKGRGS PKEKTLDCGQIVWGLAFSPWPSPPSRKLWARHHPQVPDVSCLVLATGLNDGQIKIWEVQT GLLLLNLSGHQDVVRDLSFTPSGSLILVSASRDKTLRIWDLNKHGKQIQVLSGHLQWVYC CSISPDCSMLCSAAGEKSVFLWSMRSYTLIRKLEGHQSSVVSCDFSPDSALLVTASYDTN VIMWDPYTGERLRSLHHTQVDPAMDDSDVHISSLRSVCFSPEGLYLATVADDRTRDGHVQ FWTAPRVLSSLKHLCRKALRSFLTTYQVLALPIPKKMKEFLTYRTF >gi568815586r:117934199_118152480|GENSCAN_predicted_CDS_5|2661_bp atggccgcaggcggcagtgcgcccgagccccgcgtcctcgtctgcctcggggcgctcctg gccggctgggtcgccgtaggattggaggctgttgtcattggagaagttcatgagaatgtt actctgcactgtggcaacatctcgggactgaggggccaggtgacctggtaccggaacaac tcggagcctgtcttccttctctcgtccaactctagcctccggccagctgagcctcgcttc tctctagtggatgccacctccctgcacattgaatcgctgagcctgggagatgagggaatc tacacctgccaggagatcctgaatgtgactcagtggttccaagtgtggctgcaggtggcc agcggcccctatcagattgaggtccacatcgtggccaccggcacactccccaacggcacc ctctacgcagccaggggctcccaggtggacttcagctgcaacagcagctccaggccacca cccgtggttgaatggtggttccaggccctgaattccagcagcgagtcctttggccacaac ctgacagtcaactttttctcactgttactgatatcgccaaacctccaagggaactacacc tgtttagccttgaatcagctcagcaagagacatcgaaaggtgaccaccgagctcctggtc tactatccccctccatcagctccccagtgctgggcacagatggcatcaggatcgttcatg ttgcagcttacctgtcgctgggatgggggataccctgaccctgacttcctgtggatagaa gagccaggaggtgtaatcgtggggaagtcaaagctgggggtggaaatgctgagcgagtcc cagctgtcggatggcaagaagttcaagtgtgttacaagccacatagttgggccagagtcg ggcgccagctgcatggtgcagatcaggggtccctcccttctctctgagcccatgaagact tgcttcactgggggcaatgtgacgcttacatgccaggtgtctggggcctacccccctgcc aagatcctgtggctgaggaaccttacccagcccgaggtgatcatccagcctagcagccgc catctcattacccaggatggccagaactccaccctcactatccacaactgctcccaggac ctggatgagggctactacatctgccgagctgacagccctgtaggggtgagggagatggaa atctggctgagtgtgaaagaacctttaaatatcggggggattgtgggaaccattgtgagc ctccttctgctgggactggccattatctcagggcttctgttgcattatagccctgtgttc tgctggaaagtaggaaacacttccaggggacaaaacatggatgatgtcatggttttggtg gattcagaagaggaagaggaggaggaggaggaggaggaggaagatgctgcagtaggggaa caggagggagcacgtgagagagaggagttgccaaaagaaatacctaagcaggaccacatt cacagagtgaccgccttggtgaatgggaacatagaacagatgggaaatggattccaggat cttcaagaggaaccgctgctgctggccgaactcaagcccgggcgcccccaccagtttgat tggaagtccagctgtgaaacctggagcgtcgccttctccccagatggctcctggtttgct tggtctcaaggacactgcatcgtcaaactgatcccctggccgttggaggagcagttcatc cctaaagggtttgaagccaaaagccgaagtagcaaaaatgagacgaaagggcggggcagc ccaaaagagaagacgctggactgtggtcagattgtctgggggctggccttcagcccgtgg ccttccccacccagcaggaagctctgggcacgccaccacccccaagtgcccgatgtctct tgcctggttcttgctacgggactcaacgatgggcagatcaagatctgggaggtgcagaca gggctcctgcttttgaatctttccggccaccaagatgtcgtgagagatctgagcttcaca cccagtggcagtttgattttggtctccgcgtcacgggataagactcttcgcatctgggac ctgaataaacacggtaaacagattcaagtgttatcgggccacctgcagtgggtttactgc tgttccatctccccagactgcagcatgctgtgctctgcagctggagagaagtcggtcttt ctatggagcatgaggtcctacacgttaattcggaagctagagggccatcaaagcagtgtt gtctcttgtgacttctcccccgactctgccctgcttgtcacggcttcttacgataccaat gtgattatgtgggacccctacaccggcgaaaggctgaggtcactccaccacacccaggtt gaccccgccatggatgacagtgacgtccacattagctcactgagatctgtgtgcttctct ccagaaggcttgtaccttgccacggtggcagatgacaggacaagagatggccacgtccag ttctggacagctcctagggtcctgtcctcactgaagcacttatgccggaaagcccttcga agtttcctaacaacttaccaagtcctagcactgccaatccccaagaaaatgaaagagttc ctcacatacaggactttttaa >gi568815586r:117934199_118152480|GENSCAN_predicted_peptide_6|221_aa MGFLHVGQAGLELLTSELEKTTLKFIWNQKRACIAKTILSKKNKAGGITLPDFKLYYKAT VTKTAWYWYQNRDIDQWNRTEPSEITPHIYNHLIFDKPIKNKKWGKDFLFNKWCWENWLT ICRKLKLDPFLTPHTKINSRWIKDLNVRPKTIKTLEENLGNTIQDIGMGKDFMSKTPKAM ATKAKIEKWDLIKELLHSKRNYHQSEQATYRMGENFCHLPI >gi568815586r:117934199_118152480|GENSCAN_predicted_CDS_6|666_bp atggggtttctccatgttggtcaggctggtctcgaactcctgacctcagaattggaaaaa actactttaaagttcatatggaaccaaaaaagagcctgcattgccaagacaatcttaagc aaaaagaacaaagctggaggcatcacgctacctgacttcaaactatactacaaggctaca gtaaccaaaacagcatggtactggtaccaaaacagagatatagaccaatggaacagaaca gagccctcagaaataacaccacacatctacaaccatctgatctttgacaaacctatcaaa aacaagaaatggggaaaggatttcctatttaataaatggtgctgggaaaactggctaacc atatgtagaaagctgaaactggatcccttccttacacctcatacaaaaattaattcaaga tggattaaagacttaaatgttagacctaaaaccataaaaaccctagaagaaaacttaggc aataccattcaggacataggcatgggcaaggacttcatgtctaaaacaccaaaagcaatg gcaacaaaagccaaaatagaaaaatgggatctaattaaagagcttctgcacagcaaaaga aactaccatcagagtgaacaggcaacctacagaatgggagaaaatttttgccatctaccc atctga >gi568815586r:117934199_118152480|GENSCAN_predicted_peptide_7|187_aa MPVDLSKWSGPLSLQEVDEQPQHPLHVTYAGAAVDELGKVLTPTQVKNRPTSISWDGLDS GKLYTLVLTDPDAPSRKDPKYREWHHFLVVNMKGNDISSGTVLSDYVGSGPPKGTGLHRY VWLVYEQDRPLKCDEPILSNRSGDHRGKFKVASFRKKYELRAPVAGTCYQAEWDDYVPKL YEQLSGK >gi568815586r:117934199_118152480|GENSCAN_predicted_CDS_7|564_bp atgccggtggacctcagcaagtggtccgggcccttgagcctgcaagaagtggacgagcag ccgcagcacccgctgcatgtcacctacgccggggcggcggtggacgagctgggcaaagtg ctgacgcccacccaggttaagaatagacccaccagcatttcgtgggatggtcttgattca gggaagctctacaccttggtcctgacagacccggatgctcccagcaggaaggatcccaaa tacagagaatggcatcatttcctggtggtcaacatgaagggcaatgacatcagcagtggc acagtcctctccgattatgtgggctcggggcctcccaagggcacaggcctccaccgctat gtctggctggtttacgagcaggacaggccgctaaagtgtgacgagcccatcctcagcaac cgatctggagaccaccgtggcaaattcaaggtggcgtccttccgtaaaaagtatgagctc agggccccggtggctggcacgtgttaccaggccgagtgggatgactatgtgcccaaactg tacgagcagctgtctgggaagtag >gi568815586r:117934199_118152480|GENSCAN_predicted_peptide_8|114_aa LRLDEAQEAECQALRLQLQQEMELLNAYQSKIKMQTEAQHERELQKLEQRVSLRRAHLEQ KIEEELAALQKERSERIKNLLERQEREIETFDMESLRMGFGNLVTLDFPKEDYR >gi568815586r:117934199_118152480|GENSCAN_predicted_CDS_8|345_bp ttacggctagatgaggctcaagaagcagaatgccaggccttgaggctacagctccagcag gaaatggagctgctcaacgcctaccagagcaaaatcaagatgcaaacagaggcacaacat gaacgtgagctccagaagctagagcagagagtgtctctgcgcagagcacaccttgagcag aagattgaagaggagctggctgcccttcagaaggaacgcagcgagagaataaagaaccta ttggaaaggcaagagcgagagattgaaacttttgacatggagagcctcagaatgggattt gggaatttggttacattagattttcctaaggaggactacagatga