GENSCAN 1.0 Date run: 4-Nov-116 Time: 02:32:16 Sequence gi568815586f:117916828_118131275 : 214448 bp : 45.71% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 6078 6154 77 2 2 51 94 93 0.676 6.76 1.02 Intr + 9804 9962 159 0 0 68 68 50 0.038 0.10 1.03 Intr + 40341 40416 76 0 1 140 69 -13 0.015 1.52 1.04 Intr + 48245 48291 47 1 2 94 100 13 0.003 0.31 1.05 Term + 49893 50163 271 0 1 115 39 105 0.008 3.26 1.06 PlyA + 50977 50982 6 1.05 2.04 PlyA - 51622 51617 6 1.05 2.03 Term - 52154 51985 170 0 2 80 42 220 0.147 14.64 2.02 Intr - 61841 61605 237 0 0 50 57 105 0.020 1.19 2.01 Init - 70601 70511 91 2 1 85 84 103 0.980 10.25 2.00 Prom - 76296 76257 40 -2.56 3.00 Prom + 91421 91460 40 -3.76 3.01 Init + 100001 100065 65 1 2 110 62 53 0.982 5.54 3.02 Intr + 100183 100327 145 1 1 125 21 52 0.893 2.48 3.03 Intr + 101168 101226 59 1 2 101 103 25 0.941 2.98 3.04 Intr + 102245 102309 65 2 2 65 100 37 0.976 0.96 3.05 Intr + 102805 102941 137 2 2 71 84 87 0.932 6.89 3.06 Intr + 104079 104158 80 2 2 53 60 86 0.928 0.65 3.07 Intr + 105459 105532 74 0 2 59 99 98 0.920 7.05 3.08 Intr + 106816 106935 120 2 0 36 67 73 0.552 0.47 3.09 Intr + 108024 108183 160 1 1 63 81 59 0.946 1.75 3.10 Intr + 108920 109001 82 2 1 99 80 33 0.990 3.24 3.11 Term + 110062 110229 168 0 0 101 49 214 0.867 16.68 3.12 PlyA + 110959 110964 6 1.05 4.16 PlyA - 111759 111754 6 1.05 4.15 Term - 117531 117369 163 2 1 90 47 87 0.990 2.21 4.14 Intr - 118497 118387 111 2 0 102 97 67 0.995 8.49 4.13 Intr - 119683 119511 173 1 2 111 72 90 0.674 8.44 4.12 Intr - 121561 121461 101 2 2 107 109 36 0.968 7.43 4.11 Intr - 126145 126014 132 2 0 95 68 94 0.989 8.72 4.10 Intr - 126550 126306 245 0 2 45 96 131 0.559 6.74 4.09 Intr - 135651 135483 169 1 1 65 81 116 0.433 7.60 4.08 Intr - 151770 151550 221 2 2 32 121 301 0.177 25.75 4.07 Intr - 154240 154225 16 2 1 83 109 0 0.177 -3.80 4.06 Intr - 154642 154532 111 2 0 50 116 26 0.715 1.95 4.05 Intr - 157165 156872 294 2 0 59 113 263 0.704 22.88 4.04 Intr - 162779 162519 261 0 0 118 70 118 0.613 10.46 4.03 Intr - 165602 165300 303 0 0 116 36 409 0.644 35.06 4.02 Intr - 178987 178706 282 2 0 63 100 222 0.960 18.29 4.01 Init - 186844 186766 79 1 1 103 82 163 0.991 16.45 4.00 Prom - 188298 188259 40 -6.76 5.00 Prom + 198291 198330 40 -2.16 5.01 Init + 207252 207300 49 2 1 96 89 40 0.563 4.01 5.02 Term + 211649 212265 617 0 2 66 48 126 0.490 1.03 5.03 PlyA + 213079 213084 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 52149 51985 165 0 0 87 42 280 0.847 16.01 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586f:117916828_118131275|GENSCAN_predicted_peptide_1|209_aa MVFSRSSGDIRTYSEKHELPDVHDARRKITQQPPAVTAPDGLVLGRDGPMTFTTHLCPCN VEAQLKLSKGGLARPPFSRRFHLWGVHRRAVNLCLFLCLRLPYSLGLSSGQEDIIKSSAR DDLLHVDIQPNLKITEKKEIPKANPRKAPIQATTGAGGPQDSLRTDEFRPWELMCQDQDE RLSTAHAHSTTEGTPVLSAPLPPPSADAS >gi568815586f:117916828_118131275|GENSCAN_predicted_CDS_1|630_bp atggtcttttcacggtcatctggggacatacgaacatacagtgaaaaacatgagctgccc gatgtacatgatgccaggagaaaaatcacccagcagccaccagcggtgacagccccagat ggccttgttcttggcagagacggcccaatgaccttcaccacccacctctgtccctgcaat gtggaggcacaattaaaactctccaagggtggccttgctagaccgcctttcagcaggaga tttcacctttggggagtccacaggagagcagtaaacctctgtctgttcctgtgtctgcgg ctgccttactcgctaggactcagttctggccaagaagatataatcaaatcatctgcaagg gatgacctcctgcatgttgacatccagccaaatttaaaaatcacggaaaaaaaagaaata cccaaagcgaatccgagaaaagcacccatccaggctacaacaggggcaggagggccacaa gattcactacggactgacgagttcaggccttgggaactcatgtgccaggaccaagatgaa cgtctcagtactgctcatgcccactccaccacagagggaacccctgttctctcagcccca ctgcccccaccctcggcggatgccagctaa >gi568815586f:117916828_118131275|GENSCAN_predicted_peptide_2|165_aa MAKKEGEYGKLFEICPNDKKVTMEAFKNDNRQCKLIVQFMLELMGDCIPDRVAQHSAVIL VTSRKKPPKNDANNKIVLRDADRKETYLSRVRVIFKLLDYVSYFLLPKLGKMSVWSEAGA IAATVTPGARRRQQQQQQQRGEETAASRSSRRSRGSNADSTLRTR >gi568815586f:117916828_118131275|GENSCAN_predicted_CDS_2|498_bp atggccaagaaggagggcgaatatgggaagttgtttgaaatatgtccaaatgacaagaaa gtgactatggaagcattcaaaaatgataaccgtcagtgcaagttaatcgtgcaatttatg ctagaactaatgggagactgtatccctgatagggttgctcagcattctgcggtcatcctt gttacatctaggaaaaagccacctaaaaatgatgccaacaacaaaatagtgctgagagat gcagacagaaaagaaacgtacctatccagagtaagagtcatctttaaacttcttgattat gtgagttactttcttttgcctaaactgggcaagatgtcggtgtggagcgaggcaggagcg attgcagcaaccgtcacccccggagcccggcggcggcagcagcagcagcagcagcagcgc ggggaggagacagcagccagccgcagtagccgccgcagccgcgggagcaatgcagacagc accttgagaacgcgctaa >gi568815586f:117916828_118131275|GENSCAN_predicted_peptide_3|384_aa METSALKQQEQPAATKIRNLPWPAAGTDLQRFPPTPPSYPDSSVLFRAFASSHPFTPSSI CRRRRRLSHLPLATTNLLSIYGFAYSGHFMVEKYRPQTLNDLISHQDILSTIQKFINEDR LPHLLLYGPPGTGKTSTILACAKQLYKDKEFGSMVLELNASDDRGIDIIRGPILSFASTR TIFKKGFKLVILDEADAMTQDAQNALRREWNRCHKQLFVRVIAPLSALRPPDSLHGKVCK KPPSLPIEVIEKFTENTRFCLICNYLSKIIPALQSRCTRFRFGPLTPELMVPRLEHVVEE EKVDISEDGMKALVTLSSGDMRRALNILQSTNMAFGKVTEETVYTCTGHPLKSDIANILD WMLNQDFTTAYRSILSHDLLATET >gi568815586f:117916828_118131275|GENSCAN_predicted_CDS_3|1155_bp atggagacctcagcactcaagcagcaggagcagcccgcggcgaccaagatcaggaacctg ccctggcctgcagccgggaccgacctgcagaggtttcccccgacaccccccagctaccca gactcgtccgttctgtttagggcttttgccagttctcaccctttcacccccagctccatt tgccgaagaagaaggcggctttcgcacctgcccctggcaaccactaatctgctttctatc tatggatttgcctattctggacatttcatggttgaaaaataccggccacagaccctgaat gatctcatttctcatcaggacattctgagtaccattcagaagtttatcaatgaagaccga ctgccacacttgcttctctacggtcccccagggacaggcaagacatctaccatcctagcc tgtgcgaaacagctatataaagacaaagaatttggctccatggtcttggagctgaatgct tcagatgaccgaggaatagacatcattcgaggaccgatcctgagctttgctagcacaagg acaatatttaagaaaggctttaagctagtgatcttggatgaagcagacgccatgactcag gacgcccagaatgccttgagaagagaatggaaccggtgtcataagcaactctttgttcgg gttattgctcccttgagtgctctaaggcctccagacagtctccatggaaaagtctgcaaa aagccgccctccctgcccatagaagtaattgagaaattcacagaaaataccagattctgc ctcatctgtaactatctgtcaaagatcatccctgccttgcagtcccgctgcacgaggttt cggttcggtcccctgactcctgaactcatggttccccgcctggaacatgtcgtggaagaa gagaaagttgatataagtgaagatggaatgaaagcactagtcactctttccagtggagac atgcgtagggctctgaacattttgcagagcaccaatatggcctttgggaaggtgacagag gagactgtctacacctgcaccgggcacccgctcaagtcagacattgccaacatcctggac tggatgttgaatcaagatttcaccacagcctacagaagtatcctttctcatgacctcctg gccaccgagacctga >gi568815586f:117916828_118131275|GENSCAN_predicted_peptide_4|886_aa MAAGGSAPEPRVLVCLGALLAGWVAVGLEAVVIGEVHENVTLHCGNISGLRGQVTWYRNN SEPVFLLSSNSSLRPAEPRFSLVDATSLHIESLSLGDEGIYTCQEILNVTQWFQVWLQVA SGPYQIEVHIVATGTLPNGTLYAARGSQVDFSCNSSSRPPPVVEWWFQALNSSSESFGHN LTVNFFSLLLISPNLQGNYTCLALNQLSKRHRKVTTELLVYYPPPSAPQCWAQMASGSFM LQLTCRWDGGYPDPDFLWIEEPGGVIVGKSKLGVEMLSESQLSDGKKFKCVTSHIVGPES GASCMVQIRGPSLLSEPMKTCFTGGNVTLTCQVSGAYPPAKILWLRNLTQPEVIIQPSSR HLITQDGQNSTLTIHNCSQDLDEGYYICRADSPVGVREMEIWLSVKEPLNIGGIVGTIVS LLLLGLAIISGLLLHYSPVFCWKVGNTSRGQNMDDVMVLVDSEEEEEEEEEEEEDAAVGE QEGAREREELPKEIPKQDHIHRVTALVNGNIEQMGNGFQDLQEEPLLLAELKPGRPHQFD WKSSCETWSVAFSPDGSWFAWSQGHCIVKLIPWPLEEQFIPKGFEAKSRSSKNETKGRGS PKEKTLDCGQIVWGLAFSPWPSPPSRKLWARHHPQVPDVSCLVLATGLNDGQIKIWEVQT GLLLLNLSGHQDVVRDLSFTPSGSLILVSASRDKTLRIWDLNKHGKQIQVLSGHLQWVYC CSISPDCSMLCSAAGEKSVFLWSMRSYTLIRKLEGHQSSVVSCDFSPDSALLVTASYDTN VIMWDPYTGERLRSLHHTQVDPAMDDSDVHISSLRSVCFSPEGLYLATVADDRTRDGHVQ FWTAPRVLSSLKHLCRKALRSFLTTYQVLALPIPKKMKEFLTYRTF >gi568815586f:117916828_118131275|GENSCAN_predicted_CDS_4|2661_bp atggccgcaggcggcagtgcgcccgagccccgcgtcctcgtctgcctcggggcgctcctg gccggctgggtcgccgtaggattggaggctgttgtcattggagaagttcatgagaatgtt actctgcactgtggcaacatctcgggactgaggggccaggtgacctggtaccggaacaac tcggagcctgtcttccttctctcgtccaactctagcctccggccagctgagcctcgcttc tctctagtggatgccacctccctgcacattgaatcgctgagcctgggagatgagggaatc tacacctgccaggagatcctgaatgtgactcagtggttccaagtgtggctgcaggtggcc agcggcccctatcagattgaggtccacatcgtggccaccggcacactccccaacggcacc ctctacgcagccaggggctcccaggtggacttcagctgcaacagcagctccaggccacca cccgtggttgaatggtggttccaggccctgaattccagcagcgagtcctttggccacaac ctgacagtcaactttttctcactgttactgatatcgccaaacctccaagggaactacacc tgtttagccttgaatcagctcagcaagagacatcgaaaggtgaccaccgagctcctggtc tactatccccctccatcagctccccagtgctgggcacagatggcatcaggatcgttcatg ttgcagcttacctgtcgctgggatgggggataccctgaccctgacttcctgtggatagaa gagccaggaggtgtaatcgtggggaagtcaaagctgggggtggaaatgctgagcgagtcc cagctgtcggatggcaagaagttcaagtgtgttacaagccacatagttgggccagagtcg ggcgccagctgcatggtgcagatcaggggtccctcccttctctctgagcccatgaagact tgcttcactgggggcaatgtgacgcttacatgccaggtgtctggggcctacccccctgcc aagatcctgtggctgaggaaccttacccagcccgaggtgatcatccagcctagcagccgc catctcattacccaggatggccagaactccaccctcactatccacaactgctcccaggac ctggatgagggctactacatctgccgagctgacagccctgtaggggtgagggagatggaa atctggctgagtgtgaaagaacctttaaatatcggggggattgtgggaaccattgtgagc ctccttctgctgggactggccattatctcagggcttctgttgcattatagccctgtgttc tgctggaaagtaggaaacacttccaggggacaaaacatggatgatgtcatggttttggtg gattcagaagaggaagaggaggaggaggaggaggaggaggaagatgctgcagtaggggaa caggagggagcacgtgagagagaggagttgccaaaagaaatacctaagcaggaccacatt cacagagtgaccgccttggtgaatgggaacatagaacagatgggaaatggattccaggat cttcaagaggaaccgctgctgctggccgaactcaagcccgggcgcccccaccagtttgat tggaagtccagctgtgaaacctggagcgtcgccttctccccagatggctcctggtttgct tggtctcaaggacactgcatcgtcaaactgatcccctggccgttggaggagcagttcatc cctaaagggtttgaagccaaaagccgaagtagcaaaaatgagacgaaagggcggggcagc ccaaaagagaagacgctggactgtggtcagattgtctgggggctggccttcagcccgtgg ccttccccacccagcaggaagctctgggcacgccaccacccccaagtgcccgatgtctct tgcctggttcttgctacgggactcaacgatgggcagatcaagatctgggaggtgcagaca gggctcctgcttttgaatctttccggccaccaagatgtcgtgagagatctgagcttcaca cccagtggcagtttgattttggtctccgcgtcacgggataagactcttcgcatctgggac ctgaataaacacggtaaacagattcaagtgttatcgggccacctgcagtgggtttactgc tgttccatctccccagactgcagcatgctgtgctctgcagctggagagaagtcggtcttt ctatggagcatgaggtcctacacgttaattcggaagctagagggccatcaaagcagtgtt gtctcttgtgacttctcccccgactctgccctgcttgtcacggcttcttacgataccaat gtgattatgtgggacccctacaccggcgaaaggctgaggtcactccaccacacccaggtt gaccccgccatggatgacagtgacgtccacattagctcactgagatctgtgtgcttctct ccagaaggcttgtaccttgccacggtggcagatgacaggacaagagatggccacgtccag ttctggacagctcctagggtcctgtcctcactgaagcacttatgccggaaagcccttcga agtttcctaacaacttaccaagtcctagcactgccaatccccaagaaaatgaaagagttc ctcacatacaggactttttaa >gi568815586f:117916828_118131275|GENSCAN_predicted_peptide_5|221_aa MGFLHVGQAGLELLTSELEKTTLKFIWNQKRACIAKTILSKKNKAGGITLPDFKLYYKAT VTKTAWYWYQNRDIDQWNRTEPSEITPHIYNHLIFDKPIKNKKWGKDFLFNKWCWENWLT ICRKLKLDPFLTPHTKINSRWIKDLNVRPKTIKTLEENLGNTIQDIGMGKDFMSKTPKAM ATKAKIEKWDLIKELLHSKRNYHQSEQATYRMGENFCHLPI >gi568815586f:117916828_118131275|GENSCAN_predicted_CDS_5|666_bp atggggtttctccatgttggtcaggctggtctcgaactcctgacctcagaattggaaaaa actactttaaagttcatatggaaccaaaaaagagcctgcattgccaagacaatcttaagc aaaaagaacaaagctggaggcatcacgctacctgacttcaaactatactacaaggctaca gtaaccaaaacagcatggtactggtaccaaaacagagatatagaccaatggaacagaaca gagccctcagaaataacaccacacatctacaaccatctgatctttgacaaacctatcaaa aacaagaaatggggaaaggatttcctatttaataaatggtgctgggaaaactggctaacc atatgtagaaagctgaaactggatcccttccttacacctcatacaaaaattaattcaaga tggattaaagacttaaatgttagacctaaaaccataaaaaccctagaagaaaacttaggc aataccattcaggacataggcatgggcaaggacttcatgtctaaaacaccaaaagcaatg gcaacaaaagccaaaatagaaaaatgggatctaattaaagagcttctgcacagcaaaaga aactaccatcagagtgaacaggcaacctacagaatgggagaaaatttttgccatctaccc atctga