GENSCAN 1.0 Date run: 5-Nov-116 Time: 06:19:50 Sequence gi568815586f:75380881_75598975 : 218095 bp : 37.23% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 10177 10590 414 0 0 62 33 431 0.342 29.28 1.02 PlyA + 11206 11211 6 1.05 2.00 Prom + 23231 23270 40 -5.65 2.01 Init + 31385 31818 434 1 2 79 53 121 0.134 3.43 2.02 Intr + 37984 38139 156 1 0 105 37 52 0.092 0.00 2.03 Intr + 42024 42109 86 0 2 111 55 84 0.172 5.94 2.04 Intr + 49835 49861 27 0 0 130 50 32 0.608 0.77 2.05 Term + 50067 50281 215 1 2 12 48 338 0.865 18.61 2.06 PlyA + 50591 50596 6 1.05 3.00 Prom + 52663 52702 40 -5.95 3.01 Init + 53588 53685 98 1 2 83 19 47 0.242 -2.87 3.02 Term + 57505 57970 466 1 1 78 41 336 0.692 21.40 3.03 PlyA + 59321 59326 6 1.05 4.00 Prom + 59507 59546 40 -8.25 4.01 Init + 59779 59781 3 0 0 108 81 0 0.452 1.35 4.02 Intr + 60391 60692 302 0 2 2 47 241 0.384 6.01 4.03 Intr + 60911 61025 115 2 1 72 87 73 0.794 5.13 4.04 Intr + 72179 72256 78 1 0 33 119 59 0.351 2.33 4.05 Intr + 85746 85896 151 2 1 48 63 95 0.000 1.81 4.06 Intr + 100065 100174 110 1 2 39 91 138 0.212 8.28 4.07 Intr + 100954 101199 246 0 0 87 110 156 0.999 14.43 4.08 Intr + 106859 106929 71 1 2 130 96 17 0.990 3.76 4.09 Intr + 109526 109638 113 2 2 105 110 133 0.999 16.30 4.10 Term + 117121 117260 140 2 2 13 43 119 0.040 -2.96 4.11 PlyA + 117311 117316 6 1.05 5.11 PlyA - 117521 117516 6 1.05 5.10 Term - 119071 118929 143 2 2 34 38 182 0.876 4.91 5.09 Intr - 120936 120843 94 0 1 67 90 94 0.993 6.12 5.08 Intr - 121120 121043 78 1 0 51 102 80 0.958 4.53 5.07 Intr - 123194 123024 171 2 0 4 92 114 0.822 2.62 5.06 Intr - 125729 125604 126 2 0 58 63 115 0.974 5.96 5.05 Intr - 126036 125902 135 0 0 85 99 103 0.999 10.94 5.04 Intr - 127566 127394 173 1 2 89 50 137 0.997 8.74 5.03 Intr - 130764 130633 132 1 0 2 59 198 0.232 8.00 5.02 Intr - 138492 137014 1479 1 0 55 60 269 0.003 9.56 5.01 Init - 140170 139714 457 1 1 49 -3 212 0.012 4.21 5.00 Prom - 140262 140223 40 -6.15 6.06 PlyA - 140431 140426 6 1.05 6.05 Term - 142158 141706 453 0 0 62 43 152 0.663 2.27 6.04 Intr - 143560 143531 30 1 0 78 90 48 0.450 1.41 6.03 Intr - 146992 146948 45 1 0 94 88 19 0.348 0.19 6.02 Intr - 153602 153516 87 2 0 37 72 80 0.006 0.55 6.01 Init - 194561 194469 93 2 0 95 105 105 0.588 13.33 6.00 Prom - 196814 196775 40 -7.45 7.00 Prom + 197283 197322 40 -4.15 7.01 Sngl + 199105 199338 234 0 0 36 46 208 0.926 6.25 7.02 PlyA + 199623 199628 6 1.05 8.00 Prom + 202734 202773 40 -4.65 8.01 Init + 207423 207602 180 2 0 86 82 74 0.295 5.83 8.02 Intr + 213412 213471 60 0 0 113 78 50 0.862 4.51 8.03 Intr + 215297 215382 86 1 2 71 86 92 0.718 5.00 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586f:75380881_75598975|GENSCAN_predicted_peptide_1|137_aa IGRSLGRPARAHWPVSGRWTMEAARPFAREWRAQSLPLAVGGVLKLRLCELWLLLLGSSL NARFLPDEEDVDFINEYVNLHNELRGDVIPRGSNLRFMVRPEGGLPTLPLPLPQRLPGSR VVGATALRLKDRGEPHF >gi568815586f:75380881_75598975|GENSCAN_predicted_CDS_1|414_bp attggcaggtcactgggacggccagcgcgtgcgcactggcctgtcagcggccggtggacc atggaggccgcaaggcccttcgcccgggagtggagggcccagtccctacccctggcagta gggggcgttttgaagctgcggctctgtgagctgtggctactgctactgggttctagtttg aacgccagatttttgccagacgaggaggacgtagactttatcaacgagtacgtgaacctc cacaatgagctgcggggcgacgtcattccccgagggtctaacttgcgcttcatggtgagg ccggaaggcggtttgccgacccttccactaccgttgccacaacgcctccctggatctcgg gtagttggggccactgctctgaggctgaaggatcgcggagaaccgcacttttag >gi568815586f:75380881_75598975|GENSCAN_predicted_peptide_2|305_aa MTSFGSEALFWINRVLKQFLQAKIDKWDLIKLKSFCTAKETTIRVNRQPTKWGKIFTTYT SDKGLISRIYNELKQIYKKKTNNPIQKWAKDMNRHFSKEDIYAAKKHMKKCSSSLAVREM QIKTTMRYHLTPVRMAIIKKSGNNRDMDEAGNHHSQQTNTRTENQTPHVLTHKWELNNEN TWTQGGEHHTLGPVGVGGTLTRRPYEPGIFCTRCGRRDKCTDFLCSNADRDQATLQSQFP NILLEQQMIFTPEESEAGNEEEEKEEEKKEKEEMEMEIMEMEEEKEEREEEEEETQKEKM EEEEK >gi568815586f:75380881_75598975|GENSCAN_predicted_CDS_2|918_bp atgacgtcctttggctctgaagcattattctggatcaaccgtgttctcaaacagttctta caagctaaaattgacaaatgggatctaattaaactaaaaagcttctgcacagcaaaagaa actaccatcagagtgaacaggcaacctacaaaatgggggaaaattttcacaacctacaca tctgacaaagggctaatatccagaatctacaatgaactcaaacaaatttacaagaaaaaa acaaacaaccccatccaaaagtgggcaaaggatatgaacagacacttctcaaaagaagac atttatgcagccaaaaaacacatgaaaaaatgctcatcatcactggccgtcagagaaatg caaatcaaaaccacaatgagataccatctcacaccagttagaatggcgatcattaaaaag tcaggaaacaacagggacatggatgaagctggaaaccatcattctcagcaaactaacaca agaacagaaaaccaaacaccacatgttctcactcataagtgggagttgaacaatgagaac acatggacacagggaggggaacatcacacactggggcctgtcggggtgggaggaacactg acgagaagaccttatgaaccaggaatattttgtactcgatgtggcagacgtgacaaatgc acagattttctatgcagtaatgcagatcgtgaccaagccacattacagtctcagtttcca aatatcttgttggaacaacaaatgatatttacccctgaggaatctgaagcagggaatgaa gaggaggaaaaagaggaagagaagaaagagaaagaggaaatggaaatggaaataatggaa atggaggaggaaaaagaagagagagaggaggaggaggaggaaacacaaaaagaaaagatg gaggaagaggaaaaataa >gi568815586f:75380881_75598975|GENSCAN_predicted_peptide_3|187_aa MAEGKEEQVTSCMDGSRQRESLCRETPILKTIRCCCANSQQGQLLLCQFPAGTSTAVDEG WRRPYFHIRSRHWYCLAAVVEPPSSPTEPNTTPISPLGGAQSSSSHKQGALRQMRVHSLV SFVPKGALALSPFGVGEGGIFDLQLAVSTDTAPADMEGQLYCLYAGTLAAPSKDYPNGLS ASNFYIV >gi568815586f:75380881_75598975|GENSCAN_predicted_CDS_3|564_bp atggcagaaggcaaggaggagcaagtcacatcttgcatggatggcagcaggcagagagag agcttatgtagagaaactcccattttaaaaaccatcaggtgctgttgtgccaattcccag cagggacagctgctactgtgtcaattcccagcagggacatccacagcagtagatgagggg tggaggagaccttatttccacatccgttccaggcactggtactgcctggctgctgtggtg gaaccaccctcctcccctacagagcccaacaccacacccatttctccgctgggaggggca caatcatcgtcttctcacaagcagggagctctcaggcagatgagagtgcacagtctggtt tcctttgtcccaaagggtgctttggcactctctccctttggtgtgggtgagggtgggatt tttgatctgcagttggctgtatccacagacacagcacctgcagatatggagggccaactg tactgcctctatgctggaactttagctgcaccttcaaaagactatcctaatggtttatcg gccagcaatttctacatagtgtga >gi568815586f:75380881_75598975|GENSCAN_predicted_peptide_4|442_aa MAEVGTVWRAQKKTEKCGKVRDLVNGFDQNADSNMDNKVQADVVSDGDEELVGNWSKGDS CYVLAKRLAAFCPCSKDLWKFELEKYDLGYLVGEISKQQSKRPVGKSGFVGWAQGPRAVC SLGTWYPVSQLLQPWLKGAKQLINYSSGFPMLALVSAAASAHESALSWRTVLGESPLSES AAVFRASRQERLSPLKLCPQPPLCTGALSQGDESFITNILPDIENEDFIKDCVRIHNKFR SEVKPTASDMLYMTWDPALAQIAKAWASNCQFSHNTRLKPPHKLHPNFTSLGENIWTGSV PIFSVSSAITNWYDEIQDYDFKTRICKKVCGHYTQVSQERGRDPNPKRGFLNLARENPGL FGQIVTKLAAQFNFALKFLALTLFPMEHILYATTDQGMKTRDAHQYPEDQMIDGFYHEHL IATAAELVWKRFLFYKRKYPEG >gi568815586f:75380881_75598975|GENSCAN_predicted_CDS_4|1329_bp atggcagaggttggaacagtttggagggctcagaagaagacagaaaaatgtgggaaagtt agagatttggtgaatggctttgaccaaaatgctgatagcaatatggacaataaggtccag gctgatgtggtctcagatggagatgaggaacttgttgggaactggagtaaaggtgactct tgttatgttttagcaaagagactggcggcattttgcccctgctctaaagatttgtggaag tttgaacttgagaaatatgatttagggtatttggtgggagaaatttctaagcagcaaagc aagaggccagtaggaaaaagtggttttgtgggctgggcccagggtccccgtgctgtgtgc agtctaggaacttggtaccctgtgtcccagctgctccagccgtggctgaaaggggccaag cagctcatcaattacagttcaggttttcctatgctggcactggtttctgcagcagcctct gctcatgagtcagctctgagctggcgcactgtgctgggagaatcccctttgtcagaatca gctgctgtcttcagagccagcaggcaggaaagattgagtccactgaagctgtgcccacag ccacccctctgcactggtgctctgtcccagggagatgagagtttcatcacaaatattttg ccagatatcgaaaatgaagatttcatcaaagactgcgttcgaatccataacaagttccga tcagaggtgaaaccaacagccagtgatatgctatacatgacttgggacccagcactagcc caaattgcaaaagcatgggccagcaattgccagttttcacataatacacggctgaagcca ccccacaagctgcacccaaacttcacttcactgggagagaacatctggactgggtctgtg cccattttttctgtgtcttccgccatcacaaactggtatgacgaaatccaggactatgac ttcaagactcggatatgcaaaaaagtctgtggccactacactcaggtgtcacaggaaagg ggtcgggatccaaaccccaagagagggttcttgaatctcgcgcgagaaaatccagggttg tttgggcagatagttacaaagttggctgcgcagttcaattttgccctaaagtttctggct ttgacgctctttccaatggagcacattttatatgcaactacggaccagggtatgaagacc agggatgctcaccagtacccagaggaccagatgattgatgggttctaccacgagcatcta atagcgactgctgcagaactggtgtggaaaaggtttctattttataaaagaaagtatcca gaaggctaa >gi568815586f:75380881_75598975|GENSCAN_predicted_peptide_5|995_aa MGDFNTTLSTLDRSTRQKVNKDIQEQNSALHQVDLIDIYRTLHPKSTEYTFFSAPHYAYS KTNHIVGSKALLSKCKRTEIITNCLSDHSAIKLELRIKKLTQNRSTTWKLNNLLLNDYWV HNEMKAEIKMFFETNKNKDTTYQNLWDTFKAVYDMTVYLENPIISAQNLLKLISNFSKVS AYKINVQKSQAFLYTNNRQTESQILSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLL NEIKKDTNKRKNIPCSQVGRINIVKMSILPKVIYRFNAIPIKLPMTFFTELEKTALKFTW NQKRAHIAKSILSQKNKAGGITLPDFKLYCKTTVTKTAWYQYQNRDIDQWNRTEPSEIMP HIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLTSYTKINSRWIKDLNV RPKTIKTLEENLGNSIQDIGMGKDFMSKTPKAMATKAKIERWDLIKLKSFCTAKETTIRV NRQPTEWEKIFAIYSSDKGLISRIYNELQQIYKKKTNNPIKKRAKDMNRHFSKEDIYAAK RHMKKCSSSLTIREMQMKTTVRYHLTPVRMAIIKKSGNNRCWRGCGEIGTLLHCWWDCKL VQPLWKSVWRFLRDLELELLFDPAIPLLGIYPKDYKSCCYKDTCTRFKNSFISGATGSSL QMASPSLERPEKGAGKSEFRNQKPKPENQDESELLTVPDGWKEPAFSKEDNPRGLLEESS FATLFPKYREAYLKECWPLVQKALNEHHVNATLDLIEGSMTVCTTKKTFDPYIIIRARDL IKLLARSVSFEQAVRILQDDVACDIIKIGSLVRNKERFVKRRQRLIGPKGSTLKSLMIKR ELAKDSELRSQSWERFLPQFKHKNVNKRKEPKKKTVKKEYTPFPPPQPESQIDKELASGE YFLKANQKKRQKMEAIKAKQAEAISKRQEERNKAFIPPKEKPIVKPKEASTETKIDVASI KEKVKKAKNKKLGALTAEEIALKMEADEKKKKKKK >gi568815586f:75380881_75598975|GENSCAN_predicted_CDS_5|2988_bp atgggagactttaacaccacactgtcaacattagacagatcaacaagacagaaagttaac aaggatatccaggaacagaactcagctctgcaccaagtggacctaatagacatctacaga actctccaccccaaatcaacagaatatacattcttctcagcaccacactacgcctattcc aaaactaaccacatagttggaagtaaagcactcctcagcaaatgtaaaagaacagaaatt ataacaaactgtctctcagaccacagtgcaatcaaactagaactcaggattaagaaactc actcaaaaccgctcaactacatggaaactgaacaacctgctcctgaatgactactgggta cataacgaaatgaaggcagaaataaagatgttctttgaaaccaacaagaataaagacaca acataccagaatctctgggacacatttaaagcagtgtatgacatgactgtatatttagaa aaccccatcatctcagcccaaaatctccttaagctgataagcaacttcagcaaagtctca gcatacaaaatcaatgtgcaaaaatcacaagcattcttatacaccaataacagacaaaca gagagccaaatcttgagtgaactcccattcacaattgcttcaaagagaataaaataccta ggaatccaacttacaagggatgtgaaggacctcttcaaggagaactacaaaccactgctc aatgaaataaaaaaggatacaaacaaacggaagaacattccatgctcacaggtaggaaga atcaatatcgtgaaaatgtccatactgcccaaggtaatttatagattcaatgccatcccc atcaagctaccaatgactttcttcacagaattggaaaaaactgctttaaagttcacatgg aaccaaaaaagagcccacatcgccaagtcaatcctaagccaaaagaacaaagctggaggc atcacgctacctgacttcaaactatactgcaagactacagtaaccaaaacagcatggtac cagtaccaaaacagagatatagatcaatggaacagaacagagccctcagaaataatgcca catatctacaactatctgatctttgacaaacctgagaaaaacaagcaatggggaaaggat tccctatttaataaatggtgctgggaaaactggctagccatatgtagaaagctgaaactg gatcccttccttacatcttatacaaaaattaattcaagatggattaaagacttaaatgtt agacctaaaaccataaaaaccctagaagaaaacctaggcaatagcattcaggacataggc atgggcaaggacttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgag agatgggatctaattaaactaaagagcttctgcacagccaaagaaactaccatcagagtg aacaggcaacctacagaatgggagaaaatttttgcaatctactcatctgacaaagggcta atatccagaatctacaatgaactccaacaaatttacaagaaaaaaacaaacaaccccatc aaaaagcgggcgaaggatatgaacagacacttctcaaaagaagacatttatgcagccaaa agacacatgaaaaaatgctcatcatcactgaccatcagagaaatgcaaatgaaaaccaca gtgagataccatctcacaccagttagaatggcgatcattaaaaagtcaggaaacaacagg tgctggagaggatgtggagaaataggaacacttttacactgctggtgggactgtaaacta gttcaaccattgtggaaatcagtgtggcgattcctcagggatctagaactagaattacta tttgacccagccatcccattactgggtatatacccaaaggattataaatcatgctgctat aaagacacatgcacacggtttaagaacagtttcatttccggagccaccggaagcagcttg caaatggcgtctccctcgctggagcggccagaaaaaggcgctggaaaaagtgaatttcgt aaccagaagccgaagccggagaaccaagatgaatcagaactccttacggttcctgatggt tggaaggaaccagctttttccaaagaggacaatcccagaggacttttggaggagagcagt ttcgcaactttgttcccaaaatacagggaagcttacttgaaagagtgttggccattggtg cagaaagccttaaatgaacatcatgttaatgcaaccctggacctgatcgaaggcagcatg actgtttgtactacaaagaagacttttgatccatatatcatcattagggccagagatctg ataaaactgttagcaaggagtgtttcatttgaacaggcagtacgaattcttcaggatgat gttgcatgtgacatcattaaaataggttctttagtaaggaataaagagagatttgtaaaa cgaagacaacggcttattggtcccaaaggatctacattgaagagcttaatgattaagaga gagttggcaaaagattctgaattacgatcacaaagttgggagagatttttgccacagttc aaacacaaaaatgtgaataaacgcaaggaaccaaagaaaaaaactgttaagaaagaatat acgccattcccaccaccacaaccagaaagtcagatcgataaagaattggctagtggtgaa tactttttgaaggcaaatcagaagaagcggcagaaaatggaagcaataaaggctaaacaa gcagaagccatcagtaagagacaagaggaaagaaacaaagcatttattccacctaaggaa aaaccaattgtgaaacctaaggaagcttctactgaaactaaaattgatgtggccagcatc aaggaaaaggttaagaaagcaaagaataagaaactgggagctcttacagctgaagaaatt gcacttaagatggaggcagatgaaaagaaaaagaagaaaaaaaagtaa >gi568815586f:75380881_75598975|GENSCAN_predicted_peptide_6|235_aa MELNMHLVQDSGKLMQQEDLAGAQLLPLTNKETKAQEIKSFALGYPDEDHFGVLFYQLEE LRVLGQDTDPHALIGNLRSVALEKETPPLGTGHSQTKGSRNLCRLKCPCLTALKRVVVLP ARSWRSEIRQTASSSGSLTPEYPNWEAPPSGSRLTPHTAGYSSETKLLEERSGSNICCSP IFTVLQHPLLIPRQTGSGVHLQQTPTDLQLRVLTVRRKTNKQKGHPHQNPICMSP >gi568815586f:75380881_75598975|GENSCAN_predicted_CDS_6|708_bp atggagctaaatatgcatctggtccaggattcagggaaattgatgcagcaggaggatctg gcaggagcccagctgctgcccctcaccaacaaggaaaccaaagcccaggaaattaagtca tttgcactgggatacccagatgaggatcactttggtgtccttttctaccagttggaagaa ctacgtgtccttggacaagacactgaccctcatgcattgataggcaatttgcgaagtgtg gccttggagaaagagactccacctctggggacggggcatagccaaacaaaaggcagcaga aacctctgcagacttaaatgtccctgtctgacagctttgaagagagtagtggttctccca gcacgcagctggagatctgagatcagacagactgcctcctcaagtgggtcactgaccccc gagtaccctaactgggaagcaccccccagtgggagcagactgacaccccacacggctggg tactcctctgagacaaaacttctagaggaacgatcaggcagcaacatttgctgttcacca atattcactgttctgcagcatccgctgctgatacccaggcaaacagggtctggagtgcac ctccagcaaactccaacagacctgcagctgagggtcctgactgttagaaggaaaactaac aaacagaaaggacatccacaccaaaaccccatctgtatgtcaccatga >gi568815586f:75380881_75598975|GENSCAN_predicted_peptide_7|77_aa MGECDLGKSLNVSGSQHSFLYDERIKRSVSEIPSKCNIPDSKGSSLRTSSSTAFYSLIIE ERMLRTILNGIAFTSLS >gi568815586f:75380881_75598975|GENSCAN_predicted_CDS_7|234_bp atgggtgagtgtgaccttgggaaatcactcaacgtttctggttctcagcattctttcctt tatgatgagagaattaagcgcagtgtctctgagatcccttccaaatgtaacatccctgat agcaaaggttcatccctgaggacatcctcatccactgcattttattctctcatcatagag gagagaatgctgagaaccatcctcaatggcatcgccttcacaagtctttcctga >gi568815586f:75380881_75598975|GENSCAN_predicted_peptide_8|109_aa MVKDKGRAMGHLTWWLAKRENESQAKRETPYKIIRSCETYSLPREQYGGNRHHDSIMSHQ DHSLENNISWSERQSPEMGENHPENLVVMEGYSQEKRFSTSTPLTFWAS >gi568815586f:75380881_75598975|GENSCAN_predicted_CDS_8|327_bp atggtgaaagacaaaggaagagcaatgggacatcttacatggtggctggcaaagagagag aatgagagccaagcgaaaagggaaaccccttataaaatcatcagatcttgtgagacttat tcactaccacgagaacagtatggaggaaaccgccaccatgattcaattatgtctcaccag gatcactcgcttgaaaacaacatctcatggtcagagcggcaatcccctgaaatgggggag aatcaccctgaaaatctggtggttatggaaggttattctcaagaaaagagattttcaacc tcaacgccattgacattttgggcaagn