GENSCAN 1.0 Date run: 8-Nov-116 Time: 01:49:19 Sequence gi568815586r:75399835_75611597 : 211763 bp : 37.52% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 12431 12864 434 1 2 79 53 121 0.133 3.43 1.02 Intr + 19030 19185 156 1 0 105 37 52 0.089 0.00 1.03 Intr + 23070 23155 86 0 2 111 55 84 0.169 5.94 1.04 Intr + 30881 30907 27 0 0 130 50 32 0.608 0.77 1.05 Term + 31113 31327 215 1 2 12 48 338 0.865 18.61 1.06 PlyA + 31637 31642 6 1.05 2.00 Prom + 33709 33748 40 -5.95 2.01 Init + 34634 34731 98 1 2 83 19 47 0.242 -2.87 2.02 Term + 38551 39016 466 1 1 78 41 336 0.692 21.40 2.03 PlyA + 40367 40372 6 1.05 3.00 Prom + 40553 40592 40 -8.25 3.01 Init + 40825 40827 3 0 0 108 81 0 0.452 1.35 3.02 Intr + 41437 41738 302 0 2 2 47 241 0.384 6.01 3.03 Intr + 41957 42071 115 2 1 72 87 73 0.794 5.13 3.04 Intr + 53225 53302 78 1 0 33 119 59 0.351 2.33 3.05 Intr + 66792 66942 151 2 1 48 63 95 0.000 1.81 3.06 Intr + 81111 81220 110 1 2 39 91 138 0.212 8.28 3.07 Intr + 82000 82245 246 0 0 87 110 156 0.999 14.43 3.08 Intr + 87905 87975 71 1 2 130 96 17 0.990 3.76 3.09 Intr + 90572 90684 113 2 2 105 110 133 0.999 16.30 3.10 Term + 98167 98306 140 2 2 13 43 119 0.040 -2.96 3.11 PlyA + 98357 98362 6 1.05 4.11 PlyA - 98567 98562 6 1.05 4.10 Term - 100117 99975 143 2 2 34 38 182 0.876 4.91 4.09 Intr - 101982 101889 94 0 1 67 90 94 0.993 6.12 4.08 Intr - 102166 102089 78 1 0 51 102 80 0.958 4.53 4.07 Intr - 104240 104070 171 2 0 4 92 114 0.822 2.62 4.06 Intr - 106775 106650 126 2 0 58 63 115 0.974 5.96 4.05 Intr - 107082 106948 135 0 0 85 99 103 0.999 10.94 4.04 Intr - 108612 108440 173 1 2 89 50 137 0.997 8.74 4.03 Intr - 111810 111679 132 1 0 2 59 198 0.232 8.00 4.02 Intr - 119538 118060 1479 1 0 55 60 269 0.003 9.56 4.01 Init - 121216 120760 457 1 1 49 -3 212 0.012 4.21 4.00 Prom - 121308 121269 40 -6.15 5.06 PlyA - 121477 121472 6 1.05 5.05 Term - 123204 122752 453 0 0 62 43 152 0.663 2.27 5.04 Intr - 124606 124577 30 1 0 78 90 48 0.450 1.41 5.03 Intr - 128038 127994 45 1 0 94 88 19 0.348 0.19 5.02 Intr - 134648 134562 87 2 0 37 72 80 0.006 0.55 5.01 Init - 175607 175515 93 2 0 95 105 105 0.588 13.33 5.00 Prom - 177860 177821 40 -7.45 6.00 Prom + 178329 178368 40 -4.15 6.01 Sngl + 180151 180384 234 0 0 36 46 208 0.926 6.25 6.02 PlyA + 180669 180674 6 1.05 7.00 Prom + 183780 183819 40 -4.65 7.01 Init + 188469 188648 180 2 0 86 82 74 0.316 5.83 7.02 Intr + 194458 194517 60 0 0 113 78 50 0.949 4.51 7.03 Intr + 196343 196428 86 1 2 71 86 92 0.988 5.00 7.04 Term + 200188 200386 199 1 1 71 49 180 0.937 8.29 7.05 PlyA + 201838 201843 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:75399835_75611597|GENSCAN_predicted_peptide_1|305_aa MTSFGSEALFWINRVLKQFLQAKIDKWDLIKLKSFCTAKETTIRVNRQPTKWGKIFTTYT SDKGLISRIYNELKQIYKKKTNNPIQKWAKDMNRHFSKEDIYAAKKHMKKCSSSLAVREM QIKTTMRYHLTPVRMAIIKKSGNNRDMDEAGNHHSQQTNTRTENQTPHVLTHKWELNNEN TWTQGGEHHTLGPVGVGGTLTRRPYEPGIFCTRCGRRDKCTDFLCSNADRDQATLQSQFP NILLEQQMIFTPEESEAGNEEEEKEEEKKEKEEMEMEIMEMEEEKEEREEEEEETQKEKM EEEEK >gi568815586r:75399835_75611597|GENSCAN_predicted_CDS_1|918_bp atgacgtcctttggctctgaagcattattctggatcaaccgtgttctcaaacagttctta caagctaaaattgacaaatgggatctaattaaactaaaaagcttctgcacagcaaaagaa actaccatcagagtgaacaggcaacctacaaaatgggggaaaattttcacaacctacaca tctgacaaagggctaatatccagaatctacaatgaactcaaacaaatttacaagaaaaaa acaaacaaccccatccaaaagtgggcaaaggatatgaacagacacttctcaaaagaagac atttatgcagccaaaaaacacatgaaaaaatgctcatcatcactggccgtcagagaaatg caaatcaaaaccacaatgagataccatctcacaccagttagaatggcgatcattaaaaag tcaggaaacaacagggacatggatgaagctggaaaccatcattctcagcaaactaacaca agaacagaaaaccaaacaccacatgttctcactcataagtgggagttgaacaatgagaac acatggacacagggaggggaacatcacacactggggcctgtcggggtgggaggaacactg acgagaagaccttatgaaccaggaatattttgtactcgatgtggcagacgtgacaaatgc acagattttctatgcagtaatgcagatcgtgaccaagccacattacagtctcagtttcca aatatcttgttggaacaacaaatgatatttacccctgaggaatctgaagcagggaatgaa gaggaggaaaaagaggaagagaagaaagagaaagaggaaatggaaatggaaataatggaa atggaggaggaaaaagaagagagagaggaggaggaggaggaaacacaaaaagaaaagatg gaggaagaggaaaaataa >gi568815586r:75399835_75611597|GENSCAN_predicted_peptide_2|187_aa MAEGKEEQVTSCMDGSRQRESLCRETPILKTIRCCCANSQQGQLLLCQFPAGTSTAVDEG WRRPYFHIRSRHWYCLAAVVEPPSSPTEPNTTPISPLGGAQSSSSHKQGALRQMRVHSLV SFVPKGALALSPFGVGEGGIFDLQLAVSTDTAPADMEGQLYCLYAGTLAAPSKDYPNGLS ASNFYIV >gi568815586r:75399835_75611597|GENSCAN_predicted_CDS_2|564_bp atggcagaaggcaaggaggagcaagtcacatcttgcatggatggcagcaggcagagagag agcttatgtagagaaactcccattttaaaaaccatcaggtgctgttgtgccaattcccag cagggacagctgctactgtgtcaattcccagcagggacatccacagcagtagatgagggg tggaggagaccttatttccacatccgttccaggcactggtactgcctggctgctgtggtg gaaccaccctcctcccctacagagcccaacaccacacccatttctccgctgggaggggca caatcatcgtcttctcacaagcagggagctctcaggcagatgagagtgcacagtctggtt tcctttgtcccaaagggtgctttggcactctctccctttggtgtgggtgagggtgggatt tttgatctgcagttggctgtatccacagacacagcacctgcagatatggagggccaactg tactgcctctatgctggaactttagctgcaccttcaaaagactatcctaatggtttatcg gccagcaatttctacatagtgtga >gi568815586r:75399835_75611597|GENSCAN_predicted_peptide_3|442_aa MAEVGTVWRAQKKTEKCGKVRDLVNGFDQNADSNMDNKVQADVVSDGDEELVGNWSKGDS CYVLAKRLAAFCPCSKDLWKFELEKYDLGYLVGEISKQQSKRPVGKSGFVGWAQGPRAVC SLGTWYPVSQLLQPWLKGAKQLINYSSGFPMLALVSAAASAHESALSWRTVLGESPLSES AAVFRASRQERLSPLKLCPQPPLCTGALSQGDESFITNILPDIENEDFIKDCVRIHNKFR SEVKPTASDMLYMTWDPALAQIAKAWASNCQFSHNTRLKPPHKLHPNFTSLGENIWTGSV PIFSVSSAITNWYDEIQDYDFKTRICKKVCGHYTQVSQERGRDPNPKRGFLNLARENPGL FGQIVTKLAAQFNFALKFLALTLFPMEHILYATTDQGMKTRDAHQYPEDQMIDGFYHEHL IATAAELVWKRFLFYKRKYPEG >gi568815586r:75399835_75611597|GENSCAN_predicted_CDS_3|1329_bp atggcagaggttggaacagtttggagggctcagaagaagacagaaaaatgtgggaaagtt agagatttggtgaatggctttgaccaaaatgctgatagcaatatggacaataaggtccag gctgatgtggtctcagatggagatgaggaacttgttgggaactggagtaaaggtgactct tgttatgttttagcaaagagactggcggcattttgcccctgctctaaagatttgtggaag tttgaacttgagaaatatgatttagggtatttggtgggagaaatttctaagcagcaaagc aagaggccagtaggaaaaagtggttttgtgggctgggcccagggtccccgtgctgtgtgc agtctaggaacttggtaccctgtgtcccagctgctccagccgtggctgaaaggggccaag cagctcatcaattacagttcaggttttcctatgctggcactggtttctgcagcagcctct gctcatgagtcagctctgagctggcgcactgtgctgggagaatcccctttgtcagaatca gctgctgtcttcagagccagcaggcaggaaagattgagtccactgaagctgtgcccacag ccacccctctgcactggtgctctgtcccagggagatgagagtttcatcacaaatattttg ccagatatcgaaaatgaagatttcatcaaagactgcgttcgaatccataacaagttccga tcagaggtgaaaccaacagccagtgatatgctatacatgacttgggacccagcactagcc caaattgcaaaagcatgggccagcaattgccagttttcacataatacacggctgaagcca ccccacaagctgcacccaaacttcacttcactgggagagaacatctggactgggtctgtg cccattttttctgtgtcttccgccatcacaaactggtatgacgaaatccaggactatgac ttcaagactcggatatgcaaaaaagtctgtggccactacactcaggtgtcacaggaaagg ggtcgggatccaaaccccaagagagggttcttgaatctcgcgcgagaaaatccagggttg tttgggcagatagttacaaagttggctgcgcagttcaattttgccctaaagtttctggct ttgacgctctttccaatggagcacattttatatgcaactacggaccagggtatgaagacc agggatgctcaccagtacccagaggaccagatgattgatgggttctaccacgagcatcta atagcgactgctgcagaactggtgtggaaaaggtttctattttataaaagaaagtatcca gaaggctaa >gi568815586r:75399835_75611597|GENSCAN_predicted_peptide_4|995_aa MGDFNTTLSTLDRSTRQKVNKDIQEQNSALHQVDLIDIYRTLHPKSTEYTFFSAPHYAYS KTNHIVGSKALLSKCKRTEIITNCLSDHSAIKLELRIKKLTQNRSTTWKLNNLLLNDYWV HNEMKAEIKMFFETNKNKDTTYQNLWDTFKAVYDMTVYLENPIISAQNLLKLISNFSKVS AYKINVQKSQAFLYTNNRQTESQILSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLL NEIKKDTNKRKNIPCSQVGRINIVKMSILPKVIYRFNAIPIKLPMTFFTELEKTALKFTW NQKRAHIAKSILSQKNKAGGITLPDFKLYCKTTVTKTAWYQYQNRDIDQWNRTEPSEIMP HIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLTSYTKINSRWIKDLNV RPKTIKTLEENLGNSIQDIGMGKDFMSKTPKAMATKAKIERWDLIKLKSFCTAKETTIRV NRQPTEWEKIFAIYSSDKGLISRIYNELQQIYKKKTNNPIKKRAKDMNRHFSKEDIYAAK RHMKKCSSSLTIREMQMKTTVRYHLTPVRMAIIKKSGNNRCWRGCGEIGTLLHCWWDCKL VQPLWKSVWRFLRDLELELLFDPAIPLLGIYPKDYKSCCYKDTCTRFKNSFISGATGSSL QMASPSLERPEKGAGKSEFRNQKPKPENQDESELLTVPDGWKEPAFSKEDNPRGLLEESS FATLFPKYREAYLKECWPLVQKALNEHHVNATLDLIEGSMTVCTTKKTFDPYIIIRARDL IKLLARSVSFEQAVRILQDDVACDIIKIGSLVRNKERFVKRRQRLIGPKGSTLKSLMIKR ELAKDSELRSQSWERFLPQFKHKNVNKRKEPKKKTVKKEYTPFPPPQPESQIDKELASGE YFLKANQKKRQKMEAIKAKQAEAISKRQEERNKAFIPPKEKPIVKPKEASTETKIDVASI KEKVKKAKNKKLGALTAEEIALKMEADEKKKKKKK >gi568815586r:75399835_75611597|GENSCAN_predicted_CDS_4|2988_bp atgggagactttaacaccacactgtcaacattagacagatcaacaagacagaaagttaac aaggatatccaggaacagaactcagctctgcaccaagtggacctaatagacatctacaga actctccaccccaaatcaacagaatatacattcttctcagcaccacactacgcctattcc aaaactaaccacatagttggaagtaaagcactcctcagcaaatgtaaaagaacagaaatt ataacaaactgtctctcagaccacagtgcaatcaaactagaactcaggattaagaaactc actcaaaaccgctcaactacatggaaactgaacaacctgctcctgaatgactactgggta cataacgaaatgaaggcagaaataaagatgttctttgaaaccaacaagaataaagacaca acataccagaatctctgggacacatttaaagcagtgtatgacatgactgtatatttagaa aaccccatcatctcagcccaaaatctccttaagctgataagcaacttcagcaaagtctca gcatacaaaatcaatgtgcaaaaatcacaagcattcttatacaccaataacagacaaaca gagagccaaatcttgagtgaactcccattcacaattgcttcaaagagaataaaataccta ggaatccaacttacaagggatgtgaaggacctcttcaaggagaactacaaaccactgctc aatgaaataaaaaaggatacaaacaaacggaagaacattccatgctcacaggtaggaaga atcaatatcgtgaaaatgtccatactgcccaaggtaatttatagattcaatgccatcccc atcaagctaccaatgactttcttcacagaattggaaaaaactgctttaaagttcacatgg aaccaaaaaagagcccacatcgccaagtcaatcctaagccaaaagaacaaagctggaggc atcacgctacctgacttcaaactatactgcaagactacagtaaccaaaacagcatggtac cagtaccaaaacagagatatagatcaatggaacagaacagagccctcagaaataatgcca catatctacaactatctgatctttgacaaacctgagaaaaacaagcaatggggaaaggat tccctatttaataaatggtgctgggaaaactggctagccatatgtagaaagctgaaactg gatcccttccttacatcttatacaaaaattaattcaagatggattaaagacttaaatgtt agacctaaaaccataaaaaccctagaagaaaacctaggcaatagcattcaggacataggc atgggcaaggacttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgag agatgggatctaattaaactaaagagcttctgcacagccaaagaaactaccatcagagtg aacaggcaacctacagaatgggagaaaatttttgcaatctactcatctgacaaagggcta atatccagaatctacaatgaactccaacaaatttacaagaaaaaaacaaacaaccccatc aaaaagcgggcgaaggatatgaacagacacttctcaaaagaagacatttatgcagccaaa agacacatgaaaaaatgctcatcatcactgaccatcagagaaatgcaaatgaaaaccaca gtgagataccatctcacaccagttagaatggcgatcattaaaaagtcaggaaacaacagg tgctggagaggatgtggagaaataggaacacttttacactgctggtgggactgtaaacta gttcaaccattgtggaaatcagtgtggcgattcctcagggatctagaactagaattacta tttgacccagccatcccattactgggtatatacccaaaggattataaatcatgctgctat aaagacacatgcacacggtttaagaacagtttcatttccggagccaccggaagcagcttg caaatggcgtctccctcgctggagcggccagaaaaaggcgctggaaaaagtgaatttcgt aaccagaagccgaagccggagaaccaagatgaatcagaactccttacggttcctgatggt tggaaggaaccagctttttccaaagaggacaatcccagaggacttttggaggagagcagt ttcgcaactttgttcccaaaatacagggaagcttacttgaaagagtgttggccattggtg cagaaagccttaaatgaacatcatgttaatgcaaccctggacctgatcgaaggcagcatg actgtttgtactacaaagaagacttttgatccatatatcatcattagggccagagatctg ataaaactgttagcaaggagtgtttcatttgaacaggcagtacgaattcttcaggatgat gttgcatgtgacatcattaaaataggttctttagtaaggaataaagagagatttgtaaaa cgaagacaacggcttattggtcccaaaggatctacattgaagagcttaatgattaagaga gagttggcaaaagattctgaattacgatcacaaagttgggagagatttttgccacagttc aaacacaaaaatgtgaataaacgcaaggaaccaaagaaaaaaactgttaagaaagaatat acgccattcccaccaccacaaccagaaagtcagatcgataaagaattggctagtggtgaa tactttttgaaggcaaatcagaagaagcggcagaaaatggaagcaataaaggctaaacaa gcagaagccatcagtaagagacaagaggaaagaaacaaagcatttattccacctaaggaa aaaccaattgtgaaacctaaggaagcttctactgaaactaaaattgatgtggccagcatc aaggaaaaggttaagaaagcaaagaataagaaactgggagctcttacagctgaagaaatt gcacttaagatggaggcagatgaaaagaaaaagaagaaaaaaaagtaa >gi568815586r:75399835_75611597|GENSCAN_predicted_peptide_5|235_aa MELNMHLVQDSGKLMQQEDLAGAQLLPLTNKETKAQEIKSFALGYPDEDHFGVLFYQLEE LRVLGQDTDPHALIGNLRSVALEKETPPLGTGHSQTKGSRNLCRLKCPCLTALKRVVVLP ARSWRSEIRQTASSSGSLTPEYPNWEAPPSGSRLTPHTAGYSSETKLLEERSGSNICCSP IFTVLQHPLLIPRQTGSGVHLQQTPTDLQLRVLTVRRKTNKQKGHPHQNPICMSP >gi568815586r:75399835_75611597|GENSCAN_predicted_CDS_5|708_bp atggagctaaatatgcatctggtccaggattcagggaaattgatgcagcaggaggatctg gcaggagcccagctgctgcccctcaccaacaaggaaaccaaagcccaggaaattaagtca tttgcactgggatacccagatgaggatcactttggtgtccttttctaccagttggaagaa ctacgtgtccttggacaagacactgaccctcatgcattgataggcaatttgcgaagtgtg gccttggagaaagagactccacctctggggacggggcatagccaaacaaaaggcagcaga aacctctgcagacttaaatgtccctgtctgacagctttgaagagagtagtggttctccca gcacgcagctggagatctgagatcagacagactgcctcctcaagtgggtcactgaccccc gagtaccctaactgggaagcaccccccagtgggagcagactgacaccccacacggctggg tactcctctgagacaaaacttctagaggaacgatcaggcagcaacatttgctgttcacca atattcactgttctgcagcatccgctgctgatacccaggcaaacagggtctggagtgcac ctccagcaaactccaacagacctgcagctgagggtcctgactgttagaaggaaaactaac aaacagaaaggacatccacaccaaaaccccatctgtatgtcaccatga >gi568815586r:75399835_75611597|GENSCAN_predicted_peptide_6|77_aa MGECDLGKSLNVSGSQHSFLYDERIKRSVSEIPSKCNIPDSKGSSLRTSSSTAFYSLIIE ERMLRTILNGIAFTSLS >gi568815586r:75399835_75611597|GENSCAN_predicted_CDS_6|234_bp atgggtgagtgtgaccttgggaaatcactcaacgtttctggttctcagcattctttcctt tatgatgagagaattaagcgcagtgtctctgagatcccttccaaatgtaacatccctgat agcaaaggttcatccctgaggacatcctcatccactgcattttattctctcatcatagag gagagaatgctgagaaccatcctcaatggcatcgccttcacaagtctttcctga >gi568815586r:75399835_75611597|GENSCAN_predicted_peptide_7|174_aa MVKDKGRAMGHLTWWLAKRENESQAKRETPYKIIRSCETYSLPREQYGGNRHHDSIMSHQ DHSLENNISWSERQSPEMGENHPENLVVMEGYSQEKRFSTSTPLTFWASSTLSDASTMAQ AGIALIGLAVMGQNLILCMNDHGFMACAFNSTVSKVDDFLATRQREPKMLMFSP >gi568815586r:75399835_75611597|GENSCAN_predicted_CDS_7|525_bp atggtgaaagacaaaggaagagcaatgggacatcttacatggtggctggcaaagagagag aatgagagccaagcgaaaagggaaaccccttataaaatcatcagatcttgtgagacttat tcactaccacgagaacagtatggaggaaaccgccaccatgattcaattatgtctcaccag gatcactcgcttgaaaacaacatctcatggtcagagcggcaatcccctgaaatgggggag aatcaccctgaaaatctggtggttatggaaggttattctcaagaaaagagattttcaacc tcaacgccattgacattttgggcaagttctactctgtccgatgcttccaccatggcccaa gctggcattgcactgattggactggctgtcatgggccagaacttaatattgtgcatgaat gaccatggcttcatggcctgtgcttttaatagtacagtctccaaagttgatgatttcttg gcaacaaggcaaagggaaccaaagatgttgatgttcagtccttga