GENSCAN 1.0  Date run: 7-Nov-116  Time: 20:27:22

Sequence gi568815595f:196682647_196928402 : 245756 bp : 42.57% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.04 PlyA -    582    577    6                               1.05
 1.03 Term -   6290   6169  122  0  2   18   43   117 0.069  -2.14
 1.02 Intr -   7847   7792   56  1  2  107   91    35 0.125   3.40
 1.01 Init -  21000  20564  437  0  2   41   99   214 0.145  12.94
 1.00 Prom -  22079  22040   40                              -5.15

 2.03 PlyA -  23650  23645    6                               1.05
 2.02 Term -  25266  24905  362  1  2   71   43   377 0.995  25.21
 2.01 Init -  26023  25882  142  1  1   70   82    89 0.626   6.85
 2.00 Prom -  27083  27044   40                              -9.65

 3.00 Prom +  27609  27648   40                              -5.35
 3.01 Init +  29301  29387   87  2  0   63   37   100 0.480   3.09
 3.02 Intr +  38684  38710   27  1  0  100  106    13 0.407   1.69
 3.03 Intr +  39842  39910   69  1  0   76   77    88 0.444   4.96
 3.04 Intr +  45277  45490  214  0  1   90  100   251 0.802  23.87
 3.05 Term +  55168  55232   65  2  2  129   53    18 0.069  -0.53
 3.06 PlyA +  57058  57063    6                               1.05

 4.04 PlyA -  59124  59119    6                               1.05
 4.03 Term -  60745  60635  111  1  0   90   42    88 0.336   1.98
 4.02 Intr -  72330  72091  240  0  0   34  110   100 0.101   3.62
 4.01 Init -  78997  78959   39  1  0   81   81    23 0.261   1.14
 4.00 Prom -  87636  87597   40                              -5.15

 5.00 Prom +  90242  90281   40                              -5.85
 5.01 Init + 100001 100187  187  1  1   57   94   161 0.944  12.87
 5.02 Intr + 119281 119381  101  2  2  104   93   120 0.999  13.01
 5.03 Intr + 120371 120518  148  1  1   78   99   158 0.982  14.79
 5.04 Intr + 122706 122737   32  1  2   77   92    23 0.950  -1.47
 5.05 Intr + 123933 124040  108  2  0   56   70   142 0.964   8.76
 5.06 Intr + 124427 124474   48  1  0   49  110    61 0.806   2.36
 5.07 Intr + 125136 125268  133  2  1   73   44   153 0.942   8.70
 5.08 Intr + 127944 128007   64  1  1   97  110    23 0.980   2.26
 5.09 Intr + 129573 129621   49  0  1  110   53    32 0.995  -0.34
 5.10 Intr + 130093 130205  113  0  2   93  111   106 0.995  11.66
 5.11 Intr + 131805 131922  118  0  1   59   92   169 0.998  13.95
 5.12 Intr + 135411 135510  100  2  1   77   70    64 0.902   2.26
 5.13 Intr + 137725 137921  197  2  2  104   90   214 0.978  21.41
 5.14 Intr + 144550 144687  138  0  0   59   99   139 0.901  11.84
 5.15 Term + 147919 148020  102  0  0   96   38   100 0.899   3.10
 5.16 PlyA + 149824 149829    6                               1.05

 6.00 Prom + 174519 174558   40                              -6.05
 6.01 Init + 176543 176777  235  1  1   58   26   175 0.147   6.65
 6.02 Intr + 179246 179273   28  0  1   37   68    11 0.127  -9.74
 6.03 Term + 180433 180829  397  1  1   53   43   299 0.662  15.46
 6.04 PlyA + 180900 180905    6                               1.05

 7.03 PlyA - 184717 184712    6                               1.05
 7.02 Term - 185991 184790 1202  1  2   39   48   524 0.609  34.35
 7.01 Init - 196925 196775  151  2  1   54   16   102 0.204  -0.15
 7.00 Prom - 198620 198581   40                              -6.05

 8.00 Prom + 199630 199669   40                              -9.55
 8.01 Init + 202536 204048 1513  2  1   55   72   959 0.822  83.39
 8.02 Intr + 217020 217125  106  1  1   94   44   102 0.981   4.75
 8.03 Intr + 217278 217416  139  0  1   89   92   120 0.971  12.05
 8.04 Intr + 232081 232140   60  0  0   87  109     0 0.092   0.01
 8.05 Intr + 245150 245233   84  1  0   53  115    54 0.003   3.70

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595f:196682647_196928402|GENSCAN_predicted_peptide_1|204_aa MRIDYGCLGLGMGIDYGCLGRGMGIDYGCLGLRMGIDYGCLGLGMGIDYGCLGLRMGIDY GCLGLGMGIDYGCLGLGMGIDYGCLGLRMGIDYGCLGLGMGIDYGCLGLGMGDDFLEVDA NTLKLDCVLFSQFYKFTELICALKNRQTLRETLDQRRPLGGSSPEGILAEEIRRREGFMQ AGATGKASQKELELSVSSKGDLSV >gi568815595f:196682647_196928402|GENSCAN_predicted_CDS_1|615_bp atgagaattgattatggctgcctggggctggggatgggaattgattatggctgcctgggg cgggggatgggaattgattatggctgcctggggctgcggatgggaattgattatggctgc ctggggctggggatgggaattgattatggctgcctggggctgcggatgggaattgattat ggctgcctggggctggggatgggaattgattatggctgcctggggctggggatgggaatt gattatggctgcctggggctgaggatgggaattgattatggctgcctgggtctggggatg ggaattgattatggctgcctggggctggggatgggagatgattttttggaggttgatgca aacactttaaaactggattgtgtcttgttttcacaattctataaattcactgaattgatt tgtgcacttaaaaacaggcagacccttcgggagacacttgaccagcgcaggcccttgggg
ggcagctcgccagaaggcattcttgctgaagaaattcggcggagggaaggattcatgcag gctggagcaacagggaaagcttcacaaaaggagttggagttgagtgtgtcctcaaaggga gatttgtctgtgtag >gi568815595f:196682647_196928402|GENSCAN_predicted_peptide_2|167_aa MYMGMMCTAKKCGIRFQPPAIILIYESEIKGKIRQRIMPVRNFSKFSDCTRAAEQLKNNP RHKSYLEQVSLRQLEKLFSFLRGYLSGQSLAETMEQIQRETTIDPEEDLNKLDDKELAKR KSIMDELFEKNQKKKDDPNFVYDIEVEFPQDDQLQSCGWDTESADEF >gi568815595f:196682647_196928402|GENSCAN_predicted_CDS_2|504_bp atgtacatgggaatgatgtgcactgccaagaaatgtgggattaggtttcagcctccagct attatcttaatctatgagagtgaaatcaaggggaaaattcgccagcgcattatgccagtt cgaaacttttcaaagttttcagattgcaccagagctgctgaacaattaaagaataatccg cgacacaagagttacctagaacaagtatccctgaggcagctagagaagctattcagtttt ttacgaggttacttgtcggggcagagtctggcagaaacaatggaacaaattcaacgggaa acaaccattgatcctgaggaagacctgaacaaactagatgacaaggagcttgccaaaaga aagagcatcatggatgaactttttgagaaaaatcagaagaagaaggatgatccaaatttt gtttatgacattgaggttgaatttccacaggacgatcaactgcagtcctgtggctgggac acagagtcagctgatgagttctga >gi568815595f:196682647_196928402|GENSCAN_predicted_peptide_3|153_aa MTSSAGQRLNNQEWVFPPNRSARKAPKPTHLMAFNYIWDIPAGLYVDPYELASLRERNIT EAVMVSENFDIEAPNYLSKESEVLIYARRDSQCIDCFQAFLPVHCRYHRPHSEDGEASIV VNNPDLLMFCDQGVLYFGCSCLASEILFICEIL >gi568815595f:196682647_196928402|GENSCAN_predicted_CDS_3|462_bp atgacatcgtctgcaggccaacgtctaaacaaccaggagtgggtttttccacccaaccgc agtgcacgcaaagcccctaagccgactcatctaatggcttttaattacatctgggacatt cctgcaggactttatgtggatccgtatgagttggcttcattacgagagagaaacataaca gaggcagtgatggtttcagaaaattttgatatagaggcccctaactatttgtccaaggag tctgaagttctcatttatgccagacgagattcacagtgcattgactgttttcaagccttt ttgcctgtgcactgccgctatcatcggccgcacagtgaagatggagaagcctcgattgtg gtcaataacccagatttgttgatgttttgtgaccaaggtgtattgtatttcgggtgctca tgtttggcttcggaaatcctgtttatctgtgaaatcctttga >gi568815595f:196682647_196928402|GENSCAN_predicted_peptide_4|129_aa MTMAALWNRKAGKNYEDILSLPTSHRFIWRLRVGFEESKQNQELVSHSLLHRGSFCGSHS SLSSVISQHTFREITRRKFQKANQEVCVCARVQVRKASLEKNVYLAQGKVEQQEFKRRTL APNHMGYME >gi568815595f:196682647_196928402|GENSCAN_predicted_CDS_4|390_bp 
atgacaatggcggctttgtggaatagaaaggcgggaaagaactatgaagatatcttgtct ctccctactagccatcgctttatttggagactcagagtcggctttgaagaatcaaagcag aatcaagaattggtcagccactccttgctacacagaggcagtttttgtggcagccacagt tccttatcttctgtgatctctcaacatacattcagggagattaccaggcgcaaattccag aaagcaaaccaggaagtgtgtgtctgcgcccgtgtgcaggtgagaaaagcaagtcttgaa aagaatgtatatcttgcccagggtaaagtagaacagcaagaattcaaacgcaggactcta gctcctaaccacatgggatacatggagtaa >gi568815595f:196682647_196928402|GENSCAN_predicted_peptide_5|545_aa MSDNGELEDKPPAPPVRMSSTIFSTGGKDPLSANHSLKPLPSVPEEKKPRHKIISIFSGT EKGSKKKEKERPEISPPSDFEHTIHVGFDAVTGEFTGMPEQWARLLQTSNITKLEQKKNP QAVLDVLKFYDSNTVKQKYLSFTPPEKDGFPSGTPALNAKGTEAPAVVTEEEDDDEETAP PVIAPRPDHTKSRLAVIQKAVENEIPNQIYTRSVIDPVPAPVGDSHVDGAAKSLDKQKKK TKMTDEEIMEKLRTIVSIGDPKKKYTRYEKIGQGASGTVFTATDVALGQEVAIKQINLQK QPKKELIINEILVMKELKNPNIVNFLDSYLVGDELFVVMEYLAGGSLTDVVTETCMDEAQ IAAVCRECLQALEFLHANQVIHRDIKSDNVLLGMEGSVKLTDFGFCAQITPEQSKRSTMV GTPYWMAPEVVTRKAYGPKVDIWSLGIMAIEMVEGEPPYLNENPLRALYLIATNGTPELQ NPEKLSPIFRDFLNRCLEMDVEKRGSAKELLQACDSTKGFSIDRSSLGFVVATSSIVTCT SIAGN >gi568815595f:196682647_196928402|GENSCAN_predicted_CDS_5|1638_bp atgtctgataacggagaactggaagataagcctccagcacctcctgtgcgaatgagcagc accatctttagcactggaggcaaagaccctttgtcagccaatcacagtttgaaacctttg ccctctgttccagaagagaaaaagcccaggcataaaatcatctccatattctcaggcaca gagaaaggaagtaaaaagaaagaaaaggaacggccagaaatttctcctccatctgatttt gagcacaccatccatgttggctttgatgctgttactggagaattcactggcatgccagaa cagtgggctcgattactacagacctccaatatcaccaaactagagcaaaagaagaatcct caggctgtgctggatgtcctaaagttctacgactccaacacagtgaagcagaaatatctg agctttactcctcctgagaaagatggctttccttctggaacaccagcactgaatgccaag ggaacagaagcacccgcagtagtgacagaggaggaggatgatgatgaagagactgctcct cccgttattgccccgcgaccggatcatacgaaatcacgccttgctgtcattcaaaaggca gttgagaatgaaattcctaaccagatttacacacggtctgtaattgaccctgttcctgca ccagttggtgattcacatgttgatggtgctgccaagtctttagacaaacagaaaaagaag actaagatgacagatgaagagattatggagaaattaagaactatcgtgagcataggtgac cctaagaaaaaatatacaagatatgaaaaaattggacaaggggcttctggtacagttttc 
actgctactgacgttgcactgggacaggaggttgctatcaaacaaattaatttacagaaa cagccaaagaaggaactgatcattaacgagattctggtgatgaaagaattgaaaaatccc aacatcgttaactttttggacagttacctggtaggagatgaattgtttgtggtcatggaa taccttgctggggggtcactcactgatgtggtaacagaaacgtgcatggatgaagcacag attgctgctgtatgcagagagtgtttacaggcattggagtttttacatgctaatcaagtg atccacagagacatcaaaagtgacaatgtacttttgggaatggaaggatctgttaagctc actgactttggtttctgtgcccagatcacccctgagcagagcaaacgcagtaccatggtc ggaacgccatactggatggcaccagaggtggttacacggaaagcttatggccctaaagtc gacatatggtctctgggtatcatggctattgagatggtagaaggagagcctccatacctc aatgaaaatcccttgagggccttgtacctaatagcaactaatggaaccccagaacttcag aatccagagaaactttccccaatatttcgggatttcttaaatcgatgtttggaaatggat gtggaaaaaaggggttcagccaaagaattattacaggcttgtgactcaacaaagggcttt tccattgatagaagcagtttgggatttgtagttgcgacttcttcgatagttacctgcacg tccattgctggcaactga >gi568815595f:196682647_196928402|GENSCAN_predicted_peptide_6|219_aa MDKFLETHNLPILKQEEIETLNRPISSYDSESLKGVYQPRKAPDQMDSQLNSTTCTKTKT NPPVLVRSHTAIKAYPRLGTVSRVGSFWILLIGAFYRTLIGAFYSMLIGVFYRVLIGAFY NPLASYSVLIGAFYRALIGAFYNLLASYRALIGAFYRALIGAFYNSLASYKALIGAFLQS ADWCILQSSLKTEKFSKSPLNPGSPAGFTCHYQRHFSRN >gi568815595f:196682647_196928402|GENSCAN_predicted_CDS_6|660_bp atggataaattcctggaaacacacaatctgccaattttgaagcaggaagaaattgaaacc ctgaacagaccaatatcaagttatgattctgaatcattaaaaggagtctaccaaccaaga aaagcccctgatcagatggattcacagctgaattctactacatgtacaaagactaagact aatcctcctgtattagtccgttctcacactgctataaaggcatacccaagactgggtact gtgtccagagttggttccttctggatcctgctgattggtgcattttacagaacactgatt ggtgcattttacagcatgctgattggtgtgttttacagagtgctgattggtgcattttac aatcctcttgctagctacagcgtgctgattggtgcgttttacagagcgctgattggtgca ttttacaatctgcttgctagctacagagcgctgattggtgcgttttacagagcactgatt ggtgcattttacaattctcttgctagctacaaagcattgattggcgcatttttacagagt gccgattggtgcattttacaatcctctttaaagacagaaaagttctccaagtccccactc aacccaggaagtccagctggcttcacctgtcactaccaacgtcatttttcaaggaactag >gi568815595f:196682647_196928402|GENSCAN_predicted_peptide_7|450_aa MTMCLKPGGPPIQKGAVKSQSKGSKNVILIEFLKCMKANKGETDNQKRVCIQNSHILSAS 
LHPDRVATGRRAEADTAQSRPKGKDVSHRAHAQPGSLEKGPGEREERNEGRKEAERSKQI KTPDFHGGRSWRPVTAAPPLRAFPASGGAGGRQLPGACDAQPGWSESAGQRPLGTSSLAS PGKGGCGRDGRRPPARAGEGGSRPPDRALRWRTGAGMAPASPARRSLSPSLVCPRPRLPA TLVPGSCPAAGPVAAAAWTADPAFVSSKGSPASDEASGLFAAATAAELPLKRLPRRRRWP IASPPPRPSACEAGPGTARFRRSRPASRPGAAPPPPVTASLAPASLRTRWDGNRVQELLS AFETSAPSPAFNLKGLNQTAASVHPYPPLIPSPLRRKQKTHSSSGRCEAWPRRPLSGFPI DLTPGIVIGLFIYVLPAPTVNAFQNYKPMS >gi568815595f:196682647_196928402|GENSCAN_predicted_CDS_7|1353_bp atgacaatgtgccttaagcctggagggccaccaatccagaagggagcggtcaaaagtcag tcaaaaggctccaagaatgtgatcctgatagaatttctaaagtgtatgaaagcaaataaa ggagaaacagacaaccagaagagagtttgcatccaaaactcccacatcctctccgcatcc ctccaccccgacagagtggccacagggcgtagagcggaggctgataccgctcagagcaga ccgaagggaaaagacgtctcccaccgcgctcatgcccaaccgggctctctggaaaagggt cccggggagcgcgaggaaagaaacgaaggccggaaggaggcagagcggtctaagcaaata aaaaccccggacttccacggcggccggagctggaggcctgtgacagcggctcctcctctc agagctttcccggcttcaggaggagctggggggcgccagcttcccggtgcctgcgacgca cagcccggctggagcgagagcgcagggcagcggcccctcgggacgagctcgctcgcctcc cccggaaagggaggctgtggccgagacggacggcggcctccagctcgggcgggggagggc ggctcgaggcctccggaccgggcgctgaggtggcgcaccggcgcgggaatggccccagcc agccccgccaggcgatcgctgagtccgtcactcgtctgcccgcgtccccgcttacctgct accctcgttcctggcagctgccccgccgccggccccgtcgcggcagcagcctggaccgcg gaccccgccttcgtcagctccaagggaagccccgccagcgacgaggcgtccggcctgttc gccgcggccacagccgccgagcttcctctgaagcggctgccgcgacgccggcgctggcct atcgcgtcacctcctccgcgcccttccgcctgcgaggctgggcctggcaccgcccgcttc cggcgctcccgccccgcctcccgacccggggccgccccgcccccgcccgtcaccgcgtcc ttagccccggcctccctcagaacgcgctgggatgggaaccgcgttcaggagctgctaagc gcgtttgagacttcagccccttctccagcttttaacctaaaagggctaaatcagaccgca gctagtgtacacccctaccctccgctgatcccgtcacccctgcgccgaaagcaaaagacc cattccagctctggacgttgtgaggcctggcccaggcgtccgctctctggttttcctatt gatcttacccctggtattgtgattggcctctttatctatgtcttacccgccccaactgtg aatgctttccaaaattataagccgatgtcctga >gi568815595f:196682647_196928402|GENSCAN_predicted_peptide_8|634_aa MKKQRKILWRKGIHLAFSEKWNTGFGGFKKFYFHQHLCILKAKLGRPVTWNRQLRHFQGR 
KKALQIQKTWIKDEPLCAKTKFNVATQNVSTLSSKVKRKDAKHFISSSKTLLRLQAEKLL SSAKNSDHEYCREKNLLKAVTDFPSNSALGQANGHRPRTDPQPSDFPMKFNGESQSPGES GTIVVTLNNHKRKGFCYGCCQGPEHHRNGGPLIPKKFQLNQHRRIKLSPLMMYEKLSMIR FRYRILRSQHFRTKSKVCKLRKAQRSWVQKVTGDHQETRRENGEGGSCSPFPSPEPKDPS CRHQPYFPDMDSSAVVKGTNSHVPDCHTKGSSFLGKELSLDEAFPDQQNGSATNAWDQSS CSSPKWECTELIHDIPLPEHRSNTMFISETEREIMTLGQENQTSSVSDDRVKLSVSGADT SVSSVDGPVSQKAVQNENSYQMEEDGSLKQSILSSELLDHPYCKSPLEAPLVCSGLKLEN QVGGGKNSQKASPVDDEQLSVCLSGFLDEVMKKYGSLVPLSEKEVLGRLKDVFNEDFSNR KPFINREITNYRARHQKCNFRIFYNKHMLDMDDLATLDGQNWLNDQEHQIEQLSMQQNTF IRTRNQNIRKYLLTEAREKNRPEFLQGWQTAVTK >gi568815595f:196682647_196928402|GENSCAN_predicted_CDS_8|1902_bp atgaaaaaacagaggaaaattctatggaggaaaggaatccacttagccttttctgagaaa tggaatactgggtttggaggctttaagaagttttattttcaccaacacttgtgcattctg aaagctaagctgggaaggccagttacttggaatagacagttgagacatttccagggtaga aagaaagctcttcaaatccagaaaacgtggatcaaggatgaacccctttgtgctaagacc aagttcaatgtggctactcaaaatgttagtactttgtcctctaaagtgaaaagaaaggac gctaaacacttcatttcctcctcaaagactctcctgagactccaagcagagaagctgttg tcatcagcaaagaattctgaccatgaatactgcagagagaaaaatctcttgaaggcagtt actgactttccatcaaatagtgctttaggtcaggccaatggtcacagacctaggacagac ccacaaccttctgactttcccatgaagttcaatggggagagccaaagtccaggtgagagt ggcacgattgtggtcaccttgaacaaccataagagaaagggcttttgttacggctgctgc caagggccggagcaccacaggaatgggggacccttgattccaaaaaagttccaacttaac caacatagaaggataaaattatctcctcttatgatgtatgagaaattatccatgattaga tttcggtacaggattctcagatcccagcacttcagaaccaaaagcaaggtttgcaagcta agaaaagcccagcgaagctgggtacagaaagtcactggggaccatcaagagacccgtagg gagaacggtgagggtggcagttgcagcccatttccttccccagaacctaaagacccttct tgtcggcatcagccgtactttccagatatggacagcagtgctgtggtgaaggggacgaac tctcatgtgcctgattgccacactaaaggaagctctttcttgggcaaggagcttagttta gacgaagcattccctgaccaacagaatggcagtgccacaaacgcctgggaccagtcatcc tgttcttctcctaagtgggagtgtacagagctgattcatgacatccccttaccagaacat cgttctaataccatgttcatttcagaaactgaaagagaaattatgactctgggtcaggaa aatcagacaagttctgtcagtgatgacagagtaaaactgtcagtgtctggagcagataca tctgtgagtagcgtagatgggcctgtgtcccaaaaggctgttcaaaatgagaactcatac 
cagatggaggaggatggatctctcaagcagagcattcttagttctgagttgctggaccac ccttactgtaaaagtccactggaggctcccttggtgtgcagtggactcaaactagaaaat caagtaggaggtggaaagaacagtcagaaagcctctccagtggatgatgaacagctgtca gtctgtctttctggattcctagatgaggttatgaagaagtatggcagtttggttccactc agtgaaaaagaagtccttggaagattaaaagatgtctttaatgaagacttttctaataga aaaccatttatcaatagggaaataacaaactatcgggccagacatcaaaaatgtaacttc cgtatcttctataataaacacatgctggatatggacgacctggcgactctggatggtcag aactggctgaatgaccaggaacaccaaattgaacaactatccatgcaacaaaataccttc ataagaaccagaaatcagaatataagaaagtatttgctgactgaagccagagaaaaaaat agacctgaatttcttcagggttggcagactgctgttacgaag