GENSCAN 1.0 Date run: 8-Nov-116 Time: 04:52:22 Sequence gi568815597f:110239406_110445606 : 206201 bp : 44.24% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.11 PlyA - 1972 1967 6 1.05 1.10 Term - 3615 3426 190 2 1 76 50 99 0.037 1.82 1.09 Intr - 17300 17216 85 0 1 92 79 51 0.085 3.48 1.08 Intr - 23552 23434 119 1 2 95 115 -14 0.159 2.11 1.07 Intr - 30879 30714 166 1 1 19 94 139 0.569 6.62 1.06 Intr - 31626 31416 211 0 1 97 61 79 0.697 4.59 1.05 Intr - 34248 34124 125 1 2 59 65 42 0.481 -0.70 1.04 Intr - 43445 43284 162 0 0 48 44 134 0.621 4.95 1.03 Intr - 43924 43838 87 2 0 54 75 64 0.165 1.64 1.02 Intr - 53374 53336 39 2 0 84 98 24 0.031 1.20 1.01 Init - 62317 62311 7 1 1 79 95 0 0.012 0.92 1.00 Prom - 76538 76499 40 -2.76 2.00 Prom + 92898 92937 40 -3.76 2.01 Sngl + 100133 102874 2742 1 0 90 39 2080 0.917 196.22 2.02 PlyA + 104848 104853 6 1.05 3.07 PlyA - 104974 104969 6 1.05 3.06 Term - 126250 126224 27 1 0 121 38 20 0.178 -1.63 3.05 Intr - 137756 137545 212 0 2 48 105 182 0.992 14.53 3.04 Intr - 139951 139448 504 2 0 81 110 89 0.687 3.35 3.03 Intr - 141738 141577 162 1 0 90 98 48 0.969 5.95 3.02 Intr - 142390 142247 144 2 0 82 117 17 0.940 4.25 3.01 Init - 143540 143429 112 2 1 59 75 63 0.473 2.47 3.00 Prom - 146409 146370 40 -6.36 4.00 Prom + 150680 150719 40 -3.06 4.01 Init + 157043 157118 76 1 1 98 68 96 0.602 8.03 4.02 Term + 157160 157266 107 0 2 90 37 51 0.535 -1.23 4.03 PlyA + 157746 157751 6 1.05 5.05 PlyA - 158004 157999 6 1.05 5.04 Term - 162178 162118 61 0 1 98 44 63 0.620 0.18 5.03 Intr - 164631 164514 118 1 1 96 92 49 0.672 5.62 5.02 Intr - 166974 166913 62 2 2 45 81 42 0.512 -2.42 5.01 Init - 168461 168181 281 2 2 104 94 71 0.604 5.79 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:110239406_110445606|GENSCAN_predicted_peptide_1|396_aa MDALTAQAYKLTPREVLWFEFDPTKMHVETDPQCGGVGKWGPVTVFYWTQVHQSLRDLVT VSQWFLNLNNYEHAYKSNVWNCKAKQGPSPGPQQPFEEASHCDQEHIVPRFAGEGQGGHQ WAGRSNTVQIHSPNLVANSLNCHPQDPSIGFQAFLKRSFKRVPFSPPGSKSTEELIFHIN RLLQCPGMCTSELRPPQSQSAGAEKCNATEAYFQRNEANAAGNIQTERTQRGESEGGMTS TRGEDRGRSGTAHDQGKPVGRQCFSRAILGDGVTRSLTTDAQGLTGGKDRTSSSRQMLII SPDCADLSFFHYNHCDRAWSEQGLRRLWTFMSETPCLERLSHLPKVTQLTSGKASGARIS NLAVDQKSMAVVVLVPRGWPQSQCRSMVKIACEIYL >gi568815597f:110239406_110445606|GENSCAN_predicted_CDS_1|1191_bp atggatgccctgacagcccaggcctataagctgacacccagagaagtgctatggtttgag tttgaccccaccaaaatgcatgttgaaactgatccccaatgtggcggtgttgggaagtgg ggcccggtgacagttttttactggactcaggttcatcagtctcttagggatttagtgact gtaagtcaatggttcttgaacctgaacaactacgagcatgcttataaatccaatgtctgg aactgcaaagccaagcaagggccatcccctggtccacagcaacccttcgaagaagcctcc cactgtgaccaggagcacatagttccacgatttgctggagagggccagggtggccatcag tgggcagggaggagcaacacagttcaaatccacagcccaaacctggtagccaattctctg aactgtcatcctcaggatccttccattggcttccaagccttcctgaaaagatctttcaag agggttccattcagccccccagggtctaaatctactgaagagctgatcttccacatcaat agactcctccaatgtcctggaatgtgtacctctgaattgaggccaccgcagagccagtct gcaggggctgaaaaatgcaatgcaactgaggcttatttccagaggaatgaggccaatgca gcaggcaacatccaaacagaaaggacccagagaggagagtctgagggcggcatgacttca acaaggggtgaggacagaggtagatcaggaactgctcatgaccaagggaagccagttgga cgtcagtgtttttccagggctatcctaggtgatggggttacgcggagtctcaccacagat gctcaaggcttgacaggcgggaaagacaggacgtcctcatcacgacaaatgcttattatc agtcctgactgtgcagacttgtctttcttccactacaaccactgtgaccgggcctggagc gagcaaggcctccgtcgcctctggacttttatgagtgagacaccatgcttagaaaggtta agtcatttgcccaaggtcacgcagctcacaagtggcaaagccagtggagccagaattagc aatctggccgtggaccagaaatcgatggcagtggtggtgctggtgccaaggggctggcca cagagccagtgtagaagcatggtcaaaatcgcctgtgaaatctatctgtga >gi568815597f:110239406_110445606|GENSCAN_predicted_peptide_2|913_aa MKGKERSPVKAKRSRGGEDSTSRGERSKKLGGSGGSNGSSSGKTDSGGGSRRSLHLDKSS SRGGSREYDTGGGSSSSRLHSYSSPSTKNSSGGGESRSSSRGGGGESRSSGAASSAPGGG DGAEYKTLKISELGSQLSDEAVEDGLFHEFKRFGDVSVKISHLSGSGSGDERVAFVNFRR PEDARAAKHARGRLVLYDRPLKIEAVYVSRRRSRSPLDKDTYPPSASVVGASVGGHRHPP GGGGGQRSLSPGGAALGYRDYRLQQLALGRLPPPPPPPLPRDLERERDYPFYERVRPAYS LEPRVGAGAGAAPFREVDEISPEDDQRANRTLFLGNLDITVTESDLRRAFDRFGVITEVD IKRPSRGQTSTYGFLKFENLDMSHRAKLAMSGKIIIRNPIKIGYGKATPTTRLWVGGLGP WVPLAALAREFDRFGTIRTIDYRKGDSWAYIQYESLDAAHAAWTHMRGFPLGGPDRRLRV DFADTEHRYQQQYLQPLPLTHYELVTDAFGHRAPDPLRGARDRTPPLLYRDRDRDLYPDS DWVPPPPPVRERSTRTAATSVPAYEPLDSLDRRRDGWSLDRDRGDRDLPSSRDQPRKRRL PEESGGRHLDRSPESDRPRKRHCAPSPDRSPELSSSRDRYNSDNDRSSRLLLERPSPIRD RRGSLEKSQGDKRDRKNSASAERDRKHRTTAPTEGKSPLKKEDRSDGSAPSTSTASSKLK SPSQKQDGGTAPVASASPKLCLAWQGMLLLKNSNFPSNMHLLQGDLQVASSLLVEGSTGG KVAQLKITQRLRLDQPKLDEVTRRIKVAGPNGYAILLAVPGSSDSRSSSSSAASDTATST QRPLRNLVSYLKQKQAAGVISLPVGGNKDKENTGVLHAFPPCEFSQQFLDSPAKALAKSE EDYLVMIIVRGAS >gi568815597f:110239406_110445606|GENSCAN_predicted_CDS_2|2742_bp atgaagggaaaagagcgctcgccagtgaaggccaaacgctcccgtggtggtgaggactcg acttcccgcggtgagcggagcaagaagttagggggctctggtggcagcaatgggagcagc agcggaaagaccgatagcggcggtgggtcgcggcggagtctccacctggacaagtccagc agtcgaggtggcagccgcgagtatgataccggtgggggcagctccagtagccgcttgcat agttatagctccccgagcaccaaaaattcttcgggcgggggcgaatcgcgcagcagctcc cggggtggaggcggggagtcacgttcctctggggccgcctcctcagctcccggcggcggg gacggcgcggaatacaagactctgaagataagcgagttggggtcccagcttagtgacgaa gcggtggaggacggcctgtttcatgagttcaaacgcttcggtgatgtaagtgtgaaaatc agtcatctgtcgggttctggcagcggggatgagcgggtagcctttgtgaacttccggcgg ccagaggacgcgcgggcggccaagcatgccagaggccgcctggtgctctatgaccggcct ctgaagatagaagctgtgtatgtgagccggcgccgcagccgctcccctttagacaaagat acttatcctccatcagccagtgtggtcggggcctctgtaggtggtcaccggcacccccct ggaggtggtggaggccagagatcactttcccctggtggcgctgctttgggatacagagac taccggctgcagcagttggctcttggccgcctgccccctccacctccgccaccattgcct cgagacctggagagagaaagagactacccgttctatgagagagtgcgccctgcatacagt cttgagccaagggtgggagctggagcaggtgctgctcctttcagagaagtggatgagatt tcacccgaggatgatcagcgagctaaccggacgctcttcttgggcaacctagacatcact gtaacggagagtgatttaagaagggcgtttgatcgctttggagtcatcacagaagtagat atcaagaggccttctcgcggccagactagtacttacggctttctcaaatttgagaactta gatatgtctcaccgggccaaattagcaatgtctggcaaaattataattcggaatcctatc aaaattggttatggtaaagctacacccaccacccgcctctgggtgggaggcctgggacct tgggttcctcttgctgccctggcacgagaatttgatcgatttggcaccatacgcaccata gactaccgaaaaggtgatagttgggcatatatccagtatgaaagcctggatgcagcgcat gctgcctggacccatatgcggggcttcccacttggtggcccagatcgacgccttagagta gactttgccgacaccgaacatcgttaccagcagcagtatctgcagcctctgcccttgact cattatgagctggtgacagatgcttttggacatcgggcaccagaccctttgaggggtgct cgggataggacaccacccttactatacagagatcgtgatagggacctttatcctgactct gattgggtgccacccccacccccagtccgagaacgcagcactcggactgcagctacttct gtgcctgcttacgagccactggatagcctagatcgcaggcgggatggttggtccttggac cgggacagaggtgatcgagatctgcccagcagcagagaccagcctaggaagcgaaggctg cctgaggagagtggaggacgtcatctggataggtctcctgagagtgaccgcccacgaaaa cgtcactgcgctccttctcctgaccgcagtccagaattgagcagtagccgggatcgttac aacagcgacaatgatcgatcttcccgtcttctcttggaaaggccctctccaatcagagac agacgaggtagtttggagaagagccagggtgacaagcgagaccgtaaaaactctgcatca gctgaacgagataggaagcaccggacaactgctcccactgagggaaaaagccctctgaaa aaagaagaccgctctgatgggagtgcacctagcaccagcactgcttcctccaagctgaag tccccgtcccagaaacaggatggggggacagcccctgtggcatcagcctctcccaaactc tgtttggcctggcagggcatgcttctactgaagaacagcaactttccttccaacatgcat ctgttgcagggtgacctccaagtggctagtagtcttcttgtggagggttcaactggaggc aaagtggcccagctcaagatcactcagcgtctccgtttggaccagcccaagttggatgaa gtaactcgacgcatcaaagtagcagggcccaatggttatgccattcttttggctgtgcct ggaagttctgacagccggtcctcctcttcctcagctgcatcagacactgccacttctact cagaggccacttaggaaccttgtgtcctatttaaagcaaaagcaggcagccggggtgatc agcctccctgtggggggcaacaaagacaaggaaaacaccggggtccttcatgccttccca ccttgtgagttctcccagcagttcctggattcccctgccaaggcactggccaaatctgaa gaagattacctggtcatgatcattgtccgtggtgcgtcctaa >gi568815597f:110239406_110445606|GENSCAN_predicted_peptide_3|386_aa MTKTFAIFFVVFQEEFEGTSEQIGWIGSIMSSLRFCAGPLVAIICDILGEKTTSILGAFV VTGGYLISSWATSIPFLCVTMGLLPGLGSAFLYQVAAVVTTKYFKKRLALSTAIARSGMG LTFLLAPFTKFLIDLYDWTGALILFGAIALNLVPSSMLLRPIHIKSENNSGIKDKGSSLS AHGPEAHATETHCHETEESTIKDSTTQKAGLPSKNLTVSQNQSEEFYNGPNRNRLLLKSD EESDKVISWSCKQLFDISLFRNPFFYIFTWSFLLSQLAYFIPTFHLVARAKTLGIDIMDA SYLVSVAGILETVSQIISGWVADQNWIKKYHYHKSYLILCGITNLLAPLATTFPLLMTYT ICFAIFAGGYLALILPVLKCLQEFRD >gi568815597f:110239406_110445606|GENSCAN_predicted_CDS_3|1161_bp atgaccaagacttttgcaattttctttgtggtctttcaagaagagtttgaaggcacctca gagcaaattggttggattggatccatcatgtcatctcttcgtttttgtgcaggtcccctg gttgctattatttgtgacatacttggagagaaaactacctccattcttggggctttcgtt gttactggtggatatctgatcagcagctgggccacaagtattccttttctttgtgtgact atgggacttctacccggtttgggttctgctttcttataccaagtggctgctgtggtaact accaaatacttcaaaaaacgattggctctttctacagctattgcccgttctgggatggga ctgacttttcttttggcaccctttacaaaattcctgatagatctgtatgactggacagga gcccttatattatttggagctatcgcattgaatttggtgccttctagtatgctcttaaga cccatccatatcaaaagtgagaacaattctggtattaaagataaaggcagcagtttgtct gcacatggtccagaggcacatgcaacagaaacacactgccatgagacagaagagtctacc atcaaggacagtactacgcagaaggctggactacctagcaaaaatttaacagtctcacaa aatcaaagtgaagagttctacaatgggcctaacaggaacagactgttattaaagagtgat gaagaaagtgataaggttatttcgtggagctgcaaacaactgtttgacatttctctcttt agaaatcctttcttctacatatttacttggtcttttctcctcagtcagttagcatacttc atccctacctttcacctggtagccagagccaaaacactggggattgacatcatggatgcc tcttaccttgtttctgtagcaggtatccttgagacggtcagtcagattatttctggatgg gttgctgatcaaaactggattaagaagtatcattaccacaagtcttacctcatcctctgc ggcatcactaacctgcttgctcctttagccaccacatttccactacttatgacctacacc atctgctttgccatctttgctggtggttacctggcattgatactgcctgtactgaaatgt ctacaagagttcagagactag >gi568815597f:110239406_110445606|GENSCAN_predicted_peptide_4|60_aa MAGCRFQALPCGEAAKAWREIERSAAAGLGAKPLTARGRQGWRASLSECGARQAHTHPEL >gi568815597f:110239406_110445606|GENSCAN_predicted_CDS_4|183_bp atggcgggctgcaggttccaagccctgccctgcggggaggcagctaaggcctggcgagaa atcgagcgcagcgccgctgctggcctgggtgctaagcccctcactgcccggggccggcag ggctggcgggccagcctctccgagtgcggggcccgccaagcccacacccacccggaactc tag >gi568815597f:110239406_110445606|GENSCAN_predicted_peptide_5|173_aa MEPGAGHLDGHRAGSPSLRQALCDGSAVMFSSKERGRCTVINFVPLEAPLRSTPRSRQVT EACGGEGRAVPLGSEPEWSVGGMEATLEQHLEDTMKNPSIVGVLCTDSQGLNLGCRGTLS DEHAGVISVLAQQAAKLTSDPTDIPVVCLESDNGNIMIQKHDGITVAVHKMAS >gi568815597f:110239406_110445606|GENSCAN_predicted_CDS_5|522_bp atggagccaggtgcaggtcacctcgacggtcaccgcgcggggagcccaagccttcgtcag gctctgtgcgacggaagcgcagtgatgttttccagtaaagaacgcggacgttgcaccgtg atcaattttgtccctttggaggcgccgttacggtccacgccccgctcgcgtcaagtgact gaggcctgtggtggagaaggacgtgccgtgccgctgggttctgagccggagtggtcggtg ggtgggatggaggcgaccttggagcagcacttggaagacacaatgaagaatccctccatt gttggagtcctgtgcacagattcacaaggacttaatctgggttgccgcgggaccctgtca gatgagcatgctggagtgatatctgttctagcccagcaagcagctaagctaacctctgac cccactgatattcctgtggtgtgtctagaatcagataatgggaacattatgatccagaaa cacgatggcatcacggtggcagtgcacaaaatggcctcttga