GENSCAN 1.0 Date run: 8-Nov-116 Time: 12:45:53 Sequence gi568815597r:110263769_110489323 : 225555 bp : 44.85% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.09 PlyA - 352 347 6 1.05 1.08 Term - 996 987 10 2 1 122 50 2 0.048 -2.53 1.07 Intr - 6516 6351 166 1 1 19 94 139 0.405 6.62 1.06 Intr - 7263 7053 211 0 1 97 61 79 0.696 4.59 1.05 Intr - 9885 9761 125 1 2 59 65 42 0.481 -0.70 1.04 Intr - 19082 18921 162 0 0 48 44 134 0.621 4.95 1.03 Intr - 19561 19475 87 2 0 54 75 64 0.165 1.64 1.02 Intr - 29011 28973 39 2 0 84 98 24 0.031 1.20 1.01 Init - 37954 37948 7 1 1 79 95 0 0.012 0.92 1.00 Prom - 52175 52136 40 -2.76 2.00 Prom + 68535 68574 40 -3.76 2.01 Sngl + 75770 78511 2742 1 0 90 39 2080 0.917 196.22 2.02 PlyA + 80485 80490 6 1.05 3.07 PlyA - 80611 80606 6 1.05 3.06 Term - 101887 101861 27 1 0 121 38 20 0.178 -1.63 3.05 Intr - 113393 113182 212 0 2 48 105 182 0.992 14.53 3.04 Intr - 115588 115085 504 2 0 81 110 89 0.687 3.35 3.03 Intr - 117375 117214 162 1 0 90 98 48 0.969 5.95 3.02 Intr - 118027 117884 144 2 0 82 117 17 0.940 4.25 3.01 Init - 119177 119066 112 2 1 59 75 63 0.473 2.47 3.00 Prom - 122046 122007 40 -6.36 4.00 Prom + 126317 126356 40 -3.06 4.01 Init + 132680 132755 76 1 1 98 68 96 0.602 8.03 4.02 Term + 132797 132903 107 0 2 90 37 51 0.535 -1.23 4.03 PlyA + 133383 133388 6 1.05 5.05 PlyA - 133641 133636 6 1.05 5.04 Term - 137815 137755 61 0 1 98 44 63 0.620 0.18 5.03 Intr - 140268 140151 118 1 1 96 92 49 0.672 5.62 5.02 Intr - 142611 142550 62 2 2 45 81 42 0.512 -2.42 5.01 Init - 144098 143818 281 2 2 104 94 71 0.604 5.79 5.00 Prom - 173173 173134 40 -3.86 6.00 Prom + 186998 187037 40 -4.26 6.01 Init + 187449 187520 72 2 0 78 98 37 0.630 3.32 6.02 Intr + 190193 190318 126 1 0 130 73 146 0.991 18.18 6.03 Term + 192464 192583 120 1 0 107 48 123 0.996 8.67 6.04 PlyA + 193568 193573 6 1.05 7.06 PlyA - 194277 194272 6 -0.45 7.05 Term - 194926 194621 306 1 0 66 43 131 0.484 1.52 7.04 Intr - 199955 199793 163 1 1 81 99 2 0.203 0.58 7.03 Intr - 200600 200432 169 0 1 64 69 137 0.489 8.40 7.02 Intr - 202601 202503 99 0 0 54 109 78 0.715 6.58 7.01 Init - 204926 204869 58 2 1 84 95 53 0.984 7.07 7.00 Prom - 212855 212816 40 -1.96 8.00 Prom + 216937 216976 40 -3.96 8.01 Init + 216998 217059 62 1 2 72 109 33 0.984 4.68 8.02 Intr + 219007 219157 151 1 1 92 76 152 0.995 14.56 8.03 Intr + 220146 220263 118 2 1 97 72 144 0.976 13.74 8.04 Intr + 220938 221176 239 1 2 -2 103 151 0.434 4.93 8.05 Intr + 223088 223242 155 1 2 99 12 217 0.794 14.07 8.06 Intr + 224734 224866 133 1 1 42 94 100 0.094 6.65 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:110263769_110489323|GENSCAN_predicted_peptide_1|268_aa MDALTAQAYKLTPREVLWFEFDPTKMHVETDPQCGGVGKWGPVTVFYWTQVHQSLRDLVT VSQWFLNLNNYEHAYKSNVWNCKAKQGPSPGPQQPFEEASHCDQEHIVPRFAGEGQGGHQ WAGRSNTVQIHSPNLVANSLNCHPQDPSIGFQAFLKRSFKRVPFSPPGSKSTEELIFHIN RLLQCPGMCTSELRPPQSQSAGAEKCNATEAYFQRNEANAAGNIQTERTQRGESEGGMTS TRGEDRGRSGTAHDQGKPVGRQCFSRDE >gi568815597r:110263769_110489323|GENSCAN_predicted_CDS_1|807_bp atggatgccctgacagcccaggcctataagctgacacccagagaagtgctatggtttgag tttgaccccaccaaaatgcatgttgaaactgatccccaatgtggcggtgttgggaagtgg ggcccggtgacagttttttactggactcaggttcatcagtctcttagggatttagtgact gtaagtcaatggttcttgaacctgaacaactacgagcatgcttataaatccaatgtctgg aactgcaaagccaagcaagggccatcccctggtccacagcaacccttcgaagaagcctcc cactgtgaccaggagcacatagttccacgatttgctggagagggccagggtggccatcag tgggcagggaggagcaacacagttcaaatccacagcccaaacctggtagccaattctctg aactgtcatcctcaggatccttccattggcttccaagccttcctgaaaagatctttcaag agggttccattcagccccccagggtctaaatctactgaagagctgatcttccacatcaat agactcctccaatgtcctggaatgtgtacctctgaattgaggccaccgcagagccagtct gcaggggctgaaaaatgcaatgcaactgaggcttatttccagaggaatgaggccaatgca gcaggcaacatccaaacagaaaggacccagagaggagagtctgagggcggcatgacttca acaaggggtgaggacagaggtagatcaggaactgctcatgaccaagggaagccagttgga cgtcagtgtttttccagagacgagtag >gi568815597r:110263769_110489323|GENSCAN_predicted_peptide_2|913_aa MKGKERSPVKAKRSRGGEDSTSRGERSKKLGGSGGSNGSSSGKTDSGGGSRRSLHLDKSS SRGGSREYDTGGGSSSSRLHSYSSPSTKNSSGGGESRSSSRGGGGESRSSGAASSAPGGG DGAEYKTLKISELGSQLSDEAVEDGLFHEFKRFGDVSVKISHLSGSGSGDERVAFVNFRR PEDARAAKHARGRLVLYDRPLKIEAVYVSRRRSRSPLDKDTYPPSASVVGASVGGHRHPP GGGGGQRSLSPGGAALGYRDYRLQQLALGRLPPPPPPPLPRDLERERDYPFYERVRPAYS LEPRVGAGAGAAPFREVDEISPEDDQRANRTLFLGNLDITVTESDLRRAFDRFGVITEVD IKRPSRGQTSTYGFLKFENLDMSHRAKLAMSGKIIIRNPIKIGYGKATPTTRLWVGGLGP WVPLAALAREFDRFGTIRTIDYRKGDSWAYIQYESLDAAHAAWTHMRGFPLGGPDRRLRV DFADTEHRYQQQYLQPLPLTHYELVTDAFGHRAPDPLRGARDRTPPLLYRDRDRDLYPDS DWVPPPPPVRERSTRTAATSVPAYEPLDSLDRRRDGWSLDRDRGDRDLPSSRDQPRKRRL PEESGGRHLDRSPESDRPRKRHCAPSPDRSPELSSSRDRYNSDNDRSSRLLLERPSPIRD RRGSLEKSQGDKRDRKNSASAERDRKHRTTAPTEGKSPLKKEDRSDGSAPSTSTASSKLK SPSQKQDGGTAPVASASPKLCLAWQGMLLLKNSNFPSNMHLLQGDLQVASSLLVEGSTGG KVAQLKITQRLRLDQPKLDEVTRRIKVAGPNGYAILLAVPGSSDSRSSSSSAASDTATST QRPLRNLVSYLKQKQAAGVISLPVGGNKDKENTGVLHAFPPCEFSQQFLDSPAKALAKSE EDYLVMIIVRGAS >gi568815597r:110263769_110489323|GENSCAN_predicted_CDS_2|2742_bp atgaagggaaaagagcgctcgccagtgaaggccaaacgctcccgtggtggtgaggactcg acttcccgcggtgagcggagcaagaagttagggggctctggtggcagcaatgggagcagc agcggaaagaccgatagcggcggtgggtcgcggcggagtctccacctggacaagtccagc agtcgaggtggcagccgcgagtatgataccggtgggggcagctccagtagccgcttgcat agttatagctccccgagcaccaaaaattcttcgggcgggggcgaatcgcgcagcagctcc cggggtggaggcggggagtcacgttcctctggggccgcctcctcagctcccggcggcggg gacggcgcggaatacaagactctgaagataagcgagttggggtcccagcttagtgacgaa gcggtggaggacggcctgtttcatgagttcaaacgcttcggtgatgtaagtgtgaaaatc agtcatctgtcgggttctggcagcggggatgagcgggtagcctttgtgaacttccggcgg ccagaggacgcgcgggcggccaagcatgccagaggccgcctggtgctctatgaccggcct ctgaagatagaagctgtgtatgtgagccggcgccgcagccgctcccctttagacaaagat acttatcctccatcagccagtgtggtcggggcctctgtaggtggtcaccggcacccccct ggaggtggtggaggccagagatcactttcccctggtggcgctgctttgggatacagagac taccggctgcagcagttggctcttggccgcctgccccctccacctccgccaccattgcct cgagacctggagagagaaagagactacccgttctatgagagagtgcgccctgcatacagt cttgagccaagggtgggagctggagcaggtgctgctcctttcagagaagtggatgagatt tcacccgaggatgatcagcgagctaaccggacgctcttcttgggcaacctagacatcact gtaacggagagtgatttaagaagggcgtttgatcgctttggagtcatcacagaagtagat atcaagaggccttctcgcggccagactagtacttacggctttctcaaatttgagaactta gatatgtctcaccgggccaaattagcaatgtctggcaaaattataattcggaatcctatc aaaattggttatggtaaagctacacccaccacccgcctctgggtgggaggcctgggacct tgggttcctcttgctgccctggcacgagaatttgatcgatttggcaccatacgcaccata gactaccgaaaaggtgatagttgggcatatatccagtatgaaagcctggatgcagcgcat gctgcctggacccatatgcggggcttcccacttggtggcccagatcgacgccttagagta gactttgccgacaccgaacatcgttaccagcagcagtatctgcagcctctgcccttgact cattatgagctggtgacagatgcttttggacatcgggcaccagaccctttgaggggtgct cgggataggacaccacccttactatacagagatcgtgatagggacctttatcctgactct gattgggtgccacccccacccccagtccgagaacgcagcactcggactgcagctacttct gtgcctgcttacgagccactggatagcctagatcgcaggcgggatggttggtccttggac cgggacagaggtgatcgagatctgcccagcagcagagaccagcctaggaagcgaaggctg cctgaggagagtggaggacgtcatctggataggtctcctgagagtgaccgcccacgaaaa cgtcactgcgctccttctcctgaccgcagtccagaattgagcagtagccgggatcgttac aacagcgacaatgatcgatcttcccgtcttctcttggaaaggccctctccaatcagagac agacgaggtagtttggagaagagccagggtgacaagcgagaccgtaaaaactctgcatca gctgaacgagataggaagcaccggacaactgctcccactgagggaaaaagccctctgaaa aaagaagaccgctctgatgggagtgcacctagcaccagcactgcttcctccaagctgaag tccccgtcccagaaacaggatggggggacagcccctgtggcatcagcctctcccaaactc tgtttggcctggcagggcatgcttctactgaagaacagcaactttccttccaacatgcat ctgttgcagggtgacctccaagtggctagtagtcttcttgtggagggttcaactggaggc aaagtggcccagctcaagatcactcagcgtctccgtttggaccagcccaagttggatgaa gtaactcgacgcatcaaagtagcagggcccaatggttatgccattcttttggctgtgcct ggaagttctgacagccggtcctcctcttcctcagctgcatcagacactgccacttctact cagaggccacttaggaaccttgtgtcctatttaaagcaaaagcaggcagccggggtgatc agcctccctgtggggggcaacaaagacaaggaaaacaccggggtccttcatgccttccca ccttgtgagttctcccagcagttcctggattcccctgccaaggcactggccaaatctgaa gaagattacctggtcatgatcattgtccgtggtgcgtcctaa >gi568815597r:110263769_110489323|GENSCAN_predicted_peptide_3|386_aa MTKTFAIFFVVFQEEFEGTSEQIGWIGSIMSSLRFCAGPLVAIICDILGEKTTSILGAFV VTGGYLISSWATSIPFLCVTMGLLPGLGSAFLYQVAAVVTTKYFKKRLALSTAIARSGMG LTFLLAPFTKFLIDLYDWTGALILFGAIALNLVPSSMLLRPIHIKSENNSGIKDKGSSLS AHGPEAHATETHCHETEESTIKDSTTQKAGLPSKNLTVSQNQSEEFYNGPNRNRLLLKSD EESDKVISWSCKQLFDISLFRNPFFYIFTWSFLLSQLAYFIPTFHLVARAKTLGIDIMDA SYLVSVAGILETVSQIISGWVADQNWIKKYHYHKSYLILCGITNLLAPLATTFPLLMTYT ICFAIFAGGYLALILPVLKCLQEFRD >gi568815597r:110263769_110489323|GENSCAN_predicted_CDS_3|1161_bp atgaccaagacttttgcaattttctttgtggtctttcaagaagagtttgaaggcacctca gagcaaattggttggattggatccatcatgtcatctcttcgtttttgtgcaggtcccctg gttgctattatttgtgacatacttggagagaaaactacctccattcttggggctttcgtt gttactggtggatatctgatcagcagctgggccacaagtattccttttctttgtgtgact atgggacttctacccggtttgggttctgctttcttataccaagtggctgctgtggtaact accaaatacttcaaaaaacgattggctctttctacagctattgcccgttctgggatggga ctgacttttcttttggcaccctttacaaaattcctgatagatctgtatgactggacagga gcccttatattatttggagctatcgcattgaatttggtgccttctagtatgctcttaaga cccatccatatcaaaagtgagaacaattctggtattaaagataaaggcagcagtttgtct gcacatggtccagaggcacatgcaacagaaacacactgccatgagacagaagagtctacc atcaaggacagtactacgcagaaggctggactacctagcaaaaatttaacagtctcacaa aatcaaagtgaagagttctacaatgggcctaacaggaacagactgttattaaagagtgat gaagaaagtgataaggttatttcgtggagctgcaaacaactgtttgacatttctctcttt agaaatcctttcttctacatatttacttggtcttttctcctcagtcagttagcatacttc atccctacctttcacctggtagccagagccaaaacactggggattgacatcatggatgcc tcttaccttgtttctgtagcaggtatccttgagacggtcagtcagattatttctggatgg gttgctgatcaaaactggattaagaagtatcattaccacaagtcttacctcatcctctgc ggcatcactaacctgcttgctcctttagccaccacatttccactacttatgacctacacc atctgctttgccatctttgctggtggttacctggcattgatactgcctgtactgaaatgt ctacaagagttcagagactag >gi568815597r:110263769_110489323|GENSCAN_predicted_peptide_4|60_aa MAGCRFQALPCGEAAKAWREIERSAAAGLGAKPLTARGRQGWRASLSECGARQAHTHPEL >gi568815597r:110263769_110489323|GENSCAN_predicted_CDS_4|183_bp atggcgggctgcaggttccaagccctgccctgcggggaggcagctaaggcctggcgagaa atcgagcgcagcgccgctgctggcctgggtgctaagcccctcactgcccggggccggcag ggctggcgggccagcctctccgagtgcggggcccgccaagcccacacccacccggaactc tag >gi568815597r:110263769_110489323|GENSCAN_predicted_peptide_5|173_aa MEPGAGHLDGHRAGSPSLRQALCDGSAVMFSSKERGRCTVINFVPLEAPLRSTPRSRQVT EACGGEGRAVPLGSEPEWSVGGMEATLEQHLEDTMKNPSIVGVLCTDSQGLNLGCRGTLS DEHAGVISVLAQQAAKLTSDPTDIPVVCLESDNGNIMIQKHDGITVAVHKMAS >gi568815597r:110263769_110489323|GENSCAN_predicted_CDS_5|522_bp atggagccaggtgcaggtcacctcgacggtcaccgcgcggggagcccaagccttcgtcag gctctgtgcgacggaagcgcagtgatgttttccagtaaagaacgcggacgttgcaccgtg atcaattttgtccctttggaggcgccgttacggtccacgccccgctcgcgtcaagtgact gaggcctgtggtggagaaggacgtgccgtgccgctgggttctgagccggagtggtcggtg ggtgggatggaggcgaccttggagcagcacttggaagacacaatgaagaatccctccatt gttggagtcctgtgcacagattcacaaggacttaatctgggttgccgcgggaccctgtca gatgagcatgctggagtgatatctgttctagcccagcaagcagctaagctaacctctgac cccactgatattcctgtggtgtgtctagaatcagataatgggaacattatgatccagaaa cacgatggcatcacggtggcagtgcacaaaatggcctcttga >gi568815597r:110263769_110489323|GENSCAN_predicted_peptide_6|105_aa MRGATRVSIMLLLVTVSDCAVITGACERDVQCGAGTCCAISLWLRGLRMCTPLGREGEEC HPGSHKVPFFRKRKHHTCPCLPNLLCSRFPDGRYRCSMDLKNINF >gi568815597r:110263769_110489323|GENSCAN_predicted_CDS_6|318_bp atgagaggtgccacgcgagtctcaatcatgctcctcctagtaactgtgtctgactgtgct gtgatcacaggggcctgtgagcgggatgtccagtgtggggcaggcacctgctgtgccatc agcctgtggcttcgagggctgcggatgtgcaccccgctggggcgggaaggcgaggagtgc caccccggcagccacaaggtccccttcttcaggaaacgcaagcaccacacctgtccttgc ttgcccaacctgctgtgctccaggttcccggacggcaggtaccgctgctccatggacttg aagaacatcaatttttag >gi568815597r:110263769_110489323|GENSCAN_predicted_peptide_7|264_aa MRLSDIGKGESPDDHRGQAAAKTLACQTSSPEIQIQWLWDAAQDPGYFTSTPASGELKHI REKIVRSFLDILLVYLTEEIPPGNASVVCWLQHVGAIFIIAIVQMGKLRYTETNMSVTDA QQSLSSRQGAETGKDLLDKRTINHILPVTWTKWSSQDPSTAPQGGSQFRKGSTKVHQAAV NGTSRKAMNVKGQAMVKMESWTWKGTLYPGPEPETKGESDSRAARRSQVTTAGDNVRKRR QKQKQKQTKKGPCGGLAWLKWVLE >gi568815597r:110263769_110489323|GENSCAN_predicted_CDS_7|795_bp atgaggctcagtgacattggcaagggtgaaagcccagatgaccacaggggccaggcagct gctaaaactctagcttgccagacttcctctccagaaatccagatccagtggctctgggat gcggcacaggaccctgggtacttcacaagcaccccagcttctggggaattgaagcacatc cgagaaaagattgttcgttccttcctggacatcttattggtttacctcactgaagagatc cctccaggtaatgctagtgtggtgtgctggttacaacacgtaggtgctatttttattatt gccattgtacagatggggaaactgaggtacacagaaacaaatatgtctgtgactgacgcg cagcagagtctatcgagtaggcagggggctgagacaggaaaggatcttttggacaaaagg accataaatcatatccttcctgtgacctggaccaagtggtcttcccaagacccaagcact gcccctcagggagggtcacaatttagaaagggctccacaaaagtccaccaggcagcagtc aatggtacctcgagaaaagctatgaatgtcaaaggacaagccatggtcaaaatggaatcc tggacgtggaagggaactttatatccaggacctgagcctgaaaccaagggagaaagcgac tcaagggctgcaagaaggtcccaagtcaccacagcaggtgacaatgtccgtaaaaggaga caaaaacaaaaacaaaaacaaacaaagaaaggaccctgtgggggcctggcctggcttaag tgggtcctggagtaa >gi568815597r:110263769_110489323|GENSCAN_predicted_peptide_8|286_aa MRGLVVFLAVFALSEVNAITRVPLHKGKSLRRALKERRLLEDFLRNHHYAVSRKHSSSGV VASESLTNYLDCQYFGKIYIGTLPQKFTLVFDTGSPDIWVPSVYCNSDACQRIASIEAIF LQGLLRGSALAGTVSGEKILRAPLTLGLCFRKPPTLRSVQVLHPQNMGKSLSIQYGTGSM RGLLGYDTVTDLVEVMILLQVSNIVDPHQTVGLSTQEPGDVFTYSEFDGILGLAYPSLAS EYALRLGFRNDQGSMLTLRAIDLSYYTGSLHWIPMTARILAVHCGQ >gi568815597r:110263769_110489323|GENSCAN_predicted_CDS_8|858_bp atgaggggccttgtggtattccttgcagtctttgctctctctgaggtcaatgccatcacc agggttcctctgcacaaagggaagtcgctgaggagggccctgaaggagcgcaggctcctg gaggacttcctgaggaatcaccattatgcagtcagcaggaagcactccagctctggggtg gtggccagcgagtctctgaccaactacctggattgtcagtactttgggaagatctacatc gggacccttccccagaagttcaccttggtgtttgatacaggctccccggatatctgggtg ccctctgtctactgcaacagtgatgcctgtcagagaattgccagtattgaggccatattc ttacaaggcctgctgaggggttcggccctggctggcaccgtgagtggtgagaagattcta agggccccactaactctgggtctgtgtttcagaaaaccaccaacgcttcgatccgtccaa gtcctccacccacagaacatgggcaagtccctgtccatccagtatggcacaggcagcatg cggggcttgctgggctatgacactgtcaccgacctagtggaggtcatgattctcttgcag gtctccaacattgtggacccccaccagactgtgggtctgagcacccaggaacctggcgac gtcttcacctactccgagtttgatgggatcctggggctggcctatccctctcttgcctct gagtacgcgctgcgccttggtttcaggaatgaccaggggagcatgctcacgctgagggcc attgatctgtcgtactacacaggctccctgcactggatacccatgactgcaagaatactg gcagttcactgtggacag