GENSCAN 1.0 Date run: 5-Nov-116 Time: 10:53:58

Sequence gi568815575f:9553587_9815005 : 261419 bp : 45.70% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   6703   6924  222  0  0   61   92   109 0.561   6.92
 1.02 Term +   7930   8043  114  0  0   68   48    65 0.528  -0.93
 1.03 PlyA +   8405   8410    6                               1.05

 2.04 PlyA -   9128   9123    6                               1.05
 2.03 Term -  23986  23852  135  1  0   60   53    98 0.121   1.52
 2.02 Intr -  33681  33516  166  2  1   74   35   144 0.120   7.66
 2.01 Init -  37510  37449   62  1  2  100   36    79 0.519   2.64
 2.00 Prom -  44087  44048   40                              -4.86

 3.05 PlyA -  45106  45101    6                               1.05
 3.04 Term -  49448  49315  134  0  2   89   37    79 0.632   1.15
 3.03 Intr -  50162  50078   85  2  1  102  103    28 0.799   5.09
 3.02 Intr -  64172  64098   75  2  0   43   76    74 0.508   1.41
 3.01 Init -  75496  75458   39  1  0   76   72    73 0.659   4.69
 3.00 Prom -  75662  75623   40                              -5.16

 4.00 Prom +  79296  79335   40                              -3.16
 4.01 Init +  81467  81520   54  1  0   92   52    77 0.346   4.03
 4.02 Intr +  82212  82482  271  2  1   60   -3   134 0.157  -1.39
 4.03 Intr +  86687  86774   88  0  1  118  121    87 0.875  14.03
 4.04 Intr +  99979 100103  125  1  2   61   69    83 0.380   3.93
 4.05 Intr + 100629 100736  108  1  0   71  110    93 0.802  10.06
 4.06 Intr + 130457 130602  146  0  2  125   94   205 0.983  24.80
 4.07 Intr + 134431 134689  259  0  1  109   70   365 0.838  33.84
 4.08 Intr + 137993 138125  133  0  1   70   90    46 0.861   2.70
 4.09 Intr + 138527 138668  142  2  1   66   75    82 0.760   5.06
 4.10 Intr + 139563 139626   64  2  1   76  101    48 0.812   3.19
 4.11 Intr + 149920 150015   96  2  0   79   38    84 0.096   2.48
 4.12 Intr + 151407 151528  122  1  2   82   98    81 0.563   8.71
 4.13 Intr + 155662 155736   75  0  0  131   75    79 0.912  10.61
 4.14 Intr + 156047 156174  128  1  2   87  116   133 0.999  15.48
 4.15 Intr + 158025 158190  166  0  1   64  115   186 0.998  18.86
 4.16 Intr + 161316 161417  102  2  0  100   58   153 0.711  13.87
 4.17 Intr + 163533 163572   40  2  1   93   86    10 0.129  -0.90
 4.18 Intr + 163816 164026  211  2  1   58   79   104 0.067   4.47
 4.19 Term + 167052 167193  142  0  1   99   36    82 0.111   1.50
 4.20 PlyA + 170070 170075    6                               1.05

 5.12 PlyA - 171779 171774    6                               1.05
 5.11 Term - 172254 172160   95  1  2  126   49    62 0.598   4.09
 5.10 Intr - 186133 185875  259  1  1  126   74    95 0.485   8.94
 5.09 Intr - 187869 187752  118  2  1   86   78    44 0.955   3.67
 5.08 Intr - 190087 189979  109  2  1   88   19    87 0.754   1.14
 5.07 Intr - 192361 192305   57  2  0   81   80    34 0.613   0.76
 5.06 Intr - 192567 192458  110  2  2   76   99   161 0.999  15.93
 5.05 Intr - 195080 194988   93  1  0   87  111    85 0.026   9.78
 5.04 Intr - 205840 205746   95  1  2   78   51    55 0.089  -0.44
 5.03 Intr - 207240 207131  110  1  2   53  101    50 0.443   2.80
 5.02 Intr - 212212 211982  231  2  0   39   82   321 0.002  24.34
 5.01 Init - 226276 226216   61  1  1   83   46   109 0.947   7.61
 5.00 Prom - 230419 230380   40                              -5.46

 6.00 Prom + 230577 230616   40                              -6.86
 6.01 Init + 232960 233189  230  0  2  105   75   366 0.517  32.84
 6.02 Intr + 233378 233755  378  2  0   98    3   180 0.306   4.58
 6.03 Intr + 236779 236971  193  1  1   26   68   145 0.336   5.79
 6.04 Intr + 244565 244613   49  1  1   25   58    77 0.017  -3.35
 6.05 Term + 249934 250109  176  2  2  102   47    44 0.175  -0.38
 6.06 PlyA + 251661 251666    6                               1.05

 7.05 PlyA - 255569 255564    6                               1.05
 7.04 Term - 257139 256889  251  1  2   91   48    72 0.709  -0.63
 7.03 Intr - 258877 258331  547  1  1   19  108   228 0.965  10.36
 7.02 Intr - 259114 259032   83  2  2   61   77    86 0.994   4.16
 7.01 Init - 259402 259186  217  1  1   45   21   184 0.640   6.25

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr + 139736 139833   98  0  2   95   87    31 0.890   3.43
S.002 Init - 195058 194988   71  1  2   93  111   115 0.927  12.84
S.003 Init - 212231 211982  250  2  1   63   82   360 0.938  28.53
S.004 Term - 224868 224741  128  1  2   74   38    60 0.830  -1.96

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815575f:9553587_9815005|GENSCAN_predicted_peptide_1|111_aa
VRTGGTEDIWPKDTTAGTGWMVIRPQSLWRSICAVCCQASCIPDRVTGSVRVTSRKGSEA
KLPALLLTFTLESEDEYRNMFSGVKVKPCEIHTGEAADSSSSRWHEYGGGG

>gi568815575f:9553587_9815005|GENSCAN_predicted_CDS_1|336_bp
gtgagaacaggaggtactgaggacatctggcccaaggatactaccgctggcacggggtgg
atggtgattcgccctcagtccctctggcgcagcatctgtgctgtttgctgccaggccagc
tgcatcccagatagagtgacaggatctgttcgcgtcacatccagaaaaggctctgaggcc
aagctccctgcgttgcttctcactttcactctggaatcagaggatgagtatagaaacatg
ttctctggggtgaaagtgaagccttgtgaaattcatacaggtgaagctgctgatagcagc
tcttcccgatggcatgaatatggaggtggcggttga

>gi568815575f:9553587_9815005|GENSCAN_predicted_peptide_2|120_aa
MALPKIQPSFLALGSVLGRPWKSSKRKPRTADSSHSRHRWVSALPSDQKEGNLLRDEASS
ARCLQGSFGTFQGCTRMPEAPTVTTKNVSRHCEMAPGGWGGYKVTRFEKHYSKGIQYQRF

>gi568815575f:9553587_9815005|GENSCAN_predicted_CDS_2|363_bp
atggccttgcccaagatccaaccctcattcctggctctgggatctgtcctgggtcgtccc
tggaaaagctccaagcgcaagccgcggacagcagacagctctcactcccgacacagatgg
gtttccgccctgccatccgaccagaaggagggcaacctgctcagggatgaggcctccagc
gctcggtgcctccagggctccttcggtaccttccagggctgcacgcggatgccagaagca
cccactgtcaccaccaaaaatgtctccagacattgtgagatggcccctggagggtgggga
ggatacaaagtcacccgctttgagaagcactattctaaaggaattcagtaccagcggttt
tga

>gi568815575f:9553587_9815005|GENSCAN_predicted_peptide_3|110_aa
MQKKKIRIPQVEQELSVAYTFRHSKTLMGITEGSVTPQNWETIHVLSQPVGGTLLQQPQE
TNHHQHGCLKSKQIFLYSETSPMYHSMIYALGLLVHNDCSQVDEDKHFNI

>gi568815575f:9553587_9815005|GENSCAN_predicted_CDS_3|333_bp
atgcagaagaagaagatccgtattccacaagtagagcaggaactcagcgttgcttatact
ttccggcactccaagactcttatgggcatcactgaaggaagtgtgaccccacagaactgg
gagaccatacatgttttaagccagccagttggtggcacgttgctacagcagccccaggaa
acaaaccaccaccaacatggatgtctgaaaagtaaacaaatcttcctttactctgagact
tctccaatgtatcattcaatgatttatgcactggggcttctcgttcacaacgactgctcc
caggtggatgaagacaagcatttcaacatttaa

>gi568815575f:9553587_9815005|GENSCAN_predicted_peptide_4|823_aa
MGAGGAVSRRLLPVAAELEETQVKKDKGRTMPGNSTASHCPKDMNVHSSMVDPSQVPVSC
CLEKHNVVQPRSSIVFSHKRSEALERATTWMLWENMLSERSQMQEVTYCWSLQTASHFYD
RMLGSLLVHRSIHEVTIRNIEVTCSMTELAGASSSCCHRPAGRGAMQSVLHHFQRLRGRE
GGSHFINTSSPRGEAKMSITSDEVNFLVYRYLQESGFSHSAFTFGIESHISQSNINGTLV
PPAALISILQKGLQYVEAEISINEDGTVFDGRPIESLSLIDAVMPDVVQTRQQAFREKLA
QQQASAAAAAAAATAAATAATTTSAGVSHQNPSKNREATVNGEENRAHSVNNHAKPMEID
GEVEIPSSKATVLRGHESEVFICAWNPVSDLLASGSGDSTARIWNLNENSNGGSTQLVLR
HCIREGGHDVPSNKDVTSLDWNTNGTLLATGSYDGFARIWTEDGRRVTPAVRGRLRGGGT
DARSQPQTHSLLEDRTPALDVDWQNNTTFASCSTDMCIHVCRLGCDRPVKTFQGHTNEVN
AIKWDPSGMLLASCSDDMTLKIWSMKQEVCIHDLQAHNKEIYTIKWSPTGPATSNPNSNI
MLASASFDSTVRLWDIERGVCTHTLTKHQEPVYSVAFSPDGKYLASGSFDKCVHIWNTQS
GNLVHSYRGTGGIFEVCWNARGDKVGASASDGSLLQILGLKGTLARRKVVRKTLAGQGSV
FGVSSEIALQPVREVPAMPAGRRTSHLLVCMHLGSQQESSEEPRALWPQGSIGRTISPCQ
VEQVCCQVVDLGHPEGYCRTLGQLGKNRMETVGRQQSMRQFPG

>gi568815575f:9553587_9815005|GENSCAN_predicted_CDS_4|2472_bp
atgggggctggaggggccgtgtcccggcgcctccttcctgtggctgctgagttggaggag
actcaggtaaagaaggataaaggcagaacgatgccaggcaattccactgctagccattgc
cccaaggacatgaatgttcatagcagcatggttgacccaagccaggttcctgtcagctgc
tgcctggaaaaacacaacgtggttcagccacgcagtagcatagtattcagccataaaagg
agtgaagccctggaacgtgctacaacatggatgctctgggaaaacatgctcagtgaaaga
agccagatgcaagaggtcacatattgctggtccctgcagacggcttcccatttttatgac
cggatgctgggaagcctgctggtccacaggtccatacatgaagtcaccataagaaacatc
gaggtgacatgtagcatgaccgagctcgctggcgcctcttcatcgtgctgccaccgccct
gcaggaagaggggccatgcagtcagtcttgcaccactttcaacgtttgcgagggagagag
ggtggttcccacttcatcaacacctcatcgccgcgaggtgaggctaagatgagcataacc
agtgacgaggtgaactttctggtgtatcggtatctccaggagtcaggtttttcccactcg
gctttcacgtttgggattgagagccacatcagccagtccaacatcaatgggacgctagtg
ccaccggccgccctcatctccattctccagaagggcctgcagtatgtagaggccgagatc
agtatcaacgaggatggcacagtgttcgacggccgccccatagagtccctgtcactgata
gacgccgtgatgcccgacgtggtgcagacgcggcagcaggcattccgagagaagctcgct
cagcagcaagccagtgcggcggcggcggcggctgcggccacggcagcagcgacagcagcc
accacgacctcagccggcgtttcccaccaaaatccatcgaagaacagagaggccacggtg
aatggggaagagaacagagcacattcagtcaataatcacgcgaagccaatggaaatagat
ggagaggttgagattccatccagcaaagccacagtccttcggggccatgagtctgaggtg
ttcatttgtgcctggaatcctgtcagtgatttgctagcctccggatctggagactcaact
gcaaggatatggaacctgaatgagaatagcaacgggggctccacccagctcgtgttgagg
cactgtatacgagaggggggccatgacgtcccgagtaacaaagacgtcacctcactggac
tggaataccaatggaacactcttggctacgggttcatatgacggttttgcaagaatatgg
acggaagatggaaggcgggtgacccctgcagtccgtggacgcctgagaggaggtggaaca
gatgctcgctcccagccgcagacccactctctgctagaggacaggacccctgcccttgat
gtggactggcagaacaacacgacctttgcctcctgtagcacagacatgtgtatccatgtg
tgcaggctcggctgtgaccgcccagtcaaaaccttccagggacacacaaacgaggtcaac
gccatcaaatgggatccgtctggaatgttgctggcatcctgctcggatgacatgacattg
aagatctggagcatgaaacaggaggtgtgcatccatgatcttcaggctcacaataaagag
atctacaccatcaagtggagccccactgggcccgccaccagcaacccaaactccaacatc
atgttggcaagtgcttcgtttgattctacggtgcgactgtgggacatagaacgaggcgtc
tgcacccacacgctcacgaagcatcaggagcctgtctatagcgtagctttcagccctgat
gggaagtacttggccagtggatccttcgacaagtgcgtccatatctggaatactcagagt
ggaaatcttgtccacagctaccgaggcactggcggcatcttcgaggtgtgctggaacgcc
cgaggagacaaagtgggtgccagcgcgtccgacggctctttgcttcagatcttaggtctc
aagggcactttggcgcgtagaaaggtggtcaggaaaactctggctggacaagggtcagtc
ttcggggtcagcagcgagattgctctgcagccagtgagggaggtccccgccatgccggct
ggaagaagaacctcgcatcttttggtgtgtatgcaccttggcagccagcaggaaagcagc
gaagagccgcgcgccctctggcctcaagggagcattggcagaaccataagcccctgccaa
gtggagcaagtgtgttgccaggtggtggatttggggcaccctgaggggtactgcagaacc
ctgggacagctcgggaagaaccgcatggaaacagtgggcagacaacaaagcatgcggcag
ttcccaggataa

>gi568815575f:9553587_9815005|GENSCAN_predicted_peptide_5|445_aa
MVAECKENKISSGGTKKLQNWTFCCPTRDAATQLVLSFQPRAFHALCLGSGGLRLALGLL
QLLPGRRPAGPGSPATSPPASVRILRAAAACDLLGCLGMVIRSTVWLGFPNFVDSVSDMN
HTEIWPAAFCVGSAMWIQLLYSACFWWLFCYAVDAYLVIRRSAGLSTILLYHIMAWGLAT
LLCVEGAAMLYYPSVSRCERGLDHAIPHYVTMYLPLLLVLVANPILFQKTVTAGQLTSFV
ILQRHQQNTAAEMASLLKGRQGIYTENERRMGAVIKIRFFKIMLVLIICWLSNIINESLL
FYLEMQTDINGGSLKPVRTAAKTTWFIMGILNPAQGFLLSLAFYGWTGCSLGFQSPRKEI
QWESLTTSAAEGAHPSPLMPHENPASGKVSQVGGQTSDEALSMLSEGNALSATAGSDAST
IEIHTASESCNKNEGDPALPTHGDL

>gi568815575f:9553587_9815005|GENSCAN_predicted_CDS_5|1338_bp
atggtggctgagtgcaaggagaacaagatctccagtggaggaaccaagaagctgcagaac
tggaccttctgctgccccacgcgggacgcagccacgcagctcgtgctgagcttccagccg
cgggccttccacgcgctctgcctgggcagcggcgggctccgcttggcgctgggccttctg
cagctgctgcccggccgccggcccgcgggccccgggtcccccgcgacgtccccgccggcc
tcggtccgcatcctgcgcgctgccgctgcctgcgaccttctcggctgcctgggtatggtg
atccggtccaccgtgtggttaggattcccaaattttgttgacagcgtctcggatatgaac
cacacggaaatttggcctgctgctttctgcgtggggagtgcgatgtggatccagctgttg
tacagtgcctgcttctggtggctgttttgctatgcagtggatgcttatctggtgatccgg
agatcggcaggactgagcaccatcctgctgtatcacatcatggcgtggggcctggccacc
ctgctctgtgtggagggagccgccatgctctactacccttccgtgtccaggtgtgagcgg
ggcctggaccacgccatcccccactatgtcaccatgtacctgcccctgctgctggttctc
gtggcgaaccccatcctgttccaaaagacagtgactgcaggacagttgacctcatttgtc
atcctccagagacatcagcagaacacagctgcagaaatggcctctttacttaaaggaaga
caaggcatttacacggagaacgagaggaggatgggagccgtgatcaagatccgatttttc
aaaatcatgctggttttaattatttgttggttgtcgaatatcatcaatgaaagcctttta
ttctatcttgagatgcaaacagatatcaatggaggttctttgaaacctgtcagaactgca
gccaagaccacatggtttattatgggaatcctgaatccagcccagggatttctcttgtct
ttggccttctacggctggacaggatgcagcctgggttttcagtctcccaggaaggagatc
cagtgggaatcactgaccacctcggctgctgagggggctcacccatccccactgatgccc
catgaaaaccctgcttccgggaaggtgtctcaagtgggtgggcagacttctgacgaagcc
ctgagcatgctgtctgaaggtaatgccctgtctgccacagcaggttctgatgccagcaca
attgaaattcacactgcaagtgaatcctgcaacaaaaatgagggtgaccctgctctccca
acccatggagacctatga

>gi568815575f:9553587_9815005|GENSCAN_predicted_peptide_6|341_aa
MEGAEPRARPERLAEAETRAADGGRLVEVQLSGGAPWGFTLKGGREHGEPLVITKVRRPR
GAGADSRELAALRGASRGLAGSSGAPGPASGGLHLRGCLEASEGTCPRAVPTYVCLSGVP
GSAPLGPVPVITTKSRVAAVASATSRFPSAHQCLLTFGFQSQIHCALDTCANLEKPESLL
PKSRTRVLIIIKKFQMYLIRNVVANQALCEISVKVLKKRDLMNSAVGQERVTLNFVAAGG
FLCPDAVCGQHLHEGVLLLLQVSENQWVLAAYKEELGQPVEEAGLQRRVPGAFPTRWCSV
PPVRKKAELTATHYQIQEPLLSSWEGGFDEVLMNLGLVFTK

>gi568815575f:9553587_9815005|GENSCAN_predicted_CDS_6|1026_bp
atggagggcgccgagccccgcgcgcggcccgagcgcctggccgaggccgagacgcgggcg
gcggacggcgggcgcctggtggaggtgcagctgagcggcggcgccccgtggggcttcacc
ctgaagggcggccgcgagcacggcgagccgctggtcatcaccaaggtaaggcggccgcgg
ggcgcgggcgctgacagccgggagctggccgccctgcggggcgcgtcgaggggcctggcc
gggtcctctggcgctcccgggcccgccagtggcggcctgcacctgcggggctgcctggag
gccagcgaaggaacgtgtccccgagccgtccccacctatgtttgcctctctggggtccct
ggttcggcaccccttggacccgtgcctgtgatcactacgaaatccagagtagcagctgtc
gccagcgcaacttctcgattcccttcagcccatcaatgtttactgactttcggcttccag
tcccagatccactgcgcactggatacctgcgcaaacttagagaaacctgaaagcctgctt
cctaagagcaggacccgagttctaataataataaagaaattccagatgtatctcatcagg
aatgttgtagccaaccaagctctttgtgaaatatcagtaaaggttttgaaaaagagggac
ctgatgaattcagcagtgggtcaggagcgggtgactctcaactttgtggctgctggtgga
tttttgtgcccagatgcagtgtgcggtcaacatctgcacgagggagtgctgctgcttctc
caggtgtctgagaaccagtgggtactggccgcctacaaagaggagttaggacagcctgtg
gaggaagctggactccagagaagggtgcccggcgcttttcccacccgatggtgctccgtg
cctcctgtcaggaagaaagcagaactgactgccacacactaccagattcaggagcctctc
ctgtcttcctgggaaggtggttttgatgaggtactgatgaacttgggcctggtttttaca
aaatga

>gi568815575f:9553587_9815005|GENSCAN_predicted_peptide_7|365_aa
MTLDYRKLHQVVISVAAAVPDVVSLLEQTNTSPGTWYAAIDLANAFFSIPVHKAHQKQFV
FSSNIPLLSYLSDITLVHCIDDIIPIGSSEQEVANTLDLLQERGTTTLSGPIWILEATPH
LGVLLQPIYRVTRKAASFEWGPELEKALQQVQAAVQAALPLGPYDPMVPEVRVADKDAVW
SLWQAPISESQWRPLGFWSKALPSSADNYSPFEIKLLACYWALAETEHLTMGHQVTMRPE
LPIMNWMLSGPSSHKAGCAQQHSIIKWKWYIRDRAQAGPEGTTRIHESRNQGVQVEVAKL
TITPSDPLAKFLLPVPATLRSAGLGGLSSRGRNAATRRHNNDSIKLEVQIATWVLWAPPT
FKSTG

>gi568815575f:9553587_9815005|GENSCAN_predicted_CDS_7|1098_bp
atgacactggattatcgtaagcttcaccaagtggtgatttcagttgcagctgctgtacca
gatgtggtttcactgcttgagcaaactaacacatctcctggtacctggtatgcggccatt
gacttggcaaatgcctttttctccattcctgtccacaaggcccaccagaagcaatttgtc
ttcagcagcaatatacctttactgtcctacctcagtgacatcacactggtccactgcatt
gatgatattataccaattggatccagtgaacaagaagtagcaaacacactggacttattg
caagaaagaggcacaacaacgcttagtgggcctatttggattttggaggcaacacctcat
ttgggtgtgttactccaacccatttatcgagtgacccgaaaggctgccagttttgaatgg
ggtccagaactggagaaggctctgcaacaggtccaggctgctgtgcaagctgctctgcca
cttgggccgtatgatccaatggtgcctgaggtgagagtggcagataaggatgctgtttgg
agcctttggcaggcccccataagtgaatcacagtggaggcctctaggattttggagcaag
gccctgccatcttctgcagacaactactctccttttgagataaagctcttggcctgttac
tgggctttggcagaaactgaacatttgactatgggtcatcaagtcaccatgcgacctgaa
ctacctatcatgaactggatgctttctggcccatctagccataaagctgggtgtgcacag
cagcattccatcatcaaatggaagtggtatatacgtgatcgggctcaagcaggtcctgaa
ggcacaaccaggattcacgagtccagaaatcaaggggtgcaagtagaagtggcaaaactc
accatcacccctagtgatccactagcaaaatttttgcttcctgttcctgcaacattacgt
tctgctggcctagggggtcttagttccagagggaggaacgctgccaccaggagacacaac
aatgattccattaaactggaagttcagattgccacctgggtgctttgggctcctcctacc
tttaagtcaacaggctaa