GENSCAN 1.0 Date run: 8-Nov-116 Time: 11:05:11 Sequence gi568815586r:8504287_8707016 : 202730 bp : 42.20% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 4178 4355 178 2 1 81 20 159 0.812 7.40 1.02 Intr + 4464 4583 120 2 0 76 23 82 0.511 0.27 1.03 Intr + 8156 8278 123 1 0 65 21 169 0.555 7.76 1.04 Intr + 13907 13988 82 1 1 21 87 57 0.033 -2.81 1.05 Intr + 14723 14874 152 0 2 -3 89 139 0.633 3.86 1.06 Intr + 15940 16055 116 0 2 40 84 88 0.817 1.93 1.07 Intr + 16838 16947 110 2 2 99 21 108 0.573 4.21 1.08 Intr + 23059 23161 103 2 1 61 72 119 0.795 5.91 1.09 Intr + 23454 23565 112 0 1 85 94 46 0.744 4.36 1.10 Term + 26747 26887 141 1 0 87 44 81 0.570 0.55 1.11 PlyA + 27387 27392 6 1.05 2.04 PlyA - 28279 28274 6 1.05 2.03 Term - 30523 30352 172 0 1 92 48 115 0.602 4.22 2.02 Intr - 31919 31804 116 2 2 103 76 65 0.885 5.13 2.01 Init - 32870 32829 42 2 0 94 70 31 0.410 2.39 2.00 Prom - 39259 39220 40 -3.65 3.03 PlyA - 39656 39651 6 1.05 3.02 Term - 55658 55450 209 0 2 57 28 221 0.952 9.52 3.01 Init - 56456 56294 163 2 1 55 88 58 0.565 2.44 3.00 Prom - 56913 56874 40 -4.05 4.05 PlyA - 59903 59898 6 1.05 4.04 Term - 60756 60601 156 0 0 76 42 122 0.055 3.35 4.03 Intr - 62301 62119 183 0 0 53 83 123 0.025 7.36 4.02 Intr - 63738 63551 188 1 2 64 28 78 0.037 -2.11 4.01 Init - 64334 64193 142 2 1 87 61 105 0.044 8.06 4.00 Prom - 66337 66298 40 -5.65 5.00 Prom + 66352 66391 40 -4.65 5.01 Init + 79089 79161 73 2 1 74 77 51 0.337 3.88 5.02 Term + 87085 87212 128 2 2 117 33 101 0.862 4.96 5.03 PlyA + 88606 88611 6 1.05 6.07 PlyA - 88729 88724 6 1.05 6.06 Term - 100051 99998 54 1 0 122 44 65 0.892 2.08 6.05 Intr - 100636 100521 116 2 2 77 87 29 0.784 0.95 6.04 Intr - 101199 100929 271 0 1 92 86 254 0.898 21.79 6.03 Intr - 101442 101354 89 1 2 68 58 38 0.353 -2.43 6.02 Intr - 102726 102536 191 2 2 103 74 167 0.500 15.11 6.01 Init - 122304 121934 371 0 2 68 22 143 0.017 2.21 6.00 Prom - 123180 123141 40 -6.65 7.00 Prom + 127249 127288 40 -3.65 7.01 Init + 130340 131002 663 1 0 59 48 497 0.831 38.08 7.02 Intr + 131120 131341 222 1 0 45 50 215 0.810 10.80 7.03 Intr + 132265 132616 352 0 1 96 18 391 0.406 26.87 7.04 Term + 132618 133279 662 1 2 -46 37 732 0.385 47.78 7.05 PlyA + 134434 134439 6 1.05 8.09 PlyA - 136600 136595 6 1.05 8.08 Term - 143917 143805 113 2 2 109 43 43 0.035 -0.36 8.07 Intr - 145288 145215 74 0 2 27 116 73 0.037 2.13 8.06 Intr - 146303 146216 88 0 1 67 110 14 0.039 -0.39 8.05 Intr - 150195 150151 45 1 0 106 92 71 0.422 6.76 8.04 Intr - 151161 151129 33 1 0 103 99 55 0.941 5.38 8.03 Intr - 156669 156577 93 1 0 60 92 68 0.758 3.42 8.02 Intr - 157820 157761 60 0 0 61 105 63 0.400 3.09 8.01 Init - 159062 158996 67 2 1 74 50 66 0.463 2.69 8.00 Prom - 162045 162006 40 -4.55 9.03 PlyA - 164371 164366 6 1.05 9.02 Term - 166752 166183 570 0 0 42 38 325 0.649 16.35 9.01 Init - 167956 167822 135 1 0 54 88 115 0.977 8.29 9.00 Prom - 172433 172394 40 -6.95 10.00 Prom + 175856 175895 40 -4.95 10.01 Init + 177389 177532 144 1 0 41 119 102 0.720 8.90 10.02 Intr + 185402 185473 72 1 0 104 59 28 0.035 0.18 10.03 Intr + 195499 195642 144 0 0 103 74 54 0.157 5.06 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 60637 60730 94 1 1 59 94 49 0.899 1.65 S.002 Init - 103386 103346 41 0 2 64 82 45 0.924 1.21 S.003 Sngl - 122304 121930 375 0 0 68 50 139 0.819 3.99 S.004 Intr + 144421 144587 167 0 2 88 100 132 0.823 12.24 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:8504287_8707016|GENSCAN_predicted_peptide_1|412_aa XLVAAWVDTTAVVVAEGLSVAFGNSTPERHRAAASGNVQLEMGWLRYEPKPRALPSEEQG PKGSKGSTTAVTAAKWLSVVSGNSAPDRYRATANGNAQLGGKRPVVDTAYTPGLKLLTRC QLETQLSSGLIIAMIPSKEELRGTGVHKLEHHAKLKCIKEKSELKSAEGSTWNCCPIDWR AFQSNCYFPLTDNKTWAESERNCSGMGAHLMTISTEAEQNFIIQFLDRRLSYFLGLRDEN AKGQWRWVDQTPFNPRRVFWHKNEPDNSQGENCVVLVYNQDKWAWNDVPCNFEARAQAHG EPNSVPEPLAGVFGDPAGKPCPLRKAVSRSSVGLGTFQLRAGESRAFQGWDARPRWRAFV SGIFQSIVLLLRLRHLCSTTPTYGRPPHSSSGFETLGQSNPHQDALLNPLAF >gi568815586r:8504287_8707016|GENSCAN_predicted_CDS_1|1239_bp nncctagtggcagcatgggtagataccactgcagtggtagtggcagaggggctgtcagtt gcttttgggaactccaccccagagagacacagggctgctgccagtgggaatgttcagctg gagatggggtggctacgctatgagcccaagcccagggccctgcctagtgaagagcagggg cccaagggcagcaagggcagtaccactgcagtgacagcggcaaagtggctgtcagtggtc tctgggaactccgccccagacagatacagagccacagctaatgggaatgctcagctgggg ggcaagcgaccagtggttgacaccgcatatacacctggcttaaagttattgactcgctgc cagttggagacccagctgtcctctggactcatcatcgcaatgattccatcaaaggaggag ctgagaggcacaggagtgcacaagttagagcaccatgcaaagctcaaatgcatcaaagag aaatcagaactgaaaagtgctgaagggagcacctggaactgttgtcctattgactggaga gccttccagtccaactgctattttcctcttactgacaacaagacgtgggctgagagtgaa aggaactgttcagggatgggggcccatctgatgaccatcagcacggaagctgagcagaac tttattattcagtttctggatagacggctttcctatttccttggacttagagatgagaat gccaaaggtcagtggcgttgggtggaccagacgccatttaacccacgcagagtattctgg cataagaatgaacccgacaactctcagggagaaaactgtgttgttcttgtttataaccaa gataaatgggcctggaatgatgttccttgtaactttgaagcaagggcccaggcccacgga gaaccaaattctgtccccgagcctctggctggagtttttggagatcctgcagggaagccc tgccccctgaggaaggctgtgtcaaggagttcagtaggccttggcacattccagctgaga gccggtgagagtcgcgcgttccagggttgggatgctaggccccggtggcgtgcgtttgtg agtgggatcttccaatccatagtcctcctgcttcggctcagacacctttgctcaaccacc cccacgtatgggcgtcctcctcactctagctcaggctttgagactttgggacagtccaac ccccaccaggatgccctgctgaacccactggcgttctga >gi568815586r:8504287_8707016|GENSCAN_predicted_peptide_2|109_aa MGAHLVVINSQEEQEFLSYKKPKMREFFIGLSDQVVEGQWQWVDGTPLTKSLSFWDVGEP NNIATLEDCATMRDSSNPRQNWNDVTCFLNYFRICEMVGINPLNKGKSL >gi568815586r:8504287_8707016|GENSCAN_predicted_CDS_2|330_bp atgggggctcacctggtggttatcaactcacaggaggagcaggaattcctttcctacaag aaacctaaaatgagagagttttttattggactgtcagaccaggttgtcgagggtcagtgg caatgggtggacggcacacctttgacaaagtctctgagcttctgggatgtaggggagccc aacaacatagctaccctggaggactgtgccaccatgagagactcttcaaacccaaggcaa aattggaatgatgtaacctgtttcctcaattattttcggatttgtgaaatggtaggaata aatcctttgaacaaaggaaaatctctttaa >gi568815586r:8504287_8707016|GENSCAN_predicted_peptide_3|123_aa MINQTYRLRHHCLTKPPNHINFFTKTGETYLNNFDFTQRLAKQIILRCPDCQLTARHIKA YRGVARTQPSTRNEENDPTGPTALDDAASSDDTRLRYYLGDPEEDNSGGRANPALDTDTI HSR >gi568815586r:8504287_8707016|GENSCAN_predicted_CDS_3|372_bp atgatcaatcagacctacaggttaagacatcattgcttgaccaagccccccaatcacatc aattttttcaccaaaactggagaaacttatctaaacaatttcgactttacccagagacta gctaaacaaattatcctacgatgcccagattgccagctcacagcacgacacatcaaagca taccgtggcgtggctaggacccaacccagtaccagaaatgaagaaaatgaccctacagga ccaacagccctggatgatgcagcttcctcggatgacacacgcctcagatattacctaggg gatcctgaagaggataactcaggaggccgagcaaatcctgctctggacacagacaccatt cactccagataa >gi568815586r:8504287_8707016|GENSCAN_predicted_peptide_4|222_aa MVAADPGLLHEGAPPPGQGCSCPNCHCGSEPPCALGGGLEQTGSALPGPPMASHGPIGRH FLPSEVHKSLGLSQSKVEGREKPWAQPEKGRGQRTERGLDNQLQRGATFSMPDEYGPVKI QVPFSLQDLLQTKGDLGRFSDGPNRYVEAFQKLTQVTWHPAMSSVGYESAEFPYLILPLT RCVIFSKSLSVLERSSSLGAPLRLCSACFRCLATGRDTRIER >gi568815586r:8504287_8707016|GENSCAN_predicted_CDS_4|669_bp atggtggctgcagacccaggtctcctgcacgaaggagccccacctcctgggcaaggctgc agctgtccaaactgccactgtggatctgagcctccctgtgctcttggagggggcctggag cagacaggatctgccctcccaggcccacccatggccagtcatggaccaatcggcaggcac ttcctcccatctgaggtccataaaagcctagggctcagccagagtaaagtagagggcaga gaaaagccctgggctcagccagagaagggcagaggacagaggactgagagaggactggac aaccagctgcaaagaggagctaccttctctatgcctgatgaatatggccctgttaagatc caggtgcccttttctcttcaggatttattgcaaactaagggggatcttggcaggttttca gatggccctaataggtatgtagaagctttccagaaattaacccaagtaacatggcatcct gccatgagttcagttggctatgagagtgcagagtttccttatctgattctgccacttact agatgtgtgattttcagcaaatcactcagtgtcttggaacgttcctcatccttgggcgct ccccttcgtctttgcagcgcttgctttcgttgcctggcaacgggacgcgacaccaggatt gagcgttaa >gi568815586r:8504287_8707016|GENSCAN_predicted_peptide_5|66_aa MIVTFLRTPQKQMPVSCFLYSLQNHCSTLTKCFVSHCWTTDLSARGHPIQVDPIAVECYS TGVPQT >gi568815586r:8504287_8707016|GENSCAN_predicted_CDS_5|201_bp atgattgtaaccttcctgaggactccccagaagcagatgcctgtgtcatgcttcctgtac agcctgcagaaccattgctcaaccctcacaaaatgttttgtaagtcattgttggaccacg gacttgagcgctcgaggccatcctatccaggttgatcccattgctgtggaatgttacagc acaggggtcccacagacgtag >gi568815586r:8504287_8707016|GENSCAN_predicted_peptide_6|363_aa MTLNEHAAFKHLFNKAHLAPPLIHLTLSGHSTCFREHRVGDKVTDQQDPKAEEFFLVQNK MKTLPCLLLSTQTRQPSDFSIFSPPFPPFYSTKPPLSSWPIPNEPLGTPPRRGRGRAEGL LTSHLLMNRRKFLYQFKNVRWAKGRRETYLCYVVKRRDSATSFSLDFGYLRNKVSIKVGF ASSLMVNSHFVFLIPQKPIAFLHSAGLVLPRVQLYLLNGCHVELLFLRYISDWDLDPGRC YRVTWFTSWSPCYDCARHVADFLRGNPNLSLRIFTARLYFCEDRKAEPEGLRRLHRAGVQ IAIMTFKDYFYCWNTFVENHERTFKAWEGLHENSVRLSRQLRRILLPLYEVDDLRDAFRT LGL >gi568815586r:8504287_8707016|GENSCAN_predicted_CDS_6|1092_bp atgactcttaacgagcatgctgccttcaagcatctgtttaacaaagcacatcttgcaccg cccttaatccatttaaccctgagtggacacagcacatgtttcagagagcacagggttggg gataaggtcacagatcaacaggatcccaaggcagaagaatttttcttagtacagaacaaa atgaaaactctcccatgtctacttctatccacacagacccggcaaccatccgatttctca attttttccccacccttcccgcctttctattccacaaaaccgccattgtcatcatggccc atccccaatgagccgctgggcacacctcccagacggggtcgtggccgggcagaggggctc ctcacttcccacctcttgatgaaccggaggaagtttctttaccaattcaaaaatgtccgc tgggctaagggtcggcgtgagacctacctgtgctacgtagtgaagaggcgtgacagtgct acatccttttcactggactttggttatcttcgcaataaggtatcaattaaagtcggcttt gcaagcagtttaatggtcaactcgcacttcgtcttcctcattccacaaaaacccatagcc ttccttcactctgcaggactagtgctgccaagggttcagctctacctactgaacggctgc cacgtggaattgctcttcctccgctacatctcggactgggacctagaccctggccgctgc taccgcgtcacctggttcacctcctggagcccctgctacgactgtgcccgacatgtggcc gactttctgcgagggaaccccaacctcagtctgaggatcttcaccgcgcgcctctacttc tgtgaggaccgcaaggctgagcccgaggggctgcggcggctgcaccgcgccggggtgcaa atagccatcatgaccttcaaagattatttttactgctggaatacttttgtagaaaaccac gaaagaactttcaaagcctgggaagggctgcatgaaaattcagttcgtctctccagacag cttcggcgcatccttttgcccctgtatgaggttgatgacttacgagacgcatttcgtact ttgggactttga >gi568815586r:8504287_8707016|GENSCAN_predicted_peptide_7|632_aa MLSLFHPKPGRFIAGADINMLATCKTPQEVTQVSQEAQRTFEKLEKAAKPVIAAISGSCL GGGLELAISCQYRIATKDGKTILGAPEVLLGILPGAGGTQRLPKMVGVPAVFDMMLTSRN IHADSAKKMGLVDQLVEPLGPGIKPPEEWTIEYLEAVAITFAKGLADKKISPKRDKGLVE KLTAYTMTIPFVRQQVYKKVEEKVQKETKGLYPAPLKIIDVVLCKKNKFGAPQKDVKRLA ILGEGLKGAGITQVSVDKRLKTILKDATLTRLGQGQQQVFKGLNDKVNKKALTSFSLVPP DEKHPQMWRGRPPLQLGSLISNVLKVDMVIEAVFEDLSLKHRVLKEVEAVIPDHCVFASN TSAFPISEIAAVSKRPEKVIGMHYFSPMDKMQLLEMITTEKTSKDTQCFSCGISLKQGKV IIVVMDGPGFYTTRCLAPMMSEVIRILQEGVDLEKLDSLTTSFGFPVGAATLVDEVGVDV AKHMAEDLRKALGERFGGGNPELLTQMVSKGFLGRKFGKGFYIYQEGVKSKNLNSDVDSI LVSLKMPPKSEVSPGEDIQFRLVTRFVNEAVMCLREGILATPAEGDIAAVFGLGFPPCFG GPFCFVALYGPQKIVARLKKYEAAYGKQFTPC >gi568815586r:8504287_8707016|GENSCAN_predicted_CDS_7|1899_bp atgctgtccttatttcatccaaagccaggccgctttattgcaggtgctgatatcaatatg ttagccacttgcaagacccctcaagaagtaacacaggtatcacaagaagcacagagaaca tttgagaaacttgaaaaggccgcaaagcctgttatagctgccatcagtggatcctgcctg ggaggagggcttgagcttgccatttcatgccaatacagaatagcaacaaaagatggaaaa acaatattaggtgcccctgaagtcttgctggggatcttaccaggagcaggaggcacacaa aggctgcccaaaatggtgggtgtgcctgctgtttttgacatgatgctgactagtagaaac attcatgcagatagcgcaaagaaaatgggactggttgaccaattggtagaacccctggga ccgggaataaaacctccagaggaatggacaatcgaatacctggaagcagttgcaattact tttgccaaaggactagctgataagaagatctctccaaagagagacaagggattggtggaa aaattgacagcgtacaccatgactattccatttgtcaggcaacaggtttacaaaaaagtg gaagaaaaagtgcaaaaggagactaaaggcctttatcctgcacctctgaaaataattgac gtggtcttgtgcaagaagaataaatttggagcaccacagaaggatgttaagcgtctggct attcttggtgaagggctgaagggagcaggcatcacccaagtctctgtggataagaggcta aagactatacttaaagatgccacactcactaggctaggccaaggacagcaacaagtgttc aaaggattgaatgataaagtgaataagaaagctctaacatcattttctctcgtcccaccc gacgagaaacacccacagatgtggaggggcaggccaccccttcaactgggcagcttgatt agcaacgttttaaaggtcgacatggtgattgaagctgtgtttgaggaccttagtcttaag cacagagtgctaaaggaagtagaagcggtgattccagatcactgtgtctttgccagtaac acatctgctttcccaatcagtgaaatcgctgctgtcagcaaaagacctgagaaggtgatt ggcatgcactacttctctcccatggacaagatgcagctgctggagatgatcacaaccgag aaaacttccaaagacactcagtgcttcagctgtggcatcagtctcaagcaggggaaggtc atcattgtggttatggatggacctggcttctataccaccaggtgtcttgcacccatgatg tctgaagtcatccgaatcctccaggaaggagttgacctggagaagctggattccctgacc acaagctttggctttcctgtgggtgccgccacactggtggatgaagttggcgtggatgta gcgaaacatatggcggaagatctgcgcaaagcccttggggagcggtttggaggtggaaac ccagaactgctgacacagatggtgtccaagggcttcctaggtcgcaagtttgggaagggc ttttacatctatcaggagggtgtgaagagtaagaatttgaattctgatgtggatagtatc ttagtgagtctaaagatgcctcctaagtctgaagtctccccaggtgaagacatccagttc cgcctggtgacaagatttgtgaatgaggcagtcatgtgcctgcgagaggggatcttggcc acacctgcagagggagacatcgcagctgtctttgggcttggcttcccgccttgttttgga gggcctttctgctttgtggctctgtatggcccccagaagatagtggcccggctcaagaag tatgaggctgcctatggaaaacagttcactccatgctag >gi568815586r:8504287_8707016|GENSCAN_predicted_peptide_8|190_aa MKKKYRNKKKFVSAEETNIEKENMSLLGPKVLLFLAAFIITSGLRGIELQSLTSLFSSFA IDWIPLGVNSQRGDLVNDPATDETVLAVLADIAPSTDDLECWDEKFTCTRLYSVHRPVKQ CIHQLCFTSLRRMYIVNKEICSRLVCKEHEAMKDELCRQMAGLPPRRLRRSNYFRLPPCE NVDLQRPNGL >gi568815586r:8504287_8707016|GENSCAN_predicted_CDS_8|573_bp atgaaaaagaaatacaggaataaaaagaaatttgttagtgctgaagagacgaatattgaa aaagaaaatatgtcgctcttgggacccaaggtgctgctgtttcttgctgcattcatcatc acctctgggttacgaggtatagagctgcagtcactcacatctcttttctcttcctttgct atagactggatacccctgggggtcaatagtcaacgaggagatctggtgaatgatcccgct acagatgaaacagttttggctgttttggctgatattgcaccttccacagatgacttggag tgctgggatgagaaatttacctgcacaaggctctactctgtgcatcggccggttaaacaa tgcattcatcagttatgcttcaccagtttacgacgtatgtacatcgtcaacaaggagatc tgctctcgtcttgtctgtaaggaacacgaagctatgaaagatgagctttgccgtcagatg gctggtctgccccctaggagactccgtcgctccaattacttccgacttcctccctgtgaa aatgtggatttgcagagacccaatggtctgtga >gi568815586r:8504287_8707016|GENSCAN_predicted_peptide_9|234_aa MWESLELPRDLLNDLAQNADSNMDNKIQAVWSQMEMRNLLGTGAKPRDLVTCVSATPAVA ERGQHTAWAVASEGGSPKPWQLPCDIGPAVGQKSRIEVWEPLSRFQKMYGNAWMPKQKFA AGVGPSCRTSARAVQKGNVWLEPPHKVLPEALPSGAVRRRPWPPRPQNGRSASSLHLVPG KAADTQHQPVKAAGRETVLCKATGAELPKTMGTHLLYHCGLDGRPGVKEMILEL >gi568815586r:8504287_8707016|GENSCAN_predicted_CDS_9|705_bp atgtgggaaagtttggaacttcctagagacttgttgaatgaccttgcccaaaatgctgac agcaatatggacaataaaatccaggctgtgtggtctcagatggagatgaggaacttgctg ggaactggagcaaagcctagggacttggtgacctgtgtctcagccactccagccgtggct gaaaggggccaacatacagcttgggctgtggcttcggagggtggaagccccaagccttgg cagcttccatgtgatatcgggcctgcagttggacagaagtcaagaattgaggtttgggaa cctctgtctagatttcagaagatgtatggaaatgcctggatgcccaagcaaaagtttgct gcaggggtggggccctcatgcagaacctctgccagggcagtgcagaagggaaatgtgtgg ttggagcccccacacaaagtccttcctgaggcactgcctagtggagctgtgagaagacgg ccatggccccccagaccccagaatggtagatccgccagcagcttgcaccttgtgcctgga aaagctgcagacactcaacaccagcctgtgaaagcagctggaagggagactgtactctgt aaagccacaggggcagagctgcccaagaccatgggaacccacctcttgtatcactgtggc ctggatgggagacctggagtcaaagagatgattttggagctttaa >gi568815586r:8504287_8707016|GENSCAN_predicted_peptide_10|120_aa MSFRVYGFVGCFHLLGGWCTLTQGQKPLSVLRTLADLFTWLFIFHNQLVCLSPQSFLRCL CLQQRGYGGDAEVLESACQGGGQSVILRTEPRRGTANSVTDNPPLGQEPLFSSCLAPPQG >gi568815586r:8504287_8707016|GENSCAN_predicted_CDS_10|360_bp atgagcttccgggtttatggattcgtgggctgcttccacctgctaggagggtggtgtact ctaactcagggacagaagcccctgtctgtgctcaggactcttgcagacctctttacctgg ctgttcatcttccataatcaactggtttgtctttctccccagtcctttctccggtgtctt tgtctccagcagagagggtatggtggtgatgcggaggtgctggagtctgcttgtcaggga ggggggcagtctgtgattctgagaacagagccaagaaggggaacagcaaattcagtcaca gacaatcctccactcggtcaagagccacttttctcttcctgccttgcccccccgcagggg