GENSCAN 1.0 Date run: 8-Nov-116 Time: 03:26:28

Sequence gi568815586f:22525164_22784951 : 259788 bp : 37.52% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.06 Intr -    542    447   96  0  0   69   92    57 0.683   3.26
 1.05 Intr -   2729   2558  172  2  1   30   91   195 0.867  12.59
 1.04 Intr -  10181  10095   87  2  0  114  107    43 0.995   8.05
 1.03 Intr -  19038  18898  141  0  0   52   73   158 0.880  10.33
 1.02 Intr -  19262  19157  106  1  1   81   79    67 0.866   4.40
 1.01 Init -  20861  20806   56  2  2   93   87    55 0.437   6.71
 1.00 Prom -  29956  29917   40                               -1.95

 2.00 Prom +  31890  31929   40                               -3.75
 2.01 Init +  33732  33776   45  2  0   82   72    45 0.713   3.03
 2.02 Intr +  42471  42647  177  2  0  125   61    89 0.129   9.09
 2.03 Intr +  45909  45993   85  2  1   32   73    75 0.064  -1.13
 2.04 Intr +  63797  63940  144  0  0   56   75    94 0.183   4.23
 2.05 Intr +  94439  94533   95  0  2   91   94    64 0.159   6.06
 2.06 Intr +  99014  99064   51  1  0   67  115    25 0.025   1.29
 2.07 Intr +  99983 100423  441  1  0   -3   97   308 0.010  15.33
 2.08 Intr + 118600 118859  260  0  2    5   88   214 0.009   8.44
 2.09 Intr + 124427 124459   33  2  0   97  115    24 0.727   2.42
 2.10 Intr + 133851 133991  141  0  0   71   93   103 0.443   7.65
 2.11 Intr + 140525 140549   25  2  1   57   83    35 0.262  -2.99
 2.12 Intr + 143435 143537  103  1  1   73   52    71 0.328   0.83
 2.13 Intr + 146109 146192   84  1  0  100   75    64 0.965   5.17
 2.14 Intr + 148337 148497  161  0  2   79  111   121 0.968  12.29
 2.15 Term + 149600 149632   33  1  0   68   45    19 0.344  -7.79
 2.16 PlyA + 149781 149786    6                                1.05

 3.04 PlyA - 149897 149892    6                                1.05
 3.03 Term - 152649 152460  190  2  1   13   36   160 0.682  -0.76
 3.02 Intr - 154143 153942  202  1  1   58   99   119 0.773   7.42
 3.01 Init - 154286 154226   61  2  1   45   97    85 0.979   6.56
 3.00 Prom - 158622 158583   40                               -5.15

 4.10 PlyA - 158909 158904    6                                1.05
 4.09 Term - 167323 167124  200  2  2   25   47   118 0.635  -2.02
 4.08 Intr - 168023 167873  151  2  1   70   65    79 0.654   2.61
 4.07 Intr - 168382 168141  242  2  2   71   45   240 0.755  14.35
 4.06 Intr - 173237 173124  114  0  0  102   94   -14 0.463   0.10
 4.05 Intr - 174662 174489  174  0  0   52   39   110 0.435   1.49
 4.04 Intr - 175811 175713   99  0  0   56   77    60 0.361   0.86
 4.03 Intr - 178521 178297  225  1  0   58   23   175 0.454   5.03
 4.02 Intr - 181463 181339  125  1  2   26   75   113 0.796   3.11
 4.01 Init - 186285 186176  110  0  2   63   44    86 0.735   1.44
 4.00 Prom - 194060 194021   40                               -2.25

 5.04 PlyA - 194407 194402    6                                1.05
 5.03 Term - 206469 206159  311  1  2   53   49   119 0.548  -1.26
 5.02 Intr - 216791 216729   63  0  0   95   92    51 0.478   3.97
 5.01 Init - 217118 217001  118  2  1   99   50    72 0.825   4.91
 5.00 Prom - 220866 220827   40                               -4.65

 6.00 Prom + 232395 232434   40                               -3.75
 6.01 Init + 252913 253040  128  0  2   95   59   130 0.882  10.48
 6.02 Intr + 255499 255581   83  1  2   74   84    63 0.246   2.86
 6.03 Intr + 258905 258994   90  0  0   89   84    33 0.077   2.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 118654 118859  206  0  2   98   88   141 0.934  13.47

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586f:22525164_22784951|GENSCAN_predicted_peptide_1|220_aa
MNSDFVDTGYTVNADFDICLVGPGRKNPSKPFLPPPLRNEVSAPARDSTLPPATTRLFVS
AGAQEEAETETMPGKLKVKIVAGRHLPVMDRASDLTDAFVEVKFGNTTFKTDVYLKSLNP
QWNSEWFKFEVDDEDLQDEPLQITVLDHDTYSANDAIGKVYIDIDPLLYSEAATVISGWF
PIYDTIHGIRGEINVVVKVDLFNDLNRFRQSSCGVKFFCX

>gi568815586f:22525164_22784951|GENSCAN_predicted_CDS_1|660_bp
atgaactctgattttgttgatactgggtatactgtcaatgctgattttgatatttgcctt
gtggggccgggacggaagaacccatcaaagcccttcctcccgccgcctctccggaatgag
gtctcagcacccgcgcgggacagcacccttcctcctgcgacaactcgtttgtttgtttct
gcaggagcccaagaagaggccgaaaccgagaccatgccagggaagctgaaggtgaaaatc
gtggccgggcgccatttgccagtgatggaccgtgctagtgacctgactgatgccttcgtg
gaggtaaaatttggtaataccacctttaaaacagatgtgtaccttaagtcactcaaccct
cagtggaactcggagtggtttaaatttgaggtggatgatgaagacttacaagatgaacct
ttacagatcacagttcttgaccatgatacttacagtgcaaatgatgccattggtaaagtg
tacattgatattgatcctttactgtatagtgaagctgcaacagtcatctcaggatggttt
ccaatttatgacaccatacatggtatccgtggggaaatcaatgtagttgtcaaagtagac
ctcttcaatgatttaaatcgatttaggcagtcatcatgtggagtcaaattcttttgcann

>gi568815586f:22525164_22784951|GENSCAN_predicted_peptide_2|625_aa
MGSQPGNAMQYGKCWVGELRCRKEQTPLSRVYCAPLLRGGSAQVSKCGNQQEQTPCRPHG
SIQVGVPMTPRPQRIPIQGICEYRKVHLDMSSKDQLKLVSGGTSFITIPKGCVFRVCILG
NRMVARSTNNSSKIRMLLSMTVTKLNRAKRGTIVGSGNVAIDKLWLCFPGGNIPIHKRDT
KKFRKFTVIGEVIILLITKRQASPGMLCGRPRSSSDNRNFLRERAGLSSAAVQTRIGNSA
ASRRSPAARPPVPAPPALPRGRPGTEGSTSLSAPAVLVVAVAVVVVVVSAVAWAMANYIH
VPPGSPEVPKLNVTVQDQEEHRCREGALSLLQHLRPHWDPQEVTLQLFTDGITNKLIGCY
VGNTMEDVVLVRIYGNKTELLVDRDEEVKSFRVLQAHGCAPQLYCTFNNGLCYEFIQGEA
LDPKHVCNPAIFRYHTTLIAIALWLIARQLAKIHAIHAHNGWIPKSNLWLKMGKYFSLIP
TGFADEDINKRWWIGFELKIRYAIVAAMWRRTPCFLNALTGGLSHHLFTCTEPGDVQFID
YEYSGYNYLAYDIGNHFNEFAGVSDVDYSLYPDRELQSQWLRAYLEAYKEFKGFGTEVTE
KEVEILFIQVNQFALKADLDFLANF

>gi568815586f:22525164_22784951|GENSCAN_predicted_CDS_2|1878_bp
atgggctcacaaccaggaaatgcaatgcaatacggcaagtgctgggtgggagaacttagg
tgccgtaaggagcaaactccactctctcgggtctactgtgctccactcctcaggggaggg
agtgctcaagtgagcaagtgtgggaaccagcaggagcaaactccatgcaggccccacggc
agcatccaggttggggtgcctatgactccaaggccccagaggatacccattcagggaatc
tgtgaatatagaaaagttcacttggacatgagcagcaaggatcagctgaagttggtctca
ggtggcacttcattcattaccataccaaaagggtgtgtcttcagagtttgtatccttggg
aaccgcatggttgctaggtcgaccaataacagctctaagattcgcatgctgttgtccatg
actgttactaagctaaacagagccaaaagaggcactattgtaggcagtggaaatgtggca
atagacaagctgtggctctgctttcctggcggtaacattccaatacataagagggacact
aaaaagtttaggaagtttaccgttatcggtgaagtaattattttgttaattacaaagcgt
caggccagccccggcatgctctgcggccgcccgcggtccagctccgacaacaggaatttt
ctccgagagcgggccgggctcagttcagctgctgtccagacccggatcggcaacagtgcc
gcctccagacgttctcctgccgctcgcccgcccgtcccagcgcccccagccctcccgcga
gggcgccccgggacggaaggatccaccagtctgtcggcgcccgccgttctcgtggtcgcc
gtcgccgtcgtcgtggtggtagtctccgccgtcgcctgggccatggccaattacatccac
gtccctcccggctccccggaggtgcccaagctgaacgtcaccgttcaggatcaggaggag
catcgctgccgggagggggccctgagcctcctgcaacacctgcggcctcactgggacccc
caggaggtgaccctgcagctcttcacagatggaatcacaaataaacttattggctgttac
gtgggaaacaccatggaggatgtagtcctggtgagaatttatggcaataagactgagtta
ttagtcgatcgagatgaggaagtaaagagttttcgagtgttgcaggctcatgggtgtgca
ccacaactctactgtaccttcaataatggactatgctatgaatttatacaaggagaagca
ctggatccaaagcatgtctgcaacccagccattttcagatatcacactaccttaattgct
atagctttatggctaatagctcgtcagcttgctaaaatccatgctattcatgcacacaat
ggctggatccccaaatctaatctttggctaaagatgggaaagtatttctctctcattccc
acaggatttgcagatgaagacattaataaaaggtggtggattggatttgagctgaagatt
aggtatgccattgttgctgccatgtggcggcgcaccccgtgcttcttaaacgcactgact
ggaggtttatcgcatcacttgttcacatgcacggagcctggtgatgtacagttcattgat
tatgaatattctggatacaactacctggcatatgatattggaaatcatttcaatgaattt
gcaggtgtgagtgatgtagactatagtctgtatccagatagagaactacagagtcagtgg
ctgcgtgcttaccttgaagcctacaaagaatttaagggctttgggactgaagttactgaa
aaggaggtagaaatactcttcattcaagtcaatcagtttgcattgaaagctgatttggat
ttccttgcaaacttttga

>gi568815586f:22525164_22784951|GENSCAN_predicted_peptide_3|150_aa
MSAGIIQLAASTAKTKQAEEESKFFSIWTLGLKRQWFARGLLGLWPQTEGSTVGSSGFET
FKLVLRHYWLLSSSACRRPTVRLQLVIRQTESQIMSELPFTIAAKRIKCPGIQRTRDVKD
LFKENYKTTDQGNKRTCTNGKTFHAHRYEE

>gi568815586f:22525164_22784951|GENSCAN_predicted_CDS_3|453_bp
atgtcggctggtatcatccaattggctgccagcacggcaaaaacaaagcaggcagaagaa
gaatccaagttcttcagcatttggacacttggacttaaacgccagtggtttgccaggggg
ctcttgggcctttggccacagactgaaggcagcactgtcggctcttctggttttgagact
ttcaaacttgtactgcgccactactggcttctttcttcctcagcgtgcagacggcctact
gtgagacttcaacttgtgatcagacaaacagagagccaaatcatgagtgaactcccattc
acaattgctgcgaagagaattaaatgcccaggaatacaacgtacaagggatgtgaaagac
ctcttcaaggagaactacaaaaccactgatcaaggaaataagaggacatgcacaaatgga
aaaacattccatgctcatagatacgaagaataa

>gi568815586f:22525164_22784951|GENSCAN_predicted_peptide_4|479_aa
MTEIKNTRQKGFEVSTGFQIDICGKLLEVWDWSTRERKTYGVLQKVKPIGYVCLSVKSSN
QYVLEVNVSEWKLFGEEGGCYFKLVDCVDNHDPSFLAVKHHHLASMLQSTIISNATHEKR
SVIASPTYSTGLQRTVTEPLGWWLFHQLIPDGHGTTASTSKIVTGQGQPQRQEWLDISKS
KNWLDIDANPSRAKLSGVSLYTDANPSRAKLSGVSLYTDANPSRAKLSGVSLYTDANPSR
AKLSVRYSGLTRINKRKTKFHSTFTKTGAISNPPLILHLYCKVPTLAALEEPFRPPLHCG
SPSLGWPEPAPSACREVWKQRHWWEPRLRVAHVALTGQREFQMGMGSAGPALRAAGQRRQ
PWAGRARDLQSTTPKPPPPQTPWAPAQTEPPRQAPPPAQQCPVPSTAQGLRSVGRSAASL
LKPVRPGTHQKEGTPNTSEHQKEQTPDTPSLRTVTLTAKVRGFILEVSETKNPPISDTM

>gi568815586f:22525164_22784951|GENSCAN_predicted_CDS_4|1440_bp
atgacagagatcaagaatacaagacagaaaggttttgaagtatctacaggattccagatt
gacatatgtggaaagctgttggaagtttgggactggagcacaagagaaagaaagacctat
ggagttttacagaaggttaagccaattggttacgtgtgccttagtgtcaagtcctcaaat
caatatgtacttgaggtcaatgtcagtgaatggaaattatttggagaggagggaggctgt
tacttcaagcttgttgattgtgttgataatcatgacccctcatttctggcagttaagcac
caccatctggcctctatgcttcagagcactatcatttccaatgctacccatgagaaacgt
tctgtgatagcctctcctacctacagcactggcttacagagaactgtcactgaacctctt
ggctggtggctctttcatcagcttattcctgatggccatggtacaacagcatcaactagc
aaaattgtcacggggcagggccagccccagaggcaggagtggttggatatatccaagtct
aagaattggctggatattgacgctaatcctagcagggcgaagctctcaggtgtctctctt
tatacagacgctaatcctagcagggcgaagctctcaggtgtctctctttatacagacgct
aatcctagcagggcgaagctctcaggtgtctctctttatacagacgctaatcctagcagg
gcgaagctctcagtgagatactcagggttgacaaggataaacaagagaaaaactaaattc
cattcaacctttacaaagactggtgcaattagcaacccccctctcatccttcatttgtat
tgcaaagtgcccactctggccgcgcttgaggagcccttcaggccgccgctgcactgtggg
agcccctctctgggctggccagagccggctccctctgcttgccgggaggtgtggaagcag
aggcactggtgggaacccaggctgcgtgtggcccacgtggccctcacgggccagcgcgag
ttccagatgggcatgggctcggcgggccccgcactccgagccgccggccagcgccgccag
ccctgggcaggcagggctcgggacctccagtccaccacgcccaagcctcccccaccccaa
accccgtgggctcccgcgcagaccgagcctccccgacaggcaccgccccctgctcagcag
tgtccagtcccatccaccgcccaagggctgaggagtgtgggaaggtctgctgcttcactc
ctgaagccggtgagaccaggaactcaccagaaggaaggaactccaaacacgtctgaacat
cagaaggaacaaactccggacacaccatctttaagaactgtaacactcactgcaaaggtg
cgtggcttcattcttgaagtcagtgagaccaagaacccaccaatttcggacacaatgtga

>gi568815586f:22525164_22784951|GENSCAN_predicted_peptide_5|163_aa
MAVIPMKLKMLSNRLPLFENIDCFRVRVTESIQVGYWPRVAQIVSDSSRAWATSLRCQSR
VFSYRQIRGGQWAQVVPGGHRPVLIEYTKHHNKDDYTDHLKPFKSVSFGIETWIESERMN
EILQADRREEKDSSNKDNSMLKPKQMNGMGGLVTVADTWLEGR

>gi568815586f:22525164_22784951|GENSCAN_predicted_CDS_5|492_bp
atggccgtcataccaatgaagctcaagatgctgtcaaatagattgcccctttttgaaaat
attgattgtttcagggtcagagtgactgaatctattcaagtaggctactggcccagagta
gctcagattgtcagtgacagctccagggcttgggccacaagcctcaggtgccaatcaaga
gtcttctcatatagacaaattaggggtgggcaatgggctcaggtcgtgcctggtggtcat
aggcctgttctaatagagtacactaagcatcataataaagatgactacacagatcacttg
aaaccattcaagtcggtgagttttggaatagaaacttggattgagtcagaaagaatgaat
gaaattctccaggctgacaggagggaggagaaagattcctcaaacaaagacaatagcatg
ctcaaacctaaacaaatgaatggcatgggtggcctggtcactgtggctgacacatggctt
gaaggccgatga

>gi568815586f:22525164_22784951|GENSCAN_predicted_peptide_6|101_aa
MDMKGPEAQVRGESSTCLKNTLEKLAACTGLVLTQCGAKICQSFSRAVEDRKKKSPFLED
FDSKRFPMSTGCFWAQFLKNSSQVRNNSGKDWGSISGNRSX

>gi568815586f:22525164_22784951|GENSCAN_predicted_CDS_6|303_bp
atggacatgaaagggccagaggcacaggtgaggggagagagcagcacctgcctgaagaat
acacttgagaaactggctgcatgcacaggccttgttcttacacagtgtggggctaagatc
tgccagagttttagcagggccgtagaagacaggaaaaagaaatcacccttcttagaagat
tttgattcaaaaagatttcctatgtccactggttgtttctgggcccagtttctgaaaaac
tcttcccaggtgagaaataatagtggcaaagactggggcagtatctcagggaataggtca
gnn