GENSCAN 1.0 Date run: 3-Nov-116 Time: 22:22:13 Sequence gi568815586r:84761635_84992120 : 230486 bp : 35.21% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 918 913 6 1.05 1.02 Term - 32598 32383 216 0 0 -22 48 275 0.280 8.66 1.01 Init - 73414 73271 144 1 0 73 63 131 0.895 9.27 1.00 Prom - 78655 78616 40 -7.35 2.00 Prom + 79449 79488 40 -4.65 2.01 Init + 83573 83736 164 1 2 45 116 90 0.531 6.75 2.02 Intr + 83962 84040 79 1 1 69 49 47 0.472 -2.47 2.03 Intr + 86567 86711 145 1 1 1 81 129 0.445 2.43 2.04 Intr + 91244 91397 154 0 1 60 87 91 0.902 4.41 2.05 Term + 95382 95610 229 0 1 100 45 106 0.812 2.62 2.06 PlyA + 97042 97047 6 1.05 3.10 PlyA - 97880 97875 6 1.05 3.09 Term - 100372 99998 375 1 0 107 37 327 0.996 23.25 3.08 Intr - 101967 101805 163 2 1 91 94 33 0.952 3.26 3.07 Intr - 105559 105400 160 2 1 73 97 88 0.994 6.32 3.06 Intr - 109036 108844 193 1 1 112 105 123 0.874 14.54 3.05 Intr - 111160 110968 193 0 1 94 30 143 0.862 7.67 3.04 Intr - 111723 111453 271 1 1 56 8 158 0.065 0.28 3.03 Intr - 116321 116283 39 0 0 122 75 19 0.304 1.28 3.02 Intr - 122406 122221 186 1 0 93 30 104 0.446 3.84 3.01 Init - 130486 130198 289 1 1 32 115 293 0.976 23.72 3.00 Prom - 141398 141359 40 -4.95 4.06 PlyA - 141632 141627 6 1.05 4.05 Term - 149193 149008 186 0 0 65 33 124 0.539 1.11 4.04 Intr - 150572 150189 384 2 0 39 77 274 0.602 15.52 4.03 Intr - 150975 150847 129 0 0 82 59 208 0.493 17.27 4.02 Intr - 176907 176708 200 1 2 62 42 113 0.031 2.25 4.01 Init - 178028 177830 199 2 1 47 -31 238 0.033 6.91 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 13594 13298 297 1 0 67 38 165 0.937 4.89 S.002 Sngl - 178028 177801 228 2 0 47 42 252 0.823 11.37 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:84761635_84992120|GENSCAN_predicted_peptide_1|119_aa MEEEYTVRIEANPFKEEAVGNVLADTDTCRNTQIFLEDHSLLLYLGEKEQWQTFSPIRSG NGHLTGSGAQRTPCRIRRGGSQLRVCDGGKQQWWTVSQSSARAITNTDQKSVQLQDLIE >gi568815586r:84761635_84992120|GENSCAN_predicted_CDS_1|360_bp atggaggaagaatatactgtcagaattgaagccaaccctttcaaagaagaggcagttggc aacgtattagcggacacagacacatgtagaaatacacagatattcctcgaggatcattct ctgctactatatttgggagagaaggaacaatggcaaacctttagcccgatcaggagtggc aatgggcaccttactggatcaggagcacagcggacaccctgccggatccggaggggtgga agtcagttacgggtctgcgatggcggcaaacagcagtggtggactgtcagccaaagctca gctcgagccataacaaacacggaccagaagagtgtgcagttgcaagatttaatagagtga >gi568815586r:84761635_84992120|GENSCAN_predicted_peptide_2|256_aa MLSNNITSYTEIFHERKSTYGTSFIVFLLLEIATATLTFSNHHPAQSTAQPSTLRLSGTE LAKSLRYAVYLSMIPEENLAMKKKKEEKKGKEEEEKKRERMEEEGRERRKEGEMKKGRKK GRRERGRKEARTSSTIEMLNNSGDSGHLCRVPDLRGNAFRFSPRSMILAMELSYMAFIML SQRSLSLWPLPPAHKVLCQTTTDVHLKTKDSYSPFRKVGSRLAKSWSRNAVQEPSPWLRD PKTLLVALAHCGGAGT >gi568815586r:84761635_84992120|GENSCAN_predicted_CDS_2|771_bp atgctatcaaacaacatcacatcctacacagaaatctttcatgaaaggaagtcaacctat gggacaagctttattgtcttcttacttttagaaattgccacagccaccctaaccttcagc aaccaccaccctgctcagtcaacagcccaaccatcaacattaaggttgtctggaactgaa cttgcaaaatctctaaggtatgctgtgtatttatccatgatccctgaagagaacttagca atgaaaaaaaagaaagaagaaaagaaaggaaaggaagaggaggagaagaagagggaaaga atggaggaagaagggagggagagaaggaaggaaggagagatgaagaaaggaagaaagaaa ggaaggagggaaagaggaaggaaggaagctaggacttccagtacgattgagatgttgaat aatagtggtgacagtgggcatctttgtcgtgttccagatcttagaggaaatgcttttaga ttttccccacgcagtatgatactagctatggagctgtcttatatggcttttatcatgttg agtcagaggagcctctctctgtggccactaccaccagcccataaggtgctctgccagacc accactgatgttcacttaaagaccaaggactcttactcacccttcagaaaagtgggctcc cgtctggccaaaagctggtccagaaatgctgtccaagagcctagtccttggctcagagac cccaagaccctgcttgttgccctagcccattgtggcggagctggtacctaa >gi568815586r:84761635_84992120|GENSCAN_predicted_peptide_3|622_aa MPKNSKVVKRELDDDVTESVKDLLSNEDAADDAFKTSELIVDGQEEKDTDVEEGSEVEDE RPAWNSKLQYILAQVGFSVGLGNVWRFPYLCQKNGGVVEPECEQSSATTYYWYREALNIS SSISESGGLNWKMTICLLAAWVMVCLAMIKGIQSSGKVNTYDRVTMSATVLADSIMFIFK QLEIMLEPKVWREAATQVFFALGLGFGGVIAFSSYNKRDNNCHFDAVLVSFINFFTSVLA TLVVFAVLGFKANVINEKCITQNSETIMKFLKMGNISQDIIPHHINLSTVTAEDYHLVYD IIQKVKEEEFPALHLNSCKIEEELNKAVQGTGLAFIAFTEAMTHFPASPFWSVMFFLMLV NLGLGSMFGTIEGIVTPIVDTFKVRKEILTVICCLLAFCIGLIFVQRSGNYFVTMFDDYS ATLPLLIVVILENIAVCFVYGIDKFMEDLKDMLGFAPSRYYYYMWKYISPLMLLSLLIAS VVNMGLSPPGYNAWIEDKASEEFLSYPTWGLVVCVSLVVFAILPVPVVFIVRRFNLIDDS SGNLASVTYKRGRVLKEPVNLEGDDTSLIHGKIPSEMPSPNFGKNIYRKQSGSPTLDTAP NGRYGIGYLMADIMPDMPESDL >gi568815586r:84761635_84992120|GENSCAN_predicted_CDS_3|1869_bp atgcccaaaaatagcaaggtggtaaaaagagaattagatgatgatgttactgagtctgtc aaagaccttctttccaatgaagacgcagctgatgatgcttttaagacaagtgaactaatt gttgatggccaggaagagaaagatacagatgttgaagaaggatctgaagtcgaagatgaa agaccagcttggaacagtaaactacaatacatcctggcccaagttggattttctgtaggt ttaggaaatgtgtggcgatttccatacctatgtcagaagaatgggggcgttgtagaacca gaatgtgaacaaagttctgccaccacctattactggtacagggaagcactgaatatttca agttccatttctgaaagtgggggcttaaactggaagatgaccatctgcttgttggctgcc tgggtcatggtttgcttggctatgatcaaaggcattcagtcttctggaaaagttaataca tatgatagagttactatgtcagcaactgtgttagctgattctattatgtttatttttaaa cagcttgaaataatgctggagcccaaggtctggagagaagctgctactcaagtgttcttt gccttaggtctgggatttggtggtgtcattgccttttcaagctacaacaagagagacaac aactgccactttgatgctgtcctggtgtccttcatcaattttttcacttctgtcctggca acattggtggtgtttgcagttctgggcttcaaagcaaatgtcataaatgagaaatgcatt acacaaaattcagagacgatcatgaaatttttgaaaatggggaacattagtcaggatatt attccccatcatatcaacctttcaactgttactgcagaagattatcatttagtttatgac atcattcaaaaagtgaaagaagaagagtttcctgctcttcatctcaattcctgtaaaatt gaagaagagctaaataaagctgttcaggggaccggcttagcttttattgcctttacagaa gcgatgacacattttcctgcatctcccttctggtcagtgatgtttttcctcatgctggtc aatctaggccttggcagtatgtttggaaccattgaagggattgtcacgcctattgtggac actttcaaagtgaggaaagaaattcttactgttatctgttgtcttctggcattttgtatt ggcctgatatttgtgcaacgctctggaaattactttgttacaatgtttgatgattattct gctacactgcctctgctaattgtagtcattttggagaatattgctgtatgctttgtttat ggcatagataagtttatggaagacctaaaagatatgctgggctttgctcccagcagatat tactactatatgtggaaatatatttctcctctaatgctattatcattgctaatagctagt gttgtgaatatgggattaagtcctcctggctataacgcatggattgaagataaggcatct gaagaatttctgagctatccaacatggggactggttgtttgtgtctctctggttgtcttt gcaatactcccagtccctgtagttttcattgttcgtcgcttcaaccttatagatgatagt tctggtaatttagcatctgtgacctataagagaggaagggtcctgaaagagcctgtgaac ttagagggcgatgatacaagcctcattcacggaaaaataccgagcgagatgccatctcca aattttggtaaaaatatttatcgaaaacagagtggatccccaactctggatactgctccc aatggacggtatggaatagggtacttgatggcagatattatgccagatatgccagaatct gatttgtag >gi568815586r:84761635_84992120|GENSCAN_predicted_peptide_4|365_aa MWTEVPKSGNGKKKSTPIKKDHYISKTFLREDRDIVVLRNSLIASKKGPPLRRRNSFPCP MEATAIGNHWYSLKGQELFSQLVLCTDRTGTHPSGQWAPFWPRAGPEMMSKSLGLGSGTP TPCLLLWPIVAELPPGPYPFATAAAAARTWKQQEALGQDTLTVMASAYAKGTTGGEHPQR PTRFQAKSGLVEGWGPTRESPRGKLGGLGQRETRGELEAKPRAGGRWGGESRELYSSFAS VRFSKGRRRLPRTECFHVTTVPLCLRYPVINGPKGGLSPYERHVAAGEITAPVFLRILEF ESVAAGAIYRYGQKGSEQYGLLLRGILIRCFHEEAVLALVPARCRSVVQGVPWTFVQQDM IGMSL >gi568815586r:84761635_84992120|GENSCAN_predicted_CDS_4|1098_bp atgtggactgaggtacccaagagtggcaatggcaagaagaagtccacgccaatcaagaaa gaccactacatctccaagacgttcctgcgtgaggacagagacatcgtggttctgaggaac tcgctcatcgccagcaagaaggggccacccctccgtcgacggaactcattcccctgtcct atggaggccactgccattggcaaccactggtattcacttaaaggccaagagctcttcagt cagcttgtgttgtgtaccgacaggactgggactcacccttcaggtcagtgggctcccttt tggcccagagctggtccagaaatgatgtccaagagcctaggcctgggctcaggaactcca acaccctgcttgttgctctggcccattgtggctgagctgccacctggaccctaccctttc gcgactgctgctgctgctgcccggacgtggaagcagcaagaggcgcttggtcaagacaca ctgacggtaatggcaagcgcctacgctaagggcacaactgggggtgagcatccccagaga cctactcgtttccaagccaagtcggggttggttgaagggtggggtccaacacgggaaagt ccaagaggaaagttgggagggttggggcagagagagacgcgaggtgaattggaagctaaa cctcgtgcaggtgggcgatggggcggggagagcagagagctttattcctcattcgcttcc gttcgctttagcaaaggcagaaggcgactaccgaggacagagtgttttcacgtgaccaca gttcccctttgcttgaggtatcctgtgataaatggcccgaaagggggcctttcgccgtac gaaaggcacgtggctgctggagaaatcacggctcctgtgtttctgcgcatcctggaattt gagagtgtggcggctggggccatttataggtacgggcagaaggggtctgagcagtatggc ttgctgctccgcgggatattaataagatgcttccatgaagaggcggttctggctcttgtt ccggcgcgatgtcgttctgttgttcagggtgttccttggacctttgtccagcaagatatg atagggatgtctctttag