GENSCAN 1.0    Date run: 8-Nov-116    Time: 00:07:55

Sequence gi568815596r:38481875_38702601 : 220727 bp : 42.48% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Sngl +     94    930  837  0  0   35   37  1158 0.769 100.88
 1.02 PlyA +    978    983    6                               1.05

 2.05 PlyA -   5878   5873    6                               1.05
 2.04 Term -   7659   7494  166  2  1   74   45   177 0.505   8.41
 2.03 Intr -   9120   9048   73  1  1   64  116     0 0.248  -2.05
 2.02 Intr -  14604  14401  204  1  0  104   87   128 0.699  12.55
 2.01 Init -  20113  20017   97  1  1   79   74    65 0.526   2.72
 2.00 Prom -  24034  23995   40                              -5.95

 3.00 Prom +  24813  24852   40                              -5.75
 3.01 Init +  38204  38312  109  1  1   64   57    75 0.617   2.43
 3.02 Term +  38868  39016  149  1  2   71   41   178 0.622   8.48
 3.03 PlyA +  39392  39397    6                               1.05

 4.17 PlyA -  39774  39769    6                               1.05
 4.16 Term -  50725  50606  120  1  0   76   48    98 0.764   2.09
 4.15 Intr -  51675  51508  168  0  0    9   14   178 0.443   1.72
 4.14 Intr -  53885  53732  154  1  1   89   56    98 0.540   5.85
 4.13 Intr -  69912  69834   79  1  1   14   99    59 0.004  -2.71
 4.12 Intr -  86423  86325   99  0  0   97   91    54 0.821   5.76
 4.11 Intr -  86569  86512   58  1  1   78  111    64 0.970   5.24
 4.10 Intr -  87460  87259  202  0  1   39  102    53 0.801   0.17
 4.09 Intr -  88051  87930  122  1  2  109   19    89 0.823   2.47
 4.08 Intr -  91553  91336  218  0  2   89  115    80 0.760   8.10
 4.07 Intr -  95658  95587   72  1  0   65  115    59 0.142   4.76
 4.06 Intr - 100111 100039   73  1  1   52   92    30 0.217  -2.14
 4.05 Intr - 104007 103770  238  2  1  121   68   256 0.979  23.59
 4.04 Intr - 109719 109656   64  1  1   23   91    88 0.114  -0.54
 4.03 Intr - 120726 120503  224  2  2  122   58   354 0.444  32.65
 4.02 Intr - 123929 123802  128  2  2   88   47    54 0.309  -0.14
 4.01 Init - 133153 133037  117  1  0   60   53   103 0.349   4.25
 4.00 Prom - 158010 157971   40                              -3.05

 5.00 Prom + 158189 158228   40                              -5.55
 5.01 Sngl + 161309 161746  438  1  0   73   38   218 0.769   9.32
 5.02 PlyA + 164151 164156    6                               1.05

 6.00 Prom + 164310 164349   40                              -8.55
 6.01 Init + 166711 166893  183  0  0   80   17   221 0.917  13.36
 6.02 Term + 168637 168960  324  0  0   77   45   237 0.766  12.08
 6.03 PlyA + 170675 170680    6                               1.05

 7.00 Prom + 170954 170993   40                              -6.75
 7.01 Init + 184288 184477  190  0  1   39   98   193 0.680  14.63
 7.02 Intr + 194038 194192  155  2  2  119  106   134 0.959  17.17
 7.03 Intr + 199406 199612  207  1  0   50  111   212 0.357  18.05
 7.04 Intr + 207939 208020   82  2  1   89   77    95 0.674   6.79

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  34031  33910  122  2  2   69   84    77 0.905   5.11

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596r:38481875_38702601|GENSCAN_predicted_peptide_1|278_aa
MQQIRMSLRGKVVVLMGKNTMMRKAIRGHLENNPALEKLLPHIWGNVGFVFTKEDLTEIR
DMLLANKVPAAARAGAIAPCEVTVPAQNTGLGPEKTSFFQALGITTKISRGTIEILSDVQ
LIKTGDKVGASEATLLNMLNISPFSFGLVIQQVFDNGSIYNPEVLDKTEETLHSRFLEGV
RNVASVCLQTGYPTVASVPHSIINGYKRVLALSVETDYTFPLAENVKAFLADPSAFVAAA
PVAADTTAAPAAAAAPAKVEAKEESEESDEDMGFGLFD

>gi568815596r:38481875_38702601|GENSCAN_predicted_CDS_1|837_bp
atgcagcagatccgcatgtcccttcgcgggaaggtcgtggtgctgatgggcaagaacacc
atgatgcgcaaggccatccgagggcacctggaaaacaacccagctctggagaaactgctg
cctcatatctgggggaatgtgggctttgtgttcaccaaggaggacctcactgagatcagg
gacatgttgctggccaataaggtgccagctgctgcccgtgctggtgccattgccccatgt
gaagtcactgtgccagcccagaacactggtctcgggcccgagaagacctcctttttccag
gctttaggtatcaccactaaaatctccaggggcaccattgaaatcctgagtgatgtgcag
ctgatcaagactggagacaaagtgggagccagcgaagccacgctgctgaacatgctcaac
atctcccccttctcctttgggctggtcatccagcaggtattcgacaatggcagcatctac
aaccctgaagtgcttgataaaacagaggaaactctgcattctcgcttcctggagggtgtc
cgcaatgttgccagtgtctgtctgcagactggctacccaactgttgcatcagtaccccat
tctatcatcaacgggtacaaacgagtcctggccttgtctgtggagacggattacaccttc
ccacttgctgaaaatgtcaaggccttcttggctgatccatctgcctttgtggctgctgcc
cctgtggctgctgacaccacagctgctcctgctgctgctgcagccccagctaaggttgaa
gccaaggaagagtcggaggagtcggacgaggatatgggatttggtctctttgactaa

>gi568815596r:38481875_38702601|GENSCAN_predicted_peptide_2|179_aa
MARGWRGGLVMAYSLLDMLGLQCPWEFKVKVSRQETEALGKVEVARGLTVNCARIGGQEP
SLLEEDPPRIPSRMPRSRASASLLWAQEGCGSPQNVLLAREVHSLWQSLTHMGWLSQNCT
ESCESLGKTGGVSPGPSPRSPEGGLGECGAVGDLKANRDEVPSGALNVGLMRSVLQKDE

>gi568815596r:38481875_38702601|GENSCAN_predicted_CDS_2|540_bp
atggcgaggggatggcggggagggctggtgatggcatattcacttttggacatgttgggg
ttgcagtgtccatgggagttcaaggtgaaggtgtccaggcaggaaactgaagctcttggg
aaggtggaagttgcacgtggtctcacagtcaactgtgccagaataggaggccaggaacca
tctctgctggaggaagacccaccacgcatcccttcaagaatgccaaggtccagagcaagt
gcttcccttctatgggcacaggagggctgtgggagcccacagaatgtgcttctggccaga
gaggtccatagtttatggcagtctttgacccatatgggttggctatcacaaaactgcaca
gaatcctgtgaaagcttgggaaagacaggtggagtctcccctggccccagccccaggtcc
ccagaaggtggtctgggggagtgtggagcagttggagatttgaaggcaaatcgtgatgag
gtccccagtggagcactgaatgtggggttgatgagaagcgtccttcagaaggatgaatag

>gi568815596r:38481875_38702601|GENSCAN_predicted_peptide_3|85_aa
MEERLSLNGGRRGRLRRREQQEDKQGREGPKKLTSPGSISWCGHTAIASKKHAVHGNYYD
SKHLLAELKLFLSDRLLVNGKAVPL

>gi568815596r:38481875_38702601|GENSCAN_predicted_CDS_3|258_bp
atggaggaaagactgtcactaaatggagggagaagagggaggttacgtaggagagagcag
caagaagataaacagggaagggaaggaccaaagaagctgacatctcctggctccatcagc
tggtgtggccacacagctatagctagcaagaagcacgcagtccacgggaactattatgat
tctaaacaccttctggcagaactcaaactttttctaagtgatcgcctcttagtcaacgga
aaggctgtccccttatag

>gi568815596r:38481875_38702601|GENSCAN_predicted_peptide_4|711_aa
MGGASGIQGAGVKGAGKHSTTYSSLSAKNYLDQNVNGAKACPKPVSYSSIAGVPFELWQW
ELLAAAYLTAANSIKCSENSQRETYEEDREYESQAKRLKTEEGEIDYSAEEGENRREATP
RGGGDGGGGGRSFSQPVTKRGPPVRCAWLLPGKQGLGLCESVVEADLVEALEKFGTICYV
MMMPFKRQALVEFENIDSAKECVTFAADEPVYIAGQQAFFNYSTSKRITRPGNTDDPSGG
NKVLLLSIQNPLYPITVPTRLNVIRNDNDSWDYTKPYLGRRDRGKGRQRQAILGEHPSSF
RHDGYGSHGPLLPLPSRYRMGSRDTPELVAYPLPQASSSYMHGGNPSGSVVMVSGLHQLK
MNCSRVFNLFCLYGNIEKVKFMKTIPGTALVEMGDEYAVERAVTHLNNVKLFGKRLNVCV
SKQHSVVPSQIFELEDGTSSYKDFAMSKNNRFTSAGQASKNIIQPPSCVLHYYNVPLCVT
EETFTKLCNDHEVLTFIKYKVFDAKPSAKTLSGLLEWECKTDAVEALTALNHYQIRVPSR
KRGLQTKECGQLLETGNSPQLTASSPLTLVKNACADPQSLVLPSCSLVLGKGGTTVETEE
YFGGLAGGRVFTVVQKIGSHSKQLLASISELQYSQLHWPVQMQIQVLRMAAQLPTPSSKH
HGTDSSDQAKGNHTLEGWLYRGEKMMGKEIKPKLESQTDPVCMGRQGVDET

>gi568815596r:38481875_38702601|GENSCAN_predicted_CDS_4|2136_bp
atgggtggcgctagtggtatccagggggcaggggtcaagggtgctggtaaacattctaca
acgtacagctccctctcagcaaagaattatctggaccaaaatgtcaatggtgccaaggcc
tgccctaagcctgtctcctactccagtatagctggagtcccctttgagctgtggcaatgg
gagctcttggctgctgcctacttgacagcagcgaattcaatcaaatgttcagaaaatagt
caaagggagacgtacgaggaggaccgggagtacgagagccaggccaagcgtctcaagacc
gaggagggggagatcgactactcggccgaggaaggcgagaaccgccgggaagcgacgccc
cggggcgggggcgatggcggcggcggcggccggagcttctctcagccggtaacaaagcgc
ggcccccctgtccgctgtgcctggctgctccccgggaagcagggcctcggactctgtgaa
tctgtggtggaagcagacctcgtggaagcgctggaaaaatttgggacaatatgctatgtg
atgatgatgccatttaaacgacaggctctagtggaatttgaaaacatagatagtgccaaa
gaatgtgtgacatttgctgcagatgaacccgtgtacattgctggtcaacaggcttttttc
aactattctacaagcaaaaggatcactcggccaggaaatactgatgatccatcaggaggc
aacaaagttcttctgctctcaattcagaatccgctttatccaattacagtgccaactcgt
ctaaatgttattaggaatgacaatgacagttgggactacactaaaccatatttgggaaga
cgagatagaggaaagggtcgccagagacaagccattttgggagaacacccttcttcgttt
agacatgatggctatggatcccatggtccattattgcctttaccaagtcgttacagaatg
ggctctcgagatacacctgaacttgttgcttatccattaccacaggcttcttcctcttac
atgcatggaggaaatccctctggttcagttgtaatggttagtggattacatcaactaaaa
atgaattgttcaagagtcttcaacctgttctgcttatatggaaatattgagaaggtaaaa
tttatgaagaccattcctggtacagcactggtagaaatgggtgatgagtatgctgtagaa
agagctgtcacacaccttaataatgtcaaattatttgggaaaagacttaatgtttgcgtg
tctaaacaacattcagttgttccaagtcaaatatttgagctggaggatggtaccagcagc
tacaaagattttgcaatgagcaaaaataatcgctttacaagtgctggccaagcatctaag
aatataatccagccaccctcctgtgttttgcattattataatgttccattgtgtgtcaca
gaagagaccttcacaaagttgtgtaatgaccatgaagttcttacattcatcaaatataaa
gtgtttgatgcaaaaccttcagccaaaacactttctgggctattagaatgggagtgcaaa
actgatgcagtagaagcccttacggcactgaatcactatcagataagagtgccgagtaga
aaaagggggctgcaaaccaaggaatgtgggcagcttctagaaactggaaacagtcctcag
ctgacagccagcagccctcttacgcttgtgaagaacgcatgtgctgacccccagagcctg
gtcctgccgagctgctccctggtgctggggaaaggtggcacaactgtagagacagaagag
tattttgggggcctggcagggggcagagtgttcacggtggtccagaagattggcagtcat
tccaaacagctcctggccagcatcagtgaacttcagtactctcagcttcactggcctgtg
caaatgcaaatacaggtcctgagaatggctgcacagctgcctacaccctcctcaaagcat
cacgggacagacagctcagaccaagccaaaggcaaccacacactggagggctggctctac
agaggggagaagatgatgggtaaagagatcaagcctaagctagagtcacagacagaccca
gtctgcatgggtagacagggagttgatgagacttga

>gi568815596r:38481875_38702601|GENSCAN_predicted_peptide_5|145_aa
MAGRKSRALPREEAAEAQRALCGQASTAGGPRAPSAAAIPGAKPLTAWGWRCRPAARSAG
PVEPAPTWNSRWPACAACSLSSRLHFFLHTSPQAEGDGSGLGHPREGLPQCSGRLKVSSS
AASVGTKAEEAPRASEGCQYAVTSR

>gi568815596r:38481875_38702601|GENSCAN_predicted_CDS_5|438_bp
atggcaggccgcaagtccagagccctgccccgcgaggaggcagctgaggcccagcgagcg
ctgtgcgggcaggccagcactgctgggggacccagggcaccctccgcagctgctatcccg
ggtgctaagcccctcactgcctggggctggcggtgccggccggctgctcggagtgcgggg
cccgttgagcccgcgcccacctggaactcgcgctggcccgcgtgcgccgcgtgcagcctc
agttcccgcctgcacttcttcctccacacctccccacaagcagagggagatggctccggc
cttggccatcccagagaggggctcccacagtgcagcggcaggctgaaggtctcctcaagc
gcggccagtgtgggtaccaaggccgaggaggcgccaagagcaagcgagggctgccagtac
gctgtcacctctcggtag

>gi568815596r:38481875_38702601|GENSCAN_predicted_peptide_6|168_aa
MHCQGCAGPRDQCRTCRTFSSRPHANCYRLQLQTSRAHRKQVDKGERRRGTQKGPVALPA
VPISLYTSPEQKGCSPTQPSLALQTGLRTTEPSRGHSPHQKKVKLASTKLDSALDVEVPT
ISRTFGQALPQGSESSIIYRETGTHSAGLLIPQGGFEKVVLQEAASGS

>gi568815596r:38481875_38702601|GENSCAN_predicted_CDS_6|507_bp
atgcattgccaaggctgtgcaggaccacgggaccagtgccggacctgcaggaccttcagt
tcacgaccacatgctaactgctacaggctgcagcttcagaccagcagagcacacagaaaa
caggtggacaaaggagagagaaggcgggggacccagaaggggcccgtggctcttcctgct
gtgcccatctccctctataccagccctgaacagaagggctgttctcctacccagccctcc
ctggccttgcagacaggacttaggaccacagagccgtccaggggacattcaccacatcag
aagaaggtcaaattggcatcaaccaagctagattcagcattagatgtggaggttccaaca
atctccagaacctttggccaggctctacctcagggatctgagagcagcataatctacaga
gagacagggacacattcagcaggactgctgattccccagggaggctttgagaaagtagtg
cttcaggaagcagcatctgggagctga

>gi568815596r:38481875_38702601|GENSCAN_predicted_peptide_7|212_aa
MASVTRAVFGELPSGGGTVEKFQLQSDLLRVDIISWGCTITALEVKDRQGRASDVVLGFA
ELEGYLQKQPYFGAVIGRVANRIAKGTFKVDGKEYHLAINKEPNSLHGGVRGFDKVLWTP
RVLSNGVQFSRISPDGEEGYPGELKVWVTYTLDGGELIVNYRAQASQATPVNLTNHSYFN
LAGQASPNINDHEVTIEADTYLPVDETLIPTX

>gi568815596r:38481875_38702601|GENSCAN_predicted_CDS_7|636_bp
atggcttcggtgaccagggccgtgtttggagagctgccctcgggaggagggacagtggag
aagttccagctgcagtcagacctcttgagagtggacatcatctcctggggctgcacgatc
acagccctagaggtcaaagacaggcaggggagagcctcggacgtggtgcttggcttcgcc
gagttggaaggatacctccaaaagcagccatactttggagcagttattgggagggtggcc
aaccgaatcgccaaaggaaccttcaaggtggatgggaaggagtatcacctggccattaac
aaggaacccaacagtctgcatggaggagtcagagggtttgataaagtgctctggacccct
cgggtgctgtcaaatggcgtccagttctcgcgcatcagtccagatggtgaagaaggctac
cccggagagttaaaagtctgggtgacatacaccctggatggcggagagctcatagtcaac
tacagagcacaagccagtcaggccacaccagtcaacctgaccaaccattcttacttcaac
ctggcaggccaggcttccccaaatataaatgaccatgaagtcaccatagaagcggatact
tatttgcctgtggatgaaaccctgattcctacagnn