GENSCAN 1.0 Date run: 5-Nov-116 Time: 04:06:02 Sequence gi568815586r:113363193_113571498 : 208306 bp : 49.34% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 5924 6017 94 2 1 167 94 135 0.826 21.97 1.02 Intr + 9457 9615 159 0 0 88 109 240 0.999 26.28 1.03 Intr + 11282 11382 101 1 2 131 95 129 0.998 16.81 1.04 Intr + 11601 11815 215 0 2 93 61 357 0.858 31.86 1.05 Intr + 17553 17650 98 1 2 70 105 188 0.824 18.43 1.06 Intr + 20913 21073 161 2 2 81 94 217 0.989 20.39 1.07 Intr + 21659 21754 96 2 0 116 48 144 0.956 12.32 1.08 Intr + 22020 22091 72 0 0 95 46 118 0.943 6.82 1.09 Intr + 23745 23897 153 0 0 95 92 216 0.999 21.89 1.10 Intr + 24552 24714 163 0 1 134 48 192 0.999 19.78 1.11 Term + 25267 25434 168 0 0 114 45 191 0.944 15.28 1.12 PlyA + 28355 28360 6 1.05 2.10 PlyA - 28468 28463 6 1.05 2.09 Term - 29957 29749 209 0 2 98 55 343 0.945 29.50 2.08 Intr - 30824 30700 125 1 2 76 71 171 0.999 14.43 2.07 Intr - 34200 33973 228 2 0 113 99 321 0.999 32.68 2.06 Intr - 35414 35323 92 2 2 97 84 170 0.785 16.29 2.05 Intr - 35696 35515 182 0 2 23 105 228 0.530 17.59 2.04 Intr - 35959 35920 40 1 1 100 114 -14 0.956 0.20 2.03 Intr - 36518 36352 167 0 2 117 48 317 0.652 30.28 2.02 Intr - 42253 42141 113 0 2 72 81 36 0.174 1.32 2.01 Init - 55730 55645 86 2 2 69 106 91 0.746 9.29 2.00 Prom - 57079 57040 40 -8.76 3.00 Prom + 60222 60261 40 -4.06 3.01 Init + 60872 60925 54 1 0 73 26 51 0.121 -3.32 3.02 Intr + 64770 64964 195 2 0 106 78 210 0.669 21.41 3.03 Intr + 65228 65267 40 1 1 112 60 17 0.935 -0.90 3.04 Intr + 65968 66107 140 2 2 106 116 185 0.999 23.28 3.05 Intr + 70942 71030 89 0 2 89 91 50 0.998 4.17 3.06 Intr + 72137 72364 228 2 0 122 94 287 0.999 29.68 3.07 Intr + 73559 73683 125 2 2 107 92 171 0.999 19.63 3.08 Intr + 74694 74881 188 1 2 76 41 164 0.419 9.91 3.09 Intr + 85382 85522 141 1 0 2 9 207 0.336 4.75 3.10 Term + 89109 89216 108 2 0 101 42 12 0.173 -3.59 3.11 PlyA + 89224 89229 6 1.05 4.10 PlyA - 90431 90426 6 1.05 4.09 Term - 100365 99998 368 1 2 106 48 548 0.571 47.37 4.08 Intr - 104229 104064 166 0 1 108 94 215 0.989 23.63 4.07 Intr - 105212 104935 278 0 2 84 80 644 0.996 60.44 4.06 Intr - 106153 105930 224 0 2 70 91 470 0.997 43.27 4.05 Intr - 108421 108134 288 0 0 31 110 266 0.775 19.46 4.04 Intr - 109249 109126 124 2 1 35 23 64 0.171 -5.76 4.03 Intr - 110706 110578 129 1 0 87 121 92 0.726 13.07 4.02 Intr - 120502 120464 39 2 0 114 98 14 0.484 3.20 4.01 Init - 135844 135769 76 1 1 55 65 112 0.170 4.85 4.00 Prom - 153965 153926 40 -2.16 5.08 PlyA - 154585 154580 6 1.05 5.07 Term - 177079 176894 186 1 0 63 54 129 0.738 4.49 5.06 Intr - 188012 187779 234 2 0 73 5 133 0.000 1.39 5.05 Intr - 190083 189925 159 0 0 95 35 74 0.010 2.98 5.04 Intr - 194698 194580 119 2 2 91 34 54 0.035 0.48 5.03 Intr - 197050 197015 36 2 0 72 111 34 0.151 2.33 5.02 Intr - 198785 198726 60 0 0 85 78 41 0.142 1.51 5.01 Init - 207184 207067 118 1 1 61 99 87 0.843 7.46 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 187935 187773 163 2 1 103 55 128 0.911 8.41 S.002 Init - 189986 189925 62 2 2 102 35 102 0.930 5.17 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:113363193_113571498|GENSCAN_predicted_peptide_1|493_aa XWAFLELGTSGQYNDSLQAYAAGVVEAAVSEELIYMHWMNTVVNYCGPFEYEVGYCERLK SFLEANLEWMQEEMESNPDSPYWHQVRLTLLQLKGLEDSYEGRVSFPAGKFTIKPLGFLL LQLSGDLEDLELALNKTKIKPSLGSGSCSALIKLLPGQSDLLVAHNTWNNYQHMLRVIKK YWLQFREGPWGDYPLVPGNKLVFSSYPGTIFSCDDFYILGSGLVTLETTIGNKNPALWKY VRPRGCVLEWVRNIVANRLASDGATWADIFKRFNSGTYNNQWMIVDYKAFIPGGPSPGSR VLTILEQIPGMVVVADKTSELYQKTYWASYNIPSFETVFNASGLQALVAQYGDWFSYDGS PRAQIFRRNQSLVQDMDSMVRLMRYNDFLHDPLSLCKACNPQPNGENAISARSDLNPANG SYPFQALRQRSHGGIDVKVTSMSLARILSLLAASGPTWDQVPPFQWSTSPFSGLLHMGQP DLWKFAPVKVSWD >gi568815586r:113363193_113571498|GENSCAN_predicted_CDS_1|1482_bp nngtgggccttcctggagctgggcacaagtggccaatacaatgacagcttgcaggcctat gcagccggtgtggtggaggctgctgtgtcggaggagctcatctacatgcactggatgaac acggtggtgaattactgcggccccttcgagtatgaagtcggctactgcgagaggctgaag agcttcctggaggccaacctagagtggatgcaggaagagatggagtcaaacccagactca ccttactggcaccaggtgcggctgaccctcctgcagctgaaaggcctggaggacagctac gaaggccgtgtgagcttcccagctgggaagttcaccatcaaacccttggggttcctcctg ctgcagctctctggggacctggaagacctggagctggccctgaacaagaccaagatcaaa ccttctctgggctctggctcctgttctgccctcatcaagctgctccctggccagagtgac ctcctggttgcccacaacacctggaacaactaccagcacatgctgcgtgtcatcaagaag tactggctccagttccgggaaggcccctggggggactacccgctggttcccggcaacaag ctggtcttctcctcctaccccggcaccatcttctcctgcgacgacttctacatcctgggc agtgggctggtgacactggagaccaccattggcaacaagaacccagccctgtggaagtat gtgcggcccaggggctgtgtgctggagtgggtacgcaacatcgtggccaaccgcctggcc tcggatggggccacctgggcagacatcttcaagaggttcaacagcggcacgtataacaac cagtggatgatcgtggactacaaggcgttcatcccgggtgggcccagccccgggagccgg gtgcttaccatcctggagcagatccccggcatggtggtggtggctgacaagacctcggag ctctaccagaagacctactgggccagctacaacataccgtccttcgagactgtgttcaat gccagtgggctgcaggccctagtggcccagtatggggactggttttcttatgacgggagc ccccgggcccagatcttccggcggaaccagtcactggtacaagacatggactccatggtc aggctgatgaggtacaatgacttcctccatgaccctctgtcactgtgcaaagcctgcaac ccccagcccaatggggagaatgctatctccgcccgctccgacctcaacccggccaatggc tcctaccccttccaggccctgcgtcagcgctcccatgggggtatcgatgtgaaggtgacc agcatgtcactggccaggatcctgagcctgctggcggccagcggtcccacgtgggaccag gtgcccccgttccagtggagcacctcgcccttcagcggcctgctgcacatgggccagcca gacctctggaagttcgcgcctgtcaaggtttcatgggactga >gi568815586r:113363193_113571498|GENSCAN_predicted_peptide_2|413_aa MAKVAKVARKRSKYHDVIILQNANNYLEKSGEISNLPSLPDTSETPERSCLLAIIMEARK LTFLVGRMMSGEPLHVKTPIRDSMALSKMAGTSVYLKMDSAQPSGSFKIRGIGHFCKRVR DRWAKQGCAHFVCSSDAPALTRSPPPTLAAGNAGMAAAYAARQLGVPATIVVPSTTPALT IERLKNEGATVKVVGELLDEAFELAKALAKNNPGWVYIPPFDDPLIWEGHASIVKELKET LWEKPGAIALSVGGGGLLCGVVQGLQEVGWGDVPVIAMETFGAHSFHAATTAGKLVSLPK ITSVAKALGVKTVGAQALKLFQEHPIFSEVISDQEAVAAIEKFVDDEKILVEPACGAALA AVYSHVIQKLQLEGNLRTPLPSLVVIVCGGSNISLAQLRALKEQLGMTNRLPK >gi568815586r:113363193_113571498|GENSCAN_predicted_CDS_2|1242_bp atggcaaaagttgcaaaggtggcacgtaagcgttcaaagtaccatgatgtcattatatta cagaatgctaacaactacctggaaaaaagcggggaaatctcaaacttgccttccctgccg gacacaagcgaaactccagaaaggagctgcctgcttgccatcatcatggaagccagaaaa ctcaccttccttgttggaagaatgatgtctggagaacccctgcacgtgaagacccccatc cgtgacagcatggccctgtccaaaatggccggcaccagcgtctacctcaagatggacagt gcccagccctccggctccttcaagatccggggcattgggcacttctgcaagagggtacgg gaccggtgggccaagcaaggctgtgcacattttgtctgctcctcggatgctcccgcactg acacgctctccacccccgaccctggcagcgggcaacgcaggcatggcggctgcatatgcg gccaggcaactcggcgtccccgccaccatcgtggtgcccagcaccacacctgctctcacc attgagcgcctcaagaatgaaggtgccacagtcaaggtggtgggtgagttattggatgaa gccttcgagctggccaaggccctagcgaagaacaacccgggttgggtctacattcccccc tttgatgaccccctcatctgggaaggccacgcttccatcgtgaaagagctgaaggagaca ctgtgggaaaagccgggggccatcgcgctgtcagtgggcggcgggggcctgctgtgtgga gtggtccaggggctgcaggaggtgggctggggggacgtgcctgtcatcgccatggagact tttggtgcccacagcttccacgctgccaccaccgcaggcaaacttgtctccctgcccaag atcaccagtgttgccaaggccctgggcgtgaagactgtgggggctcaggccctgaagctg tttcaggaacaccccattttctctgaagttatctcggaccaggaggctgtggccgccatt gagaagttcgtggatgatgagaagatcctggtggagcccgcctgcggggcagccctggcc gctgtctatagccacgtgatccagaagctccaactggaggggaatctccgaaccccgctg ccatccctcgtggtcatcgtctgcgggggcagcaacatcagcctggcccagctgcgggcg ctcaaggaacagctgggcatgacaaataggttgcccaagtga >gi568815586r:113363193_113571498|GENSCAN_predicted_peptide_3|435_aa MARSRLTATSPHRNLASQAVYLVSRMDGPVAEHAKQEPFHVVTPLLESWALSQVAGMPVF LKCENVQPSGSFKIRGIGHFCQEMAKKGCRHLVCSSGGNAGIAAAYAARKLGIPATIVLP ESTSLQVVQRLQGEGAEVQLTGKVWDEANLRAQELAKRDGWENVPPFDHPLIWKGHASLV QELKAVLRTPPGALVLAVGGGGLLAGVVAGLLEVGWQHVPIIAMETHGAHCFNAAITAGK LVTLPDITSVAKSLGAKTVAARALECMQVCKIHSEVVEDTEAVSAVQQLLDDERMLVEPA CGAALAAIYSGLLRRLQAEGCLPPSLTSVVVIVCGGNNINSRELQALKTHLGQRKTRRRR KEEKKKKKKKKKKKKKKKRKRKRKRKKKKEKKKNRSPHFKNISSVRTGTKIYSTYTEPRE SRECLVYTKPLQNVC >gi568815586r:113363193_113571498|GENSCAN_predicted_CDS_3|1308_bp atggcgcgatctcggctcactgcaacctcgcctcaccgcaacctcgcctctcaggctgtc tacctggtctccagaatggacggccctgtggcagagcatgccaagcaggagccctttcac gtggtcacacctctgttggagagctgggcgctgtcccaggtggcgggcatgcctgtcttc ctcaagtgtgagaatgtgcagcccagcggctccttcaagattcggggcattgggcatttc tgccaggagatggccaagaagggatgcagacacctggtgtgctcctcagggggtaatgcg ggcatcgctgctgcctatgctgctaggaagctgggcattcctgccaccatcgtgctcccc gagagcacctccctgcaggtggtgcagaggctgcagggggagggggccgaggttcagctg actggaaaggtctgggacgaggccaatctgagggcgcaagagttggccaagagggacggc tgggagaatgtccccccgtttgaccaccccctaatatggaaaggccacgccagcctggtg caggagctgaaagcagtgctgaggaccccaccaggtgccctggtgctggcagttgggggt gggggtctcctggccggggtggtggctggcctgctggaggtgggctggcagcatgtaccc atcattgccatggagacccatggggcacactgcttcaatgcggccatcacagccggcaag ctggtcacacttccagacatcaccagtgtggccaagagcctgggtgccaagacggtggcc gctcgggccctggagtgcatgcaggtgtgcaagattcactctgaagtggtggaggacacc gaggctgtgagcgctgtgcagcagctcctggatgatgagcgtatgctggtggagcctgcc tgtggggcagccttagcagccatctactcaggcctcctgcggaggctccaggccgagggc tgcctgcccccttccctgacttcagttgtggtaatcgtgtgtggaggcaacaacatcaac agccgagagctgcaggctttgaaaacccacctgggccagaggaagactagaagaagaaga aaagaagaaaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaggaag aggaagaggaagaggaagaagaagaaggagaagaaaaaaaacagatcccctcatttcaaa aatattagctctgtgaggactgggaccaaaatctattctacttacacggaacccagggag tccagggagtgcctagtgtacactaagccgttacaaaacgtttgctga >gi568815586r:113363193_113571498|GENSCAN_predicted_peptide_4|563_aa MLPGQWYGGALPLASLALWSLIPPVCNRRALEGDELLNTRSPPLDTVGDSARPERKGDKW ASLSPNTGIHKSCSLNLRIHKALSMRKARLGPSPVYDKKKPGPAPGAGRTALTQLQGAEG ARGLALQLLRQEAKGSGGPGRPEGQGPKGRARRPKPPGRGAMMVHCAGCERPILDRFLLN VLDRAWHIKCVQCCECKTNLSEKCFSREGKLYCKNDFFRRFGTKCAGCAQGISPSDLVRK ARSKVFHLNCFTCMVCNKQLSTGEELYVIDENKFVCKDDYLSSSSLKEGSLNSVSSCTDR SLSPDLQDALQDDPKETDNSTSSDKETANNENEEQNSGTKRRGPRTTIKAKQLETLKAAF AATPKPTRHIREQLAQETGLNMRVIQVWFQNRRSKERRMKQLSALGARRHAFFRSPRRMR PLGGRLDESEMLGSTPYTYYGDYQGDYYAPGSNYDFFAHGPPSQAQSPADSSFLAASGPG STPLGALEPPLAGPHAADNPRFTDMISHPDTPSPEPGLPGTLHPMPGEVFSGGPSPPFPM SGTSGYSGPLSHPNPELNEAAVW >gi568815586r:113363193_113571498|GENSCAN_predicted_CDS_4|1692_bp atgctgcctggccagtggtatggcggggcccttcccctggcctccctagccctctggtct ctcatccctcctgtctgtaatagaagagctcttgaaggagatgagctcttaaacacaaga tcacctcccctagatactgtgggggacagtgctaggcctgagaggaagggggacaagtgg gcatccctgagccccaacacaggcatccacaaatcctgctccctcaacctccgaatccac aaagctctctccatgagaaaggcccggctgggccccagccccgtgtatgacaaaaagaag ccggggccggctccaggagccgggagaaccgcgctgacgcagcttcagggcgcagagggg gcgcggggactcgccctgcagctgctgagacaagaggcgaagggcagcggagggcccggc aggcccgagggccaggggcccaaagggagggcaaggcggccgaagccgccggggcgcggg gctatgatggtgcactgcgccggttgcgagcggcccatcctcgaccgctttctgctgaac gtgctggaccgcgcgtggcacatcaaatgtgttcagtgctgcgagtgcaaaaccaacctc tcggagaagtgcttctcgcgcgagggcaagctctactgcaaaaatgactttttcaggcgc tttggcacgaaatgcgccggctgcgcgcaaggcatctcgcccagcgacctggtgcgcaag gcccggagcaaagtctttcacctcaactgtttcacctgcatggtgtgtaacaagcagctg tccaccggcgaggagctctacgtcatcgacgagaacaagttcgtgtgcaaagacgactac ctgagctcatccagcctcaaggagggcagcctcaactcagtgtcatcctgtacggaccgc agtttgtccccggacctccaggacgcactgcaggacgaccccaaagagacggacaactcg acctcgtcggacaaggagacggccaacaacgagaacgaggagcagaactcgggcaccaag cggcgcggcccccgcaccaccatcaaggccaagcagctggagacgctcaaggctgccttc gccgccacgcccaagcccacgcgccacatccgcgagcagctggcgcaggagaccggcctc aacatgcgcgtcatccaggtgtggtttcagaaccgacggtccaaagaacgccggatgaaa cagctgagcgccctaggcgcccggaggcacgccttcttccggagtccgcggcgcatgcgt ccgctgggcggccgcttggacgagtctgagatgttggggtccaccccgtacacctactac ggagactaccaaggcgactactacgcgccgggaagcaactacgacttcttcgcgcacggc ccgccttcgcaggcgcagtccccggccgactccagcttcctggcggcctctggccccggc tcgacgccgctgggagcgctggaaccgccgctcgccggcccgcacgccgcggacaacccc aggttcaccgacatgatctcgcacccggacacaccgagccccgagccaggcctgccgggc acgctgcaccccatgcccggcgaggtattcagcggcgggcccagcccgcccttcccaatg agcggcaccagcggctacagcggacccctgtcgcatcccaaccccgagctcaacgaagcc gccgtgtggtaa >gi568815586r:113363193_113571498|GENSCAN_predicted_peptide_5|303_aa MLVLMIISNFEELNIFETWKCSQLSEPELMEAIGLIHQPVHPYAKTGFQDNLRNSPGMRA SAPLTYESSDPGHCNEYAKAARTFKQLLTITEGSCGVSGVEYSGCGKQPHKDIIEQRQHR RIAQVALRSAPNALTSGCRRPGRNGEGACLPPPSDAASPGCKTRNVEGVKFNQIQAEASR RLIASSLPLRVLPSAYEETQTSLLEDERQNRPTIDQSHGPSKVPDVKEQSHVSKAGPQLS RDSNGSLGTLVLGFPVSSFDILDGILTGPGHRKSRQPSPAARALYSFLIPFLIPGLSEEE REK >gi568815586r:113363193_113571498|GENSCAN_predicted_CDS_5|912_bp atgttggtgcttatgattatcagcaattttgaagagctgaatatttttgaaacctggaaa tgcagtcagttgtctgagcctgagctcatggaagccattgggctgattcaccagccagtg cacccatatgccaagactggcttccaggacaatctcagaaacagccctgggatgagagca tctgcaccactcacctatgagagcagtgacccaggacactgtaatgaatatgctaaagca gccagaacatttaagcaattgctgactattactgagggcagctgtggtgtttcaggagtt gagtacagtggttgtggcaagcagcctcataaggacatcattgagcagagacaacacaga agaattgctcaagttgctcttcgcagtgccccaaatgccttaacttcaggatgcagacgc ccaggccgcaatggggaaggtgcttgtcttcctcctccttcggatgctgcatctcctggc tgcaaaaccaggaatgtggagggagtgaagttcaaccagatccaagctgaggcctcaaga cgcctcatagcttcttctctccctctcagagtcctgccatcagcatatgaagaaacccag accagcctactggaggatgagagacaaaatagaccaaccatagaccagtcccatggcccc agcaaggtcccagatgtgaaagaacaaagccacgtcagcaaagctggaccacagctgtcc agagattctaatgggagccttggaaccctggttctgggattccctgtgtcttcctttgac atcctggatggtatcctcacaggacctggacacagaaagtcccgtcaacccagcccagct gccagagctctctactccttcctcatccctttcctgatcccagggttatcagaagaagaa agagagaaatga