GENSCAN 1.0 Date run: 7-Nov-116 Time: 16:43:52 Sequence gi568815588r:70332939_70541667 : 208729 bp : 49.51% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.06 Intr - 3343 3264 80 1 2 46 100 102 0.233 5.55 1.05 Intr - 4241 4129 113 0 2 97 47 3 0.232 -2.80 1.04 Intr - 7764 7615 150 1 0 93 80 231 0.862 22.93 1.03 Intr - 27532 27409 124 1 1 15 64 157 0.193 6.16 1.02 Intr - 38578 38439 140 2 2 67 78 32 0.246 0.28 1.01 Init - 43595 43514 82 2 1 47 92 111 0.479 8.53 1.00 Prom - 45007 44968 40 -6.86 2.00 Prom + 55465 55504 40 -3.26 2.01 Sngl + 56488 57510 1023 0 0 52 48 361 0.505 25.57 2.02 PlyA + 57672 57677 6 1.05 3.00 Prom + 70174 70213 40 -6.96 3.01 Init + 71464 71608 145 0 1 80 83 167 0.176 15.80 3.02 Intr + 86976 87161 186 1 0 101 91 72 0.759 8.46 3.03 Term + 90084 90157 74 1 2 127 54 6 0.627 -0.83 3.04 PlyA + 93041 93046 6 1.05 4.04 PlyA - 93507 93502 6 1.05 4.03 Term - 100150 99998 153 1 0 115 45 122 0.558 8.52 4.02 Intr - 103045 102348 698 2 2 92 75 559 0.992 46.21 4.01 Init - 108729 108537 193 0 1 98 102 269 0.993 26.43 4.00 Prom - 113762 113723 40 -6.76 5.00 Prom + 122074 122113 40 -5.96 5.01 Init + 125458 125569 112 0 1 79 100 127 0.609 11.41 5.02 Intr + 126932 127020 89 0 2 82 -23 118 0.386 -0.21 5.03 Term + 127402 127500 99 0 0 85 43 82 0.787 1.53 5.04 PlyA + 128125 128130 6 1.05 6.00 Prom + 132356 132395 40 -6.46 6.01 Init + 145977 146121 145 2 1 64 86 195 0.955 15.18 6.02 Intr + 147676 147790 115 2 1 104 37 43 0.539 0.31 6.03 Intr + 149002 149195 194 1 2 84 68 59 0.376 2.64 6.04 Intr + 152981 153160 180 0 0 84 66 37 0.028 0.94 6.05 Term + 162181 162275 95 2 2 122 42 51 0.607 1.89 6.06 PlyA + 162767 162772 6 1.05 7.06 PlyA - 163331 163326 6 1.05 7.05 Term - 163904 163817 88 1 1 85 44 52 0.386 -2.37 7.04 Intr - 164585 164365 221 2 2 38 94 106 0.100 3.30 7.03 Intr - 171014 170921 94 1 1 41 101 88 0.199 5.37 7.02 Intr - 171125 171080 46 0 1 135 57 22 0.272 1.27 7.01 Init - 176148 175995 154 0 1 89 62 65 0.756 4.16 7.00 Prom - 178624 178585 40 -6.56 8.00 Prom + 178786 178825 40 -10.25 8.01 Init + 180947 181027 81 1 0 86 99 128 0.784 12.70 8.02 Intr + 182430 182649 220 2 1 8 44 162 0.348 1.67 8.03 Intr + 190191 190271 81 1 0 99 78 66 0.088 6.31 8.04 Intr + 193057 193198 142 2 1 3 94 153 0.021 6.81 8.05 Intr + 196291 196393 103 1 1 131 63 240 0.995 25.98 8.06 Intr + 196951 197130 180 0 0 93 110 178 0.981 20.56 8.07 Intr + 198352 198516 165 0 0 77 78 253 0.857 23.36 8.08 Intr + 199683 199843 161 2 2 99 81 289 0.977 28.09 8.09 Intr + 200057 200132 76 2 1 105 60 77 0.993 6.12 8.10 Intr + 200984 201135 152 1 2 90 70 142 0.676 11.56 8.11 Intr + 201487 201586 100 1 1 66 105 85 0.998 8.11 8.12 Intr + 201801 201905 105 2 0 75 94 79 0.993 7.61 8.13 Intr + 204873 204968 96 2 0 111 75 145 0.999 15.71 8.14 Intr + 205342 205553 212 0 2 60 86 217 0.536 16.41 8.15 Intr + 205914 206100 187 0 1 58 31 246 0.318 15.59 8.16 Intr + 206154 206309 156 2 0 82 78 159 0.993 14.51 8.17 Intr + 206642 206824 183 1 0 80 78 162 0.974 14.38 8.18 Intr + 208164 208304 141 2 0 99 70 242 0.557 24.05 8.19 Intr + 208525 208596 72 0 0 55 99 89 0.945 6.30 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 145151 144944 208 2 1 88 97 111 0.899 9.80 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588r:70332939_70541667|GENSCAN_predicted_peptide_1|230_aa MLKKMGEAVARVARKVNETVESGSDTLGSWLCSVHGDWLLKNNRDTEEQDLKRPEARDST THIPGSPQAPSTQQSETLSQKKRNEEQEKEEEEEEGEEEEGDGEEEEEEESNEEPDLAEC KLVSFPIGIYKVLRNVSGQIHLITLANNELKSLTSKFMTTFSQLREVGCLSAGERTSCFL PACHNVSWGCPGDSSARPKALAGSLTSLPVNSIQQIPTSIGFGVAVGMHG >gi568815588r:70332939_70541667|GENSCAN_predicted_CDS_1|690_bp atgctgaagaagatgggtgaggccgtggccagagtagcaaggaaggtcaacgagacggtg gagagcggctctgacactctgggctcctggctctgcagtgtgcatggggactggctttta aagaataatcgtgacacagaggagcaggatttgaaaaggcctgaggcccgagactctact acacacatccctgggagcccccaggcacctagtacacagcagagtgagaccctgtctcaa aagaaaagaaatgaagagcaagagaaggaggaagaagaggaggagggggaggaggaggag ggagatggggaggaggaggaggaggaggaaagcaatgaggaaccagacctggccgagtgc aagctggtctcctttcccattggcatctacaaggtcctgcggaatgtctctggccagatc cacctcatcaccctggctaacaacgagcttaagtccctcaccagcaagttcatgaccaca ttcagtcagctccgagaggtgggctgccttagtgctggggagagaacaagctgcttcctc cctgcctgccacaatgtctcctggggctgccctggggactcttctgcaaggcccaaggcc ctggcagggtctctcacatccctgcccgtcaactcaatccagcagattcccacaagcatt gggttcggtgtggctgtgggaatgcacggn >gi568815588r:70332939_70541667|GENSCAN_predicted_peptide_2|340_aa MDSELMHSIVGSYHKPPERVFVPSFTQNEPSQNCHPANLEVTSPKILHSPNSQALILALK TLQEKIHCLELERTQAEDDLNILSREAAQYKKALENETNERNLAHQELIKQKKDVSIQLS SAQSRCTLLEKQLEYTKRMVLNVEREKNMILEQQAQLQREKEQDQMKLYAKLDKLDVLEK ECFRLTTTQKTAEDKIKHLEEKLKEEEHQRKLFQDKASELQTGLEISKIMSSVSNLKHSK EKKKSSKKTKCIKRGPPWQICSKFGALPFVAEKMRQHRDPHILQKPFNVTETRCLPKPSR TTAWCKAITPDSEKSISICDNLSELLMAMQDELDQMSMEH >gi568815588r:70332939_70541667|GENSCAN_predicted_CDS_2|1023_bp atggattctgaattaatgcatagtatagtaggaagctatcataaacctccagaaagagta tttgttccctcattcacccagaatgaaccatctcagaattgccatcctgcgaacttagaa gttacctctcctaagatacttcatagcccaaatagccaagctcttattttagccttaaaa actcttcaggaaaaaattcattgtttagagctggagagaacacaagctgaagatgacctg aacattctttccagagaagcagcacagtataagaaggccttagagaatgaaacaaatgag agaaatctagcacatcaggagctgataaagcagaaaaaagatgtaagtatacagttaagc tcagcccagtctcgttgcactcttctagagaagcaactagaatatacaaagagaatggtt ctcaatgtagagcgagaaaagaacatgatcctagaacaacaggcccagcttcagagggaa aaagaacaagatcagatgaagctgtatgcaaaacttgacaagcttgatgtcttagaaaaa gagtgttttagacttacaacaactcagaaaactgctgaggacaagattaaacatttagaa gaaaaacttaaggaagaagaacatcagcgtaagctatttcaagacaaagcttctgagctt caaactggacttgaaatcagtaaaattatgtcttcagtttcaaatttaaagcactccaag gaaaagaagaaatcttcaaagaaaactaaatgtataaagagaggaccaccttggcaaatt tgttcaaagtttggagcactgccttttgtggctgaaaagatgaggcaacatcgtgaccca catatccttcagaaaccttttaacgtgactgagactagatgtctccccaagccttctaga acaactgcctggtgtaaagctattactcctgactcagaaaagtccatttccatttgtgac aatttatctgaacttttgatggcaatgcaagatgagctggaccaaatgagcatggagcac taa >gi568815588r:70332939_70541667|GENSCAN_predicted_peptide_3|134_aa MSSSAGSGHQPSQSRAIPTRTVAISDAAQLPHDYCTTPGGTLFSTTPGGTRIIYDRKFLL DRRNSPMAQTPPCHLPNIPGVTSPGTLIEDSKVEVNNLNNLNNHDRKHAVGHFSNLHQRY LKCWYLRISVTPLI >gi568815588r:70332939_70541667|GENSCAN_predicted_CDS_3|405_bp atgtcctcgtcagccggcagcggccaccagcccagccagagccgcgccatccccacccgc accgtggccatcagcgacgccgcgcagctacctcatgactattgcaccacgcccgggggg acgctcttctccaccacaccgggaggaactcgaatcatttatgacagaaagtttctgttg gatcgtcgcaattctcccatggctcagaccccaccctgccacctgcccaatatcccagga gtcactagccctggcaccttaattgaagactccaaagtagaagtaaacaatttgaacaac ttgaacaatcacgacaggaaacatgcagttggccatttttcaaatttacatcaaagatac ctgaagtgttggtatctgagaatatctgtcactcctcttatctga >gi568815588r:70332939_70541667|GENSCAN_predicted_peptide_4|347_aa MHAHCLPFLLHAWWALLQAGAATVATALLRTRGQPSSPSPLAYMLSLYRDPLPRADIIRS LQAEDVAVDGQNWTFAFDFSFLSQQEDLAWAELRLQLSSPVDLPTEGSLAIEIFHQPKPD TEQASDSCLERFQMDLFTVTLSQVTFSLGSMVLEVTRPLSKWLKHPGALEKQMSRVAGEC WPRPPTPPATNVLLMLYSNLSQEQRQLGGSTLLWEAESSWRAQEGQLSWEWGKRHRRHHL PDRSQLCRKVKFQVDFNLIGWGSWIIYPKQYNAYRCEGECPNPVGEEFHPTNHAYIQSLL KRYQPHRVPSTCCAPVKTKPLSMLYVDNGRVLLDHHKDMIVEECGCL >gi568815588r:70332939_70541667|GENSCAN_predicted_CDS_4|1044_bp atgcacgcccactgcctgcccttccttctgcacgcctggtgggccctactccaggcgggt gctgcgacggtggccactgcgctcctgcgtacgcgggggcagccctcgtcgccatcccct ctggcgtacatgctgagcctctaccgcgacccgctgccgagggcagacatcatccgcagc ctacaggcagaagatgtggcagtggatgggcagaactggacgtttgcttttgacttctcc ttcctgagccaacaagaggatctggcatgggctgagctccggctgcagctgtccagccct gtggacctccccactgagggctcacttgccattgagattttccaccagccaaagcccgac acagagcaggcttcagacagctgcttagagcggtttcagatggacctattcactgtcact ttgtcccaggtcaccttttccttgggcagcatggttttggaggtgaccaggcctctctcc aagtggctgaagcaccctggggccctggagaagcagatgtccagggtagctggagagtgc tggccgcggccccccacaccgcctgccaccaatgtgctccttatgctctactccaacctc tcgcaggagcagaggcagctgggtgggtccaccttgctgtgggaagccgagagctcctgg cgggcccaggagggacagctgtcctgggagtggggcaagaggcaccgtcgacatcacttg ccagacagaagtcaactgtgtcggaaggtcaagttccaggtggacttcaacctgatcgga tggggctcctggatcatctaccccaagcagtacaacgcctatcgctgtgagggcgagtgt cctaatcctgttggggaggagtttcatccgaccaaccatgcatacatccagagtctgctg aaacgttaccagccccaccgagtcccttccacttgttgtgccccagtgaagaccaagccg ctgagcatgctgtatgtggataatggcagagtgctcctagatcaccataaagacatgatc gtggaagaatgtgggtgcctctga >gi568815588r:70332939_70541667|GENSCAN_predicted_peptide_5|99_aa MAGPTEPGEVSAPGRPAGLEAEAQAPALVGKRAEGDPASPLPSSCRTFSTAVVRGAYTPR FTVTLHPTSPQNSGPLHPPAYRASPVGSLTVISGSIHQD >gi568815588r:70332939_70541667|GENSCAN_predicted_CDS_5|300_bp atggcaggacctacggagcccggtgaggtgagcgcgccaggccggccggctgggctggag gcagaggcccaggcgcccgccctcgtgggaaagcgcgcggagggcgacccggccagccct ctcccctccagctgccgcaccttctccacagcagttgtccgtggtgcctacactccccgc ttcacggtcacccttcacccgaccagtcctcagaactctggacccctgcacccaccggcc taccgtgcgtctcccgtgggctccctcacagtcatctccggctccattcatcaagactga >gi568815588r:70332939_70541667|GENSCAN_predicted_peptide_6|242_aa MGWRSWEERAGARARAGRAPSLPDSAPAVRAQHAGRRSTDAPRSRGLAGFCFSGALRGGG SVLMLVALGAVLGVGVAELTRSPWLALVGSVTSTSLWVQWEKISAKHVAQCMAWSECSVD VAVGAFDYEPSPVARTLQGFLSSQVHGPGPLDEDSGTVAVQMFSHGRVIQEHLRAPFMRQ AVVLGASSYLIFLGFIPASYADCLGFERLLPGHLHGPFPLLPGMCLHTAGAFLSSYVGIG LP >gi568815588r:70332939_70541667|GENSCAN_predicted_CDS_6|729_bp atgggctggcgaagttgggaggagcgagctggagccagagcgcgcgccgggcgcgccccg tcgctgcctgactcggcgcccgcagttcgggcgcagcacgccggccgcaggagcacggat gccccccggagccgcgggctggcaggtttctgtttttctggagcacttcgtggaggtggc agtgtgctcatgctcgtggccttgggtgcagttctgggagtgggagttgctgagctcacc cgaagcccttggctggcactagtggggtcagtaacctctacctccttgtgggttcagtgg gagaaaatcagtgcaaagcatgtggcccagtgcatggcatggagtgagtgctcagttgat gtggcggtgggggcctttgattatgagccatcccctgtggccaggactctgcagggcttc ctcagcagccaggtgcatggccctgggcctctggacgaggactctgggactgtggctgtg cagatgttttctcatggcagagttatccaagaacatcttcgtgctccctttatgaggcaa gctgttgttcttggggcatcatcttatttaatttttcttggcttcattccagcaagttat gccgattgtttaggatttgagaggttactgccagggcatctgcacgggccgttccctctg ctgccgggcatgtgtcttcacacggccggggccttcctgtcttcatacgtcggcataggc cttccgtga >gi568815588r:70332939_70541667|GENSCAN_predicted_peptide_7|200_aa MELKIGPDSGGRQRAEWPAGGNICLSSRTNTEEAAVLQGSPTVGAEHMKRPGADTEAQMV RVNLPQGCPSSYITLPKEVRKDKRRRDTKEATDMKEENSSGWRDGADMERAKGTTTQRVG TVRRAVRVALRNDPNATSGQGVGGLVARHAPAPDRLGLSEAAVHTCTRQEHTPSGTSSVS LLVKGGFYPSTQQFTPGSTS >gi568815588r:70332939_70541667|GENSCAN_predicted_CDS_7|603_bp atggagctaaaaatagggcccgactctgggggacggcagagggccgagtggccagctgga ggaaacatctgcctcagctcaagaacaaatacagaggaagctgccgtgctgcagggttcc cctactgtgggtgcagagcacatgaaacgtccaggtgcagacactgaggcccagatggtc cgggttaacttgccccaaggctgcccctcatcttacatcacgctcccaaaggaagtgcgg aaggataaaagaagacgtgatacaaaggaagccacagacatgaaagaagagaatagctca ggatggagagatggagcagatatggaaagagctaaaggcaccaccacccagagagttggc acagtcaggagagctgtgcgagtggcgctgaggaatgacccaaacgctacctcgggtcag ggagttggcggccttgtggccaggcatgcgccggccccagacaggctgggtctgagcgag gcagctgtgcacacctgcacccgccaggagcacacaccctctggaacctcctcggtctcc ctcttggtgaaaggaggattctaccccagcacccagcagttcacacctggctctacgtcc tag >gi568815588r:70332939_70541667|GENSCAN_predicted_peptide_8|871_aa MAGPHPVTQGPLLAAGQALSSREPACQTKQNEESDVMIVVVKEGGCCGPAQPGSALLAAL VESQGLLAIHRWRLGEAPGSSPGPEGSRLWQLAVAGSSVQAVNICVVSGPPLQGTSCSLS LPLEPMRGTPFEGLQGSGTMDSRHSVSIHSFQSTSLHNSKAKSIIPNKVAPVVITYNCKE EFQIHDELLKAHYTLGRLSDNTPEHYLVQGRYFLVRDVTEKMDVLGTVGSCGAPNFRQVQ GGLTVFGMGQPSLSGFRRVLQKLQKDGHRECVIFCVREEPVLFLRADEDFVSYTPRDKQN LHENLQGLGPGVRVESLELAIRKEIHDFAQLSENTYHVYHNTEDLWGEPHAVAIHGEDDL HVTEEVYKRPLFLQPTYRYHRLPLPEQGSPLEAQLDAFVSVLRETPSLLQLRDAHGPPPA LVFSCQMGVGRTNLGMVLGTLILLHRSGTTSQPEAAPTQAKPLPMEQFQVIQSFLRMVPQ GRRMVEEVDRAITACAELHDLKEVVLENQKKLEGIRPESPAQGSGSRHSVWQRALWSLER YFYLILFNYYLHEQYPLAFALSFSRWLCAHPELYRLPVTLSSAGPVAPRDLIARGSLVSM TGSRRSGHGCVLALAPKHWIPVGLGRLLWVPGLVLSPQREDDLVSPDALSTVREMDVANF RRVPRMPIYGTAQPSAKVTGPQGLGPPALGSILAYLTDAKRRLRKVVWVSLREEAVLECD GHTYSLRWPGPPVAPDQLETLEAQLKAHLSEPPPGKEGPLTYRFQTCLTMQEVFSQHRRA CPGLTYHRIPMPDFCAPREEDFDQLLEALRAALSKDPGTGFVFSCLSGQGRTTTAMVVAV LAFWHIQGFPEVGEEELVSVPDAKFTKGEFQ >gi568815588r:70332939_70541667|GENSCAN_predicted_CDS_8|2613_bp atggctggcccacacccggtgacccagggaccgctgctggctgcggggcaggctttgtca tcccgagaacccgcctgccagacaaagcagaatgaggagagtgatgtgatgatcgtggtt gttaaggagggtggctgctgtggcccagcccagcctggctctgccctgctggctgccttg gttgagagccagggactgctggccattcacagatggcgcctgggcgaggcccctggttct agtcctggtcctgaaggttcacgtctgtggcagctggcagttgctgggagttcagtgcag gctgtgaacatctgtgtcgtctctgggccacccttgcagggcaccagctgtagcctgtca ttgcctctggagcccatgagaggcaccccatttgagggcctacagggcagtggcacgatg gacagtcggcactccgtcagcatccactccttccagagcactagcttgcataacagcaag gccaagtccatcatccccaacaaggtggcccctgttgtgatcacgtacaactgcaaggag gagttccagatccatgatgagctgctcaaggctcattacacgttgggccggctctcggac aacacccctgagcactacctggtgcaaggccgctacttcctggtgcgggatgtcactgag aagatggatgtgctgggcaccgtgggaagctgtggggcccccaacttccggcaggtgcag ggtgggctcactgtgttcggcatgggacagcccagcctctcagggttcaggcgggtcctc cagaaactccagaaggacggacatagggagtgtgtcatcttctgtgtgcgggaggaacct gtgcttttcctgcgtgcagatgaggactttgtgtcctacacacctcgagacaagcagaac cttcatgagaacctccagggccttggacccggggtccgggtggagagcctggagctggcc atccggaaagagatccacgactttgcccagctgagcgagaacacataccatgtgtaccat aacaccgaggacctgtggggggagccccatgctgtggccatccatggtgaggacgacttg catgtgacggaggaggtgtacaagcggcccctcttcctgcagcccacctacaggtaccac cgcctgcccctgcccgagcaagggagtcccctggaggcccagttggacgcctttgtcagt gttctccgggagacccccagcctgctgcagctccgtgatgcccacgggcctcccccagcc ctcgtcttcagctgccagatgggcgtgggcaggaccaacctgggcatggtcctgggcacc ctcatcctgcttcaccgcagtgggaccacctcccagccagaggctgcccccacgcaggcc aagcccctgcctatggagcagttccaggtgatccagagctttctccgcatggtgccccag ggaaggaggatggtggaagaggtggacagagccatcactgcctgtgccgagttgcatgac ctgaaagaagtggtcttggaaaaccagaagaagttagaaggtatccgaccggagagccca gcccagggaagcggcagccgacacagcgtctggcagagggcgctgtggagcctggagcga tacttctacctgatcctgtttaactactaccttcatgagcagtacccgctggcctttgcc ctcagtttcagccgctggctgtgtgcccaccctgagctgtaccgcctgcccgtgacgctg agctcagcaggccctgtggctccgagggacctcatcgccaggggctccctagtgagtatg actggcagtcggagaagtgggcatggctgtgtactggccctggcccctaaacactggatt cctgtagggttgggtcggctgctgtgggtcccaggcttggtgctctccccacagcgggag gacgatctggtctccccggacgcgctcagcactgtcagagagatggatgtggccaacttc cggcgggtgccccgcatgcccatctacggcacggcccagcccagcgccaaggtgaccggc cctcagggcctgggtcccccagccctggggagcatcctggcctacctgacggacgccaag aggaggctgcggaaggttgtctgggtgagccttcgggaggaggccgtgttggagtgtgac gggcacacctacagcctgcggtggcctgggccccctgtggctcctgaccagctggagacc ctggaggcccagctgaaggcccatctaagcgagcctcccccaggcaaggagggccccctg acctacaggttccagacctgccttaccatgcaggaggtcttcagccagcaccgcagggcc tgtcctggcctcacctaccaccgcatccccatgccggacttctgtgccccccgagaggag gactttgaccagctgctggaggccctgcgggccgccctctccaaggacccaggcactggc ttcgtgttcagctgcctcagcggccagggccgtaccacaactgcgatggtggtggctgtc ctggccttctggcacatccaaggcttccccgaggtgggtgaggaggagctcgtgagtgtg cctgatgccaagttcactaagggtgaatttcag