GENSCAN 1.0    Date run: 6-Nov-116    Time: 22:07:02

Sequence gi568815595r:184610901_184811821 : 200921 bp : 48.57% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +     95    430  336  1  0   36  -35   234 0.034   1.24
 1.02 Intr +   6343   6610  268  0  1   29   89   467 0.466  38.01
 1.03 Term +   7861   8048  188  2  2   22   55   136 0.278   1.35
 1.04 PlyA +  10008  10013    6                               1.05

 2.04 PlyA -  10343  10338    6                               1.05
 2.03 Term -  19308  19132  177  0  0   75   54   146 0.540   7.59
 2.02 Intr -  30608  30446  163  1  1   69   56    75 0.190   2.38
 2.01 Init -  33544  33525   20  1  2   89  110     9 0.393   2.77
 2.00 Prom -  35013  34974   40                              -4.86

 3.12 PlyA -  38025  38020    6                               1.05
 3.11 Term -  46234  46032  203  2  2   48   55   106 0.083   0.85
 3.10 Intr -  55018  54919  100  1  1   16  100    84 0.020   2.08
 3.09 Intr -  59352  59239  114  0  0   22   47   114 0.026   1.34
 3.08 Intr -  61712  61534  179  0  2   53  103    12 0.004  -1.16
 3.07 Intr -  73284  73119  166  0  1   95   78    47 0.834   3.93
 3.06 Intr -  73821  73696  126  0  0  101  109    49 0.982   9.18
 3.05 Intr -  74063  74028   36  2  0  133   40    53 0.859   3.46
 3.04 Intr -  81293  81113  181  1  1   46   70   112 0.013   5.07
 3.03 Intr -  82496  82407   90  1  0   70   92    27 0.010   0.41
 3.02 Intr - 101251 100002 1250  1  2   46   81   934 0.046  75.80
 3.01 Init - 109005 108946   60  0  0   48  111    26 0.255   2.15
 3.00 Prom - 114027 113988   40                              -0.56

 4.00 Prom + 115028 115067   40                              -5.76
 4.01 Init + 115232 115305   74  1  2   57   81     6 0.183  -2.64
 4.02 Intr + 116500 116680  181  1  1  113   62    71 0.074   6.87
 4.03 Intr + 123293 123340   48  1  0  104   98     1 0.027   1.58
 4.04 Intr + 130369 130471  103  0  1    6   79   117 0.063   2.35
 4.05 Intr + 135443 135506   64  0  1   95  100   -10 0.036  -1.32
 4.06 Intr + 135743 135956  214  2  1   66   95    90 0.067   6.22
 4.07 Intr + 137836 137885   50  0  2  149   64    16 0.472   2.78
 4.08 Intr + 140214 140382  169  0  1  -17   95   112 0.402   1.35
 4.09 Intr + 140878 140916   39  0  0   92   95    14 0.567   0.92
 4.10 Intr + 141736 141790   55  0  1   52   73    48 0.013  -1.75
 4.11 Intr + 148720 148786   67  2  1  119  105    30 0.883   5.76
 4.12 Intr + 151044 151087   44  0  2  123   97   -16 0.348   0.68
 4.13 Intr + 165076 165138   63  2  0  105  121     8 0.648   4.49
 4.14 Intr + 165267 165377  111  1  0  129   53    61 0.534   7.05
 4.15 Term + 174640 174713   74  2  2  131   51    55 0.622   4.17
 4.16 PlyA + 174915 174920    6                               1.05

 5.00 Prom + 176462 176501   40                              -7.26
 5.01 Init + 176505 176556   52  2  1   92   92    52 0.062   7.32
 5.02 Intr + 186175 186404  230  2  2   75   70   119 0.264   6.39
 5.03 Intr + 192196 192328  133  0  1   42   45    97 0.428   1.02
 5.04 Intr + 195999 196151  153  1  0   95  113    35 0.670   6.74
 5.05 Term + 196802 196905  104  0  2   33   46    83 0.516  -2.96
 5.06 PlyA + 197899 197904    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  54930  55034  105  2  0  107   49    86 0.886   5.01
S.002 Intr + 145624 145688   65  0  2   99   81    39 0.934   1.82
S.003 Intr + 145947 146086  140  0  2   48   80    94 0.970   4.71

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595r:184610901_184811821|GENSCAN_predicted_peptide_1|263_aa
MPGRRAVKPRFTIASLGLLAAAMTTTDHHHCHISVSTTITVSISTIAIFISTTLSVSTII
TTSTPATISISITIISTSTITSSSTVITMHRPCRLHHNHCFHLYYHHQLPPPPGQQSETL
PEKRKKKKEKEKGEEEEEEEEKEKKKKKRKKKKKKKKKKKKKKKRRRRRRRRRRRRRRRR
RRRGRKGSSSSSSSKMEGGGCGGVHVALDGSHENAPSDLLLLGVSSTKATAPKAMLPMGF
LPPKAEHSRGAKTGPILGDAGLL

>gi568815595r:184610901_184811821|GENSCAN_predicted_CDS_1|792_bp
atgccaggcaggagggctgtgaagcccagatttaccatcgctagcctgggtttgcttgct
gctgccatgaccaccaccgaccaccaccattgtcacatttccgtttctaccaccatcact
gtttccatctctaccatcgctatcttcatctcaaccaccctctctgtctccactattatc
accacctccacacccgccaccatcagcatctctatcaccatcatctccacctcaaccatc
acctccagctccaccgtcatcactatgcaccgcccctgccgcctccaccataaccactgc
ttccatctttactaccatcaccaactacctccgccacctgggcaacagagcgagactctg
cctgaaaaaagaaagaagaagaaggagaaggagaagggggaggaggaggaggaggaggag
gagaaggagaagaagaagaagaagaggaagaagaagaagaagaagaagaagaagaagaag
aagaagaagaaaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaaga
agaagaagaggaagaaaaggaagcagcagcagcagcagcagcaagatggaaggcggaggt
tgcggtggtgtgcacgttgctttggatggcagccatgaaaatgccccatcagatctccta
ctgctgggagtgtcatcgaccaaggccacggctcctaaggccatgcttcccatgggcttc
ctgccacccaaggctgagcacagcaggggcgctaagacaggaccaatcttgggagatgca
gggctgctctga

>gi568815595r:184610901_184811821|GENSCAN_predicted_peptide_2|119_aa
MDMDSSRIPGCCLCNFESFSKRLHGIPTPLNLPLDPLVFNLQILALEIQESIRIHQHYVQ
NSYRGLNVHRAPREAVPGEGQIEFDIVVAREHPGRRFGFDYRRPGTGLLAMASSPELPA

>gi568815595r:184610901_184811821|GENSCAN_predicted_CDS_2|360_bp
atggacatggactcaagcagaatccctgggtgttgcctctgtaattttgaaagtttctcc
aagaggcttcatggtattcccactcccctcaacctgcccctggatccactggtcttcaat
ttgcagatcctagccctggagatccaagaatcaattcgaatccaccaacactatgttcag
aatagctaccgaggcctcaacgtgcatagggctcccagagaagctgtcccaggtgagggc
cagattgagtttgacatcgtcgttgccagagagcaccctggccgccgctttggttttgat
tacaggcggcccggcaccgggctactagccatggcctcttcacctgaactgccggcatga

>gi568815595r:184610901_184811821|GENSCAN_predicted_peptide_3|834_aa
MQQQGSGPQESLVQPLTFQQQPGHTPNPEGTGKGGAARGTRVAVLAEVAGSGPSRGRGLR
LRAAGSGCWRRCGRRFDSRAVRPSSHKAPAAIAPCTPAVTAAVQPLPRGLRGGSHTPRPV
YALCTCLPENMLQTPESRGLPVPQAEGEKDGGHDGETRAPTASQERPKEELGAGREEGAA
EPALTRKGARALAAKALARRRAYRRLNRTVAELVQFLLVKDKKKSPITRSEMVKYVIGDL
KILFPDIIARAAEHLRYVFGFELKQFDRKHHTYILINKLKPLEEEEEEDLGGDGPRLGLL
MMILGLIYMRGNSAREAQVWEMLRRLGVQPSKYHFLFGYPKRLIMEDFVQQRYLSYRRVP
HTNPPEYEFSWGPRSNLEISKMEVLGFVAKLHKKEPQHWPVQYREALADEADRARAKARA
EASMRARASARAGIHLCSVKTKSFSKTSLLREKVEENESRDYKHLSGEPEVQGVVKGSWF
LFHPKSGLLVSELRASSHSQWGQGVGELRLQDFGSSHRPPGPALPASEGVEETEDEGVRC
LVMVIMLLLECGSDAAPTSRPSSLSFRATVRFARPPSEWLQSCPVLVRGLPAEGSPETPL
ALKLLGAQQSAAQTRILMGPEASSPASQPAFGYSQTAQVKPGACVSSLRTGTSLYSRLIP
RGQQVHDTSKTSGNVCEMDSCTPSIPSLMDASIQHVIRVFPVTCTSPNDMGQVSDLTDVD
IPNQVVVHTHVFVHEPQIRSIKRKGFGVASTNLQASTCTGTLRDCEEDTQATLGRLDKST
YTFKLYLPTQGLEGLTLESVPGFCRKHLPGPRAPREQIHNVNCIWMKKQLEKAA

>gi568815595r:184610901_184811821|GENSCAN_predicted_CDS_3|2505_bp
atgcagcaacaaggcagtggtcctcaagaatcgctggtccagccactcaccttccagcag
cagcctgggcacacccccaacccagaagggaccggaaaaggaggggctgcgcgcggcact
agggtcgccgtcctggctgaggtggccggaagtggcccctcgcgcgggcgcggactgagg
ctgcgcgccgcaggttccggctgctggcggcgttgcggccgcaggtttgactcccgtgcg
gtgcggcccagcagccacaaagctcccgctgccattgctccttgtactcccgccgtcact
gccgctgtccaacccctcccccggggcttgcgcggcggctcccacacccctcggcccgtg
tacgcgctctgcacctgcctgcccgaaaacatgttgcagacaccagagagcagggggctc
ccggtcccgcaggccgagggggagaaggatggcggccatgatggtgagacccgggccccg
accgcctcgcaggagcgccccaaggaggagcttggcgccgggagggaggagggggctgcg
gagcccgccctcacccggaaaggcgcgagggccttggcggccaaagccttggcaaggcgc
agggcctaccgccggctgaatcggacggtggcggagttggtgcagttcctcctggtgaaa
gacaagaagaagagtcccatcacacgctcggagatggtgaaatacgttattggagacttg
aagattctgttcccggacatcatcgcaagggccgcagagcatctgcggtatgtctttggt
tttgagctgaaacagtttgaccgcaagcaccacacttacatcctgatcaacaaactaaaa
cctctggaggaggaggaggaggaggatctgggaggagatggccccagattgggtctgtta
atgatgatcctgggccttatctatatgagaggtaatagcgccagggaggcccaggtctgg
gagatgctgcgtcggttgggggtgcaaccctcaaagtatcatttcctctttgggtatccg
aagaggcttattatggaagattttgtgcagcagcgatatctcagttacaggcgggtgcct
cacaccaatccaccagaatatgaattctcttggggtccccgaagcaacctggaaatcagc
aagatggaagtcctggggttcgtggccaaactgcataagaaggaaccgcagcactggcca
gtgcagtaccgtgaggccctagcagacgaggccgacagggccagagccaaggccagagct
gaagccagtatgagggccagggccagtgctagggccggcatccacctctgctcagtgaaa
acgaagagtttctctaaaacatctttgctgagagagaaagtagaagaaaatgaatcaaga
gactataagcacttgagtggggagcccgaagtccagggtgttgtaaaaggctcctggttt
ctctttcaccccaagtctggcctactcgtgtctgagctgagagccagcagccacagccaa
tggggccagggagttggtgagctgcggctccaggactttggcagcagccacaggccccca
ggcccagcactgcctgcctcagaaggtgtggaggagacagaagatgaaggagtgcggtgt
ttagtaatggtgattatgctacttttagaatgtggatcggatgctgcccccacttcgcgg
ccatcatccttgtctttccgtgccactgtgcggtttgcccgtcctccttctgaatggctg
cagagctgtcctgtccttgttagaggcctgcctgcggagggcagccctgaaactcccctg
gctctgaaactgctcggcgcacagcagagcgcagcacagacccgaatccttatgggcccc
gaggctagctctccagcctctcagcctgcttttggatactctcaaacagcccaggtgaag
ccgggggcctgtgtgagctccttgagaacagggaccagtttatattcccgtctgatcccc
agaggccagcaggtgcatgacacctcgaagacctcaggaaatgtgtgcgaaatggactcc
tgtactccatccatcccatccttaatggacgcatccatccaacacgtgataagagtcttt
cccgtcacctgtaccagccccaatgatatgggccaggtgtcagacctcactgatgtggac
atcccaaaccaagtggttgtgcacacgcatgtatttgtgcatgagccacagatacgcagc
attaagaggaagggttttggtgtggccagcacgaatctccaggcctccacatgcacagga
acactgcgggactgtgaggaagacactcaggcaacacttggccgattagataaatccacc
tacacctttaagctgtacctccccacgcagggcctagaaggtctaacccttgaatcagtc
ccagggttctgccgcaaacatctgccagggccaagagcccccagggagcaaatacataat
gtgaactgcatttggatgaagaagcagttagagaaggcagcatga

>gi568815595r:184610901_184811821|GENSCAN_predicted_peptide_4|451_aa
MCPACLVPNFAGSILSVDGATISERRDLIRCPLGNGLALWPLPNKNVAEHSKTAESECEV
LIEFPNADCLLCISGDCMPVILHSLGESIPAVRGLGSGVQRLPSGAEKEAEASLVEESRQ
AGKSQQADVGLLQADSQESGMLWCSRPLSQVQLQGPRQGMRLGHKGQARGECPQLYDSLA
PHSGLDATSDSPTPVMHWNLLCRFAELDAHTYSQLSRFFPPFKGPVIMVRRDKLRVRNSS
LGMRSLFKTILVLSSVEPVETQSPSGMKGVNSESPVNSRHQIQKQRELISEAPSDAPPLQ
PAGGGLDWISLDCPQRERKCCGGSSFYAKRRGSSAEHQVLSWHPQKSRADDTESCFSVPP
EEPGLHSPQDYEQSDASPDVLVISCVSSTQGKEHPMATTLKHRNGTCESSSPHRELLPPN
LGERERGCRSPQEGPEPLGVEGFCGGTQIMD

>gi568815595r:184610901_184811821|GENSCAN_predicted_CDS_4|1356_bp
atgtgtccagcctgcttggttcccaactttgctggatccattctatctgtggacggagcc
actatctctgaaaggagagatctcatccgctgtccactggggaatggccttgccctatgg
cccctaccaaacaaaaacgtagcagaacacagcaaaacagcagagtcggaatgtgaagtg
ctgatagagttccctaacgctgactgtctgctctgcatttctggagattgtatgcctgtc
atcctgcattccttgggagaaagtatccctgctgtgagaggtctgggatctggagtacag
aggctcccatcgggagctgagaaagaagcagaggcctctttggtggaggaatcccggcag
gccggcaagtcccaacaggccgatgttgggctgctgcaggcagacagccaggagtctgga
atgctatggtgttccaggcccctgagtcaggtgcagctgcagggaccaaggcagggcatg
cgtctagggcataagggccaggccaggggagaatgtcctcagctgtatgactccctggct
ccacactctggccttgatgcaacttcagattctcctactccagtaatgcactggaacctg
ctttgtcgatttgctgaactcgatgcacacacctactctcaactcagtaggttcttccct
cccttcaaagggcctgtgatcatggtgagaagagacaagttgagagtcagaaacagcagc
ttagggatgcgcagtcttttcaaaaccattttggttctgagctcagtggagccagtggag
acccagtcaccatctggcatgaaaggtgttaactcggagtctccggtcaactctagacat
cagatccagaaacaaagagaattaatttcggaggccccctcagatgcccctcctctccag
cctgccggtggtggcctggactggatctcacttgactgtcctcaacgggaacggaagtgt
tgtggaggaagcagcttctacgctaagaggagaggcagctcagctgagcaccaggtcctg
agctggcacccacagaagagcagagcagatgacactgagagctgcttctctgtccctccg
gaagagcctgggctgcatagcccccaagactatgagcaatctgatgctagtcccgatgtc
ttggttatctcttgtgtatccagcacccaagggaaggagcatcccatggctacaactctc
aagcatcgaaatggaacctgtgagagttcttcaccccacagggaactccttccaccaaac
ttaggggagcgagaaaggggttgcagatctccccaagaagggcccgagcccctgggcgta
gaaggcttttgcggagggacacagattatggactga

>gi568815595r:184610901_184811821|GENSCAN_predicted_peptide_5|223_aa
MVLEPSVNELNYQKSLGVLPCDLAMPVSGTTVLNPPGPHGPPAGTVPIMAIHLLWPNALE
TPVKISSEVETEAGSGFLQPLSYPEIMTPSSFALPERQMILVDILQAVKCWRAYEQEDTG
LDTGLQCNSREDVRVSLLEHLEAMCLENSVTRWKETGFPRTEWRRELLLIFLVFIALTNA
VTFEQSPEEGTDGLLHQSATLVTADGPALTHHYIIIIHIYSLP

>gi568815595r:184610901_184811821|GENSCAN_predicted_CDS_5|672_bp
atggtgctagaaccttccgtcaatgaactcaattatcagaaaagcctgggagtcctcccc
tgtgaccttgctatgcctgtaagtggtaccaccgttcttaatcctcctggaccacacgga
ccacccgctggcacagtgcccataatggccatccacctgctgtggcctaatgctttggag
accccagtgaagatcagttctgaagttgagactgaggctgggtctggcttcctgcagccc
ttgtcctaccctgagatcatgaccccatcttcctttgcactgccagagagacagatgata
cttgtggacatcctgcaggctgttaaatgttggcgagcctacgaacaagaagacactggt
ctggacactggtctccagtgcaactccagagaggatgtgagggtatcattactggagcat
ttagaggccatgtgtttagaaaacagtgtcaccagatggaaagaaactgggttcccgaga
actgagtggagaagagaactgctcctcatcttccttgtatttattgccctgactaatgca
gtgacatttgagcaaagccctgaagaaggcacagatggtctcctccaccagagtgctaca
ttagttacagctgatggacctgcactgacacatcattacatcattatcatccacatctac
agtttaccttag