GENSCAN 1.0 Date run: 7-Nov-116 Time: 22:00:50 Sequence gi568815588r:91707182_91907439 : 200258 bp : 38.79% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 9151 9203 53 0 2 114 119 13 0.812 7.68 1.02 Intr + 32081 32198 118 2 1 101 96 45 0.064 6.15 1.03 Term + 32947 33396 450 0 0 38 51 204 0.049 5.90 1.04 PlyA + 34948 34953 6 1.05 2.00 Prom + 36872 36911 40 -7.85 2.01 Sngl + 43342 44154 813 0 0 88 43 585 0.805 49.52 2.02 PlyA + 44218 44223 6 1.05 3.00 Prom + 44742 44781 40 -6.15 3.01 Init + 44835 45872 1038 2 0 61 41 518 0.297 38.43 3.02 Intr + 46697 46864 168 1 0 26 86 122 0.175 5.02 3.03 Term + 51311 51316 6 1 0 131 33 0 0.102 -4.31 3.04 PlyA + 54038 54043 6 1.05 4.02 PlyA - 55484 55479 6 1.05 4.01 Sngl - 59327 59022 306 2 0 85 43 321 0.974 20.92 4.00 Prom - 65330 65291 40 -3.05 5.00 Prom + 70565 70604 40 -3.45 5.01 Init + 91510 91708 199 0 1 69 91 343 0.834 29.81 5.02 Intr + 91976 92077 102 0 0 90 86 67 0.205 5.93 5.03 Intr + 105802 106026 225 2 0 68 91 249 0.998 20.23 5.04 Intr + 109953 110048 96 1 0 68 119 114 0.940 11.56 5.05 Intr + 112089 112125 37 1 1 102 98 36 0.917 2.40 5.06 Intr + 112301 112376 76 2 1 72 75 57 0.709 1.40 5.07 Intr + 112758 112852 95 2 2 36 98 66 0.492 0.24 5.08 Intr + 115115 115181 67 2 1 68 96 34 0.453 0.19 5.09 Intr + 119836 120022 187 0 1 85 84 37 0.656 1.34 5.10 Intr + 121104 121225 122 1 2 75 23 138 0.736 5.29 5.11 Intr + 123742 123833 92 0 2 18 89 48 0.696 -4.23 5.12 Intr + 123922 124000 79 1 1 70 69 117 0.845 6.63 5.13 Intr + 126672 126843 172 2 1 73 62 66 0.833 1.09 5.14 Intr + 129738 129817 80 1 2 113 69 46 0.952 3.55 5.15 Intr + 133380 133525 146 2 2 92 61 149 0.977 10.76 5.16 Intr + 134102 134267 166 2 1 76 76 127 0.999 9.34 5.17 Intr + 134991 135210 220 2 1 54 111 130 0.999 8.85 5.18 Intr + 137738 137847 110 0 2 40 98 97 0.974 4.98 5.19 Intr + 138571 138759 189 0 0 77 89 224 0.998 20.26 5.20 Intr + 141202 141454 253 0 1 51 76 217 0.666 12.88 5.21 Intr + 142331 142413 83 0 2 74 109 8 0.797 -0.06 5.22 Intr + 144035 144155 121 1 1 32 63 126 0.819 3.65 5.23 Intr + 147848 147945 98 0 2 78 87 77 0.996 5.41 5.24 Intr + 148433 148507 75 1 0 61 85 56 0.724 1.39 5.25 Intr + 150244 150349 106 0 1 74 95 105 0.997 8.67 5.26 Intr + 152281 152467 187 2 1 82 110 73 0.997 6.73 5.27 Intr + 154818 154974 157 0 1 116 95 66 0.992 9.19 5.28 Intr + 176610 176704 95 2 2 89 23 103 0.088 1.74 5.29 Intr + 178107 178229 123 0 0 110 40 100 0.011 6.18 5.30 Intr + 182775 182965 191 0 2 46 53 151 0.015 5.71 5.31 Term + 187237 187481 245 2 2 21 43 168 0.010 0.68 5.32 PlyA + 187557 187562 6 1.05 6.02 PlyA - 187671 187666 6 1.05 6.01 Term - 190062 189752 311 1 2 11 50 205 0.194 3.24 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 100258 99998 261 1 0 108 42 233 0.931 15.61 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588r:91707182_91907439|GENSCAN_predicted_peptide_1|206_aa MPVQDGLFLKKTNKTYLWSSLHFLKLNVGLSSEVGTLFIDDILKYVFQVAGFLPLFQKSA ALEGPMCSWTAGNNNPNGWYQPKHFIRLVTAGSVLIHMCQQHQQNGRVHTCQLGWSSGRH GAASLFTCIHSNSNGSVSQEQDHWHPCVCSHWWWCWCQGRVLACTGLPASMHSFMPAAMA ALVGGWGHTGFSVHVHTGNSSAAVDM >gi568815588r:91707182_91907439|GENSCAN_predicted_CDS_1|621_bp atgcccgtccaagatggtttatttttaaagaagaccaataaaacttacctctggagttct ctgcattttctgaaattgaatgttggcctctctagtgaagtgggaacacttttcatagat gatatcctcaaatatgttttccaagttgctggctttctgcctctctttcagaaatctgct gcactagagggaccaatgtgttcctggactgctggcaacaataaccccaatgggtggtac cagccaaagcacttcataaggttggtgacagcaggatctgtgctcattcacatgtgccag cagcaccagcagaatggcagggtgcacacttgtcagctggggtggagctctggtaggcat ggggctgccagcctctttacatgcattcacagcaacagcaatggcagtgtgagtcaggaa caggaccactggcatccatgtgtgtgttcacattggtggtggtgttggtgccaaggcagg gtgctggcctgcacaggactgccagcctctatgcactcattcatgccagcagcaatggca gcattggttggaggatggggccacactggcttctcagtgcatgttcacactggcaatagc agtgcagcagtggacatgtga >gi568815588r:91707182_91907439|GENSCAN_predicted_peptide_2|270_aa MGKKQSRKTGNSKKQSTSPPPKEGSSSPATEQSWTGNDFDELREEGFRRSNYSELQEDIQ TKGKEVENFEKNLEECITTITNTEKYLKELMELKTKARELREECRSFRSQCNQLEERVPA MKDEMNEMKREGKFREKRIKRNEQSLQEIWDYVKRPNLRLIGVPESDGENGTKLENTLQD IIQENFPSLARQANIQIQEIQRTPQRYSSRRATPRHIIVRLTKVEMKEKMLRAARERSGY PQREAHQTNSRSLGRNSTSQKRVGANIQHS >gi568815588r:91707182_91907439|GENSCAN_predicted_CDS_2|813_bp atggggaaaaaacagagcagaaaaactggaaactctaaaaagcagagcacctctcctcct ccaaaggaaggcagttcctcaccagcaacggaacaaagctggacagggaatgactttgac gagctgagagaagaaggcttcagaagatcaaattactctgagctacaggaggacattcaa accaaaggcaaagaagttgaaaactttgaaaaaaatttagaagaatgtataactacaata accaatacagagaagtacttaaaggagctgatggagctgaaaaccaaggctcgagaacta cgtgaagaatgcagaagcttcaggagccaatgcaatcaactggaagaaagggtaccagcg atgaaagatgaaatgaatgaaatgaagcgagaagggaagtttagagaaaaaagaataaaa agaaacgagcaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacgtctg attggtgtacctgaaagcgacggggagaatggaaccaagttggaaaacactctgcaggat attatccaggagaacttccccagtctagcaaggcaggccaacattcagattcaggaaata cagagaacgccacaaagatactcctcgagaagagcaactccaagacacataattgtcaga ttgaccaaagttgaaatgaaggaaaaaatgttaagggcagccagagaaaggtcgggttac ccacaaagggaagcccatcagactaacagcagatctctcggcagaaactctacaagccag aagagagtgggggccaatattcaacattcttaa >gi568815588r:91707182_91907439|GENSCAN_predicted_peptide_3|403_aa MGDFNTPLSTLDRSTRQKVNKDTQELNSALRQADLIDIYRTLHPKSTEYTFFSAPRHTYS KIDHILGSKALLSKCKRTEIITNCLSDHSAIKLELRIKNLTQNHSTTWKLNNLLLNDYWV HNEMKAEIKMFFETNENKDTTYQNLWDAFKAVCRGKFIALNAHKRKQERSKIDTLTSQLK ELEKQEQTHSKASRRQEITKIRAELKEIETQKTLQKINESRSWFFERINKIDRPLARLIK KKREKNQIDAIKNDKGDITTDPTEIQTTIREYYKHLYANKLENLEEMDKFLDTYTLPRLN QEEVESLNRPITGSEIVAIINSLPTKKSPGPDGFTAEFYQRYKEELRIKYLGIQLTRDVK DLFKENYKPLLNEIKEDTNKWKNIPCSWVGRINIVKMAILPKN >gi568815588r:91707182_91907439|GENSCAN_predicted_CDS_3|1212_bp atgggagactttaacaccccactgtcaacattagacagatcaacgagacagaaagtcaac aaggatacccaggaattgaactcagctctgcgccaagcggacctaatagacatctacaga actctccaccccaaatcaacagaatatacatttttttcagcaccacgccacacctattcc aaaattgaccacatacttggaagtaaagctctcctcagcaaatgtaaaagaacagaaatt ataacaaactgtctctcagaccacagtgcaatcaaactagaactcaggattaagaatctc actcaaaaccactcaactacatggaaactgaacaacctgctcctgaatgactactgggta cataacgaaatgaaggcagaaataaagatgttctttgaaaccaacgagaacaaagacaca acataccagaatctctgggacgcattcaaagcagtgtgtagagggaaatttatagcacta aatgcccacaagagaaagcaggaaagatccaaaattgacaccctaacatcacaattaaaa gaactagaaaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaaa atcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaaattaatgaatcc aggagctggttttttgaaaggatcaacaaaattgatagaccgctggcaagactaataaag aaaaaaagagagaagaatcaaatagatgcaataaaaaatgataaaggggatatcaccact gatcccacagaaatacaaactaccatcagagaatactacaaacacctctacgcaaataaa ctagaaaatctagaagaaatggataaattcctcgacacatacactctccccagactaaac caggaagaagttgaatctctgaatagaccaataacaggatctgaaattgtggcaataatc aatagcttaccaaccaaaaagagtccaggaccagatggattcacagctgaattctaccag aggtacaaggaggaactgagaataaaatacctaggaatccaacttacaagggacgtgaag gacctcttcaaggagaactacaaaccactgctcaacgaaataaaagaggatacaaacaaa tggaagaacattccatgctcatgggtaggaagaatcaatatcgtgaaaatggccatactg cccaagaattag >gi568815588r:91707182_91907439|GENSCAN_predicted_peptide_4|101_aa MVMLKDQRMTVVGWLEGLLQLDDLINQLTSIMDTNQTYLVSECLEREERNQTQVLRQKQD EAYLASPRADQEKEMKKWEEWELKQQKEEEVQEQKLAEKRW >gi568815588r:91707182_91907439|GENSCAN_predicted_CDS_4|306_bp atggtgatgctgaaggatcagcggatgactgtggtgggatggctagaaggcctccttcaa ctggatgacctcattaaccaactgacatctatcatggatacaaaccagacttacctggtg tcagaatgcctggaaagggaagaaagaaaccagacccaggtgttgagacaaaagcaggat gaggcctacctggcttctcccagggctgaccaggagaaagaaatgaagaaatgggaggag tgggagcttaagcagcagaaggaggaggaggtgcaagagcaaaagctggcagagaagaga tggtag >gi568815588r:91707182_91907439|GENSCAN_predicted_peptide_5|1397_aa MSGRRCAGGGAACASAAAEAVEPAARELFEACRNGDVERVKRLVTPEKVNSRDTAGRKST PLHFAAVSTIPVPTSGLVTLEGLYTQERVGDVVDRPYDLKGFGRKDVVEYLLQNGANVQA RDDGGLIPLHNACSFGHAEVVNLLLRHGADPNARDNWNYTPLHEAAIKGKIDVCIVLLQH GAEPTIRNTDGRTALDLADPSAKAVLTGEYKKDELLESARSGNEEKMMALLTPLNVNCHA SDGRKSTPLHLAAGYNRVKIVQLLLQHGADVHAKDKGDLVPLHNACSYGHYEVTELLVKH GACVNAMDLWQFTPLHEAASKNRVEVCSLLLSYGADPTLLNCHNKSAIDLAPTPQLKERL AYEFKGHSLLQAAREADVTRIKKHLSLEMVNFKHPQTHETALHCAAASPYPKRKQICELL LRKGANINEKTKEFLTPLHVASEKAHNDVVEVVVKHEAKVNALDNLGQTSLHRAAYCGHL QTCRLLLSYGCDPNIISLQGFTALQMGNENVQQLLQEGISLGNSEADRQLLEAAKAGDVE TVKKLCTVQSVNCRDIEGRQSTPLHFAAGYNRVSVVEYLLQHGADVHAKDKGGLVPLHNA CSYGHYEVAELLVKHGAVVNVADLWKFTPLHEAAAKGKYEICKLLLQHGADPTKKNRDGN TPLDLVKDGDTDIQDLLRGDAALLDAAKKGCLARVKKLSSPDNVNCRDTQGRHSTPLHLA AGYNNLEVAEYLLQHGADVNAQDKGGLIPLHNAASYGHVDVAALLIKYNACVNATDKWAF TPLHEAAQKGRTQLCALLLAHGADPTLKNQEGQTPLDLVSADDVSALLTAAMPPSALPSC YKPQVLNGVRSPGATADALSSGPSSPSSLSAASSLDNLSGSFSELSSVVSSSGTEGASSL EKKEVPGVDFSITQFVRNLGLEHLMDIFEREQITLDVLVEMGHKELKEIGINAYGHRHKL IKGVERLISGQQGLNPYLTLNTSGSGTILIDLSPDDKEFQSVEEEMQSTVREHRDGGHAG GIFNRYNILKIQKVCNKKLWERYTHRRKEVSEENHNHANERMLFHGSPFVNAIIHKGFDE RHAYIGGMFGAGIYFAENSSKSNQYVYGIGGGTGCPVHKDRSCYICHRQLLFCRVTLGKS FLQFSAMKMAHSPPGHHSVTGRPSVNGLALAEYVIYRGEQALRHVRSASTAQGFVLIMHY GLASEAQTPYRLFQFSISSFRKPAGILLLPPSSHSCDPEPVEYRSLRGSCSTWRWSSGGN AHSLATHLLLCGPVPNRPSSTRRLGIPAQYDILPLRLLASPEPSLYLIREEPVKLGVIRR PKPLPVVLCFNAHAPGPLSCSSCTPGSSLKFVVDGGRRNSESLKVFDLSYKCIEENADVC CLLSLLQLHKRESTPVL >gi568815588r:91707182_91907439|GENSCAN_predicted_CDS_5|4194_bp atgtcgggtcgccgctgcgccggcgggggagcggcctgcgcgagcgccgcggccgaggcc gtggagccggccgcccgagagctgttcgaggcgtgccgcaacggggacgtggaacgagtc aagaggctggtgacgcctgagaaggtgaacagccgcgacacggcgggcaggaaatccacc ccgctgcacttcgccgcagtatcaacaatacctgtaccaacttccggtcttgttacccta gaaggcctctacacacaagaaagagtaggggacgtagttgatagaccttatgatttaaaa ggttttgggcggaaagacgtagttgaatatttgcttcagaatggtgcaaatgtccaagca cgtgatgatgggggccttattcctcttcataatgcatgctcttttggtcatgctgaagta gtcaatctccttttgcgacatggtgcagaccccaatgctcgagataattggaattatact cctctccatgaagctgcaattaaaggaaagattgatgtttgcattgtgctgttacagcat ggagctgagccaaccatccgaaatacagatggaaggacagcattggatttagcagatcca tctgccaaagcagtgcttactggtgaatataagaaagatgaactcttagaaagtgccagg agtggcaatgaagaaaaaatgatggctctactcacaccattaaatgtcaactgccacgca agtgatggcagaaagtcaactccattacatttggcagcaggatataacagagtaaagatt gtacagctgttactgcaacatggagctgatgtccatgctaaagataaaggtgatctggta ccattacacaatgcctgttcttatggtcattatgaagtaactgaacttttggtcaagcat ggtgcctgtgtaaatgcaatggacttgtggcaattcactcctcttcatgaggcagcttct aagaacagggttgaagtatgttctcttctcttaagttatggtgcagacccaacactgctc aattgtcacaataaaagtgctatagacttggctcccacaccacagttaaaagaaagatta gcatatgaatttaaaggccactcgttgctgcaagctgcacgagaagctgatgttactcga atcaaaaaacatctctctctggaaatggtgaatttcaagcatcctcaaacacatgaaaca gcattgcattgtgctgctgcatctccatatcccaaaagaaagcaaatatgtgaactgttg ctaagaaaaggagcaaacatcaatgaaaagactaaagaattcttgactcctctgcacgtg gcatctgagaaagctcataatgatgttgttgaagtagtggtgaaacatgaagcaaaggtt aatgctctggataatcttggtcagacttctctacacagagctgcatattgtggtcatcta caaacctgccgcctactcctgagctatgggtgtgatcctaacattatatcccttcagggc tttactgctttacagatgggaaatgaaaatgtacagcaactcctccaagagggtatctca ttaggtaattcagaggcagacagacaattgctggaagctgcaaaggctggagatgtcgaa actgtaaaaaaactgtgtactgttcagagtgtcaactgcagagacattgaagggcgtcag tctacaccacttcattttgcagctgggtataacagagtgtccgtggtggaatatctgcta cagcatggagctgatgtgcatgctaaagataaaggaggccttgtacctttgcacaatgca tgttcttatggacattatgaagttgcagaacttcttgttaaacatggagcagtagttaat gtagctgatttatggaaatttacacctttacatgaagcagcagcaaaaggaaaatatgaa atttgcaaacttctgctccagcatggtgcagaccctacaaaaaaaaacagggatggaaat actcctttggatcttgttaaagatggagatacagatattcaagatctgcttaggggagat gcagctttgctagatgctgccaagaagggttgtttagccagagtgaagaagttgtcttct cctgataatgtaaattgccgcgatacccaaggcagacattcaacacctttacatttagca gctggttataataatttagaagttgcagagtatttgttacaacacggagctgatgtgaat gcccaagacaaaggaggacttattcctttacataatgcagcatcttacgggcatgtagat gtagcagctctactaataaagtataatgcatgtgtcaatgccacggacaaatgggctttc acacctttgcacgaagcagcccaaaagggacgaacacagctttgtgctttgttgctagcc catggagctgacccgactcttaaaaatcaggaaggacaaacacctttagatttagtttca gcggatgatgtcagcgctcttctgacagcagccatgcccccatctgctctgccctcttgt tacaagcctcaagtgctcaatggtgtgagaagcccaggagccactgcagatgctctctct tcaggtccatctagcccatcaagcctttctgcagccagcagtcttgacaacttatctggg agtttttcagaactgtcttcagtagttagttcaagtggaacagagggtgcttccagtttg gagaaaaaggaggttccaggagtagattttagcataactcaattcgtaaggaatcttgga cttgagcacctaatggatatatttgagagagaacagatcactttggatgtattagttgag atggggcacaaggagctgaaggagattggaatcaatgcttatggacataggcacaaacta attaaaggagtcgagagacttatctccggacaacaaggtcttaacccatatttaactttg aacacctctggtagtggaacaattcttatagatctgtctcctgatgataaagagtttcag tctgtggaggaagagatgcaaagtacagttcgagagcacagagatggaggtcatgcaggt ggaatcttcaacagatacaatattctcaagattcagaaggtttgtaacaagaaactatgg gaaagatacactcaccggagaaaagaagtttctgaagaaaaccacaaccatgccaatgaa cgaatgctatttcatgggtctccttttgtgaatgcaattatccacaaaggctttgatgaa aggcatgcgtacataggtggtatgtttggagctggcatttattttgctgaaaactcttcc aaaagcaatcaatatgtatatggaattggaggaggtactgggtgtccagttcacaaagac agatcttgttacatttgccacaggcagctgctcttttgccgggtaaccttgggaaagtct ttcctgcagttcagtgcaatgaaaatggcacattctcctccaggtcatcactcagtcact ggtaggcccagtgtaaatggcctagcattagctgaatatgttatttacagaggagaacag gctttgcgtcatgttaggtcagcttccactgctcagggctttgtgctgatcatgcattac gggttggcttcagaggcacagacaccttaccgcctgtttcagtttagcatttcttccttc aggaaacctgccggaattctccttcttcccccgtcttcacacagctgtgaccccgaaccc gtggagtatcgctccttgaggggctcctgcagcacctggaggtggagctcaggcggtaat gcccactcgctggccactcacctcctgctgtgcggcccagttcctaacaggccaagctca acccggaggttggggatccctgctcagtacgacatactacctttgagacttttagcttct ccagagccctccctttatttaatacgtgaagagcctgtaaaattgggagtcatccggagg cctaaacccctccctgtggtgctgtgcttcaatgcgcacgctcctggtccactttcatgt tcctcctgtactcccggttcctctttgaagttcgtagtagatggcggtcgaagaaatagt gaaagtcttaaagtctttgatctttcttataagtgcatagaagaaaatgctgacgtatgc tgccttctctctctccttcagctacataaaagggaaagcacccctgtcctatga >gi568815588r:91707182_91907439|GENSCAN_predicted_peptide_6|103_aa XCFHAADKDIPETGKKKRFNWTYSSIWLPRPQNHGGRQKALLTWQQQEKMRKIQKRKPLI NPSDLVRLIRYHKNNRGNCLHDSNYLSPGPSHNKWELEEYSSR >gi568815588r:91707182_91907439|GENSCAN_predicted_CDS_6|312_bp ntctgttttcatgctgctgataaagacataccagagactgggaagaaaaagaggtttaat tggacttacagttccatatggctgccgaggcctcagaatcatggcgggaggcaaaaggca cttcttacatggcagcagcaagagaaaatgaggaagatccaaaagcggaaacccctgata aacccatcagatctcgtgagacttattcgctaccacaagaacaataggggaaactgcctc catgattcaaattatctctcaccaggtccctcccacaacaagtgggaattagaggagtac agttcaagatga