GENSCAN 1.0 Date run: 5-Nov-116 Time: 11:00:35

Sequence gi568815584r:93834659_94056932 : 222274 bp : 50.50% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   1407   1619  213  2  0  120   94    32 0.809   5.91
 1.02 Intr +  12710  12844  135  1  0   49   93    84 0.368   5.76
 1.03 Intr +  16204  16408  205  0  1   55   81    85 0.268   3.27
 1.04 Intr +  18505  18559   55  2  1   42   71    54 0.101  -2.86
 1.05 Intr +  22135  22326  192  1  0   78   65    88 0.247   4.11
 1.06 Intr +  23800  23875   76  1  1   61   40    83 0.330   0.32
 1.07 Intr +  34412  34522  111  1  0   70   94    27 0.086   2.08
 1.08 Intr +  53449  53527   79  0  1  139   -3    71 0.089   2.22
 1.09 Intr +  54527  54602   76  0  1   15   96    61 0.097  -1.83
 1.10 Intr +  55559  55747  189  2  0   58   81   117 0.154   6.70
 1.11 Intr +  58407  58497   91  0  1  100   80    21 0.821   2.50
 1.12 Intr +  65748  65939  192  2  0  105   75    85 0.836   8.59
 1.13 Intr +  68461  68540   80  0  2   78   94    40 0.533   1.95
 1.14 Intr +  80424  80457   34  0  1   85   74    39 0.264   0.43
 1.15 Intr +  83541  83628   88  2  1  118   86    27 0.638   5.04
 1.16 Intr +  86971  87026   56  2  2   77   46    44 0.092  -2.20
 1.17 Term +  93541  94506  966  0  0  109   42   858 0.510  75.36
 1.18 PlyA +  94921  94926    6                               1.05

 2.11 PlyA -  99527  99522    6                               1.05
 2.10 Term - 100134  99998  137  1  2  125   36    96 0.995   6.28
 2.09 Intr - 103193 103040  154  2  1   65   91   313 0.927  28.95
 2.08 Intr - 105014 104450  565  1  1   91   80  1450 0.981 137.30
 2.07 Intr - 110476 110333  144  0  0   27   77    85 0.607   0.70
 2.06 Intr - 112862 112691  172  0  1  131   76   398 0.998  41.90
 2.05 Intr - 116622 116341  282  1  0   34  113   488 0.651  43.29
 2.04 Intr - 118849 118694  156  2  0   53  100   147 0.979  12.38
 2.03 Intr - 119825 119659  167  1  2  134   78   301 0.508  33.30
 2.02 Intr - 122212 122108  105  0  0  114   88    26 0.806   4.53
 2.01 Init - 129881 129676  206  2  2   76   99   253 0.993  23.52
 2.00 Prom - 132367 132328   40                              -7.06

 3.00 Prom + 137423 137462   40                              -8.36
 3.01 Init + 141560 141694  135  1  0   59  100   140 0.034  12.44
 3.02 Intr + 148911 149061  151  2  1  125   44    80 0.008   7.04
 3.03 Intr + 163457 163577  121  0  1  101   44    85 0.172   4.95
 3.04 Intr + 164925 165007   83  0  2   81   75   115 0.917   8.78
 3.05 Intr + 166487 166665  179  0  2   79   20   335 0.694  25.44
 3.06 Intr + 168565 168696  132  0  0   56   82   246 0.739  21.64
 3.07 Intr + 170197 170313  117  0  0  152  116   157 0.999  25.56
 3.08 Intr + 189145 189296  152  0  2   51   87   133 0.008   8.46
 3.09 Intr + 193704 193812  109  0  1   15   97    21 0.012  -4.01
 3.10 Intr + 193972 194142  171  0  0   79   42    78 0.079   2.44
 3.11 Intr + 195960 196070  111  2  0   53   75    51 0.298   0.88
 3.12 Intr + 202087 202191  105  0  0   52   20   107 0.464   0.71
 3.13 Intr + 202722 202817   96  2  0  100   78    73 0.993   7.71
 3.14 Intr + 204305 204423  119  1  2  116   89   163 0.996  18.46
 3.15 Intr + 209313 209397   85  0  1  113   68   125 0.996  12.82
 3.16 Intr + 209907 210122  216  2  0   39   63   379 0.853  29.20
 3.17 Term + 211058 211264  207  1  0  101   38   356 0.998  29.24
 3.18 PlyA + 211838 211843    6                              -0.45

 4.04 PlyA - 213653 213648    6                               1.05
 4.03 Term - 216804 216533  272  1  2   21   36   256 0.638   9.35
 4.02 Intr - 218469 218340  130  0  1   90   98   244 0.999  25.87
 4.01 Intr - 220526 220338  189  2  0   93   70   238 0.972  22.28

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term - 149115 149034   82  2  1   79   55    89 0.807   1.87
S.002 Intr - 152157 152020  138  2  0   70   96    64 0.804   4.98
S.003 Intr - 152385 152261  125  0  2   70   72    96 0.836   5.58
S.004 Init - 191822 191720  103  2  1   76   72   108 0.851   6.40

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815584r:93834659_94056932|GENSCAN_predicted_peptide_1|945_aa
VSGPPCGITTWPHQHCDFGLQGILANNDGSGVPEQTVKDAWPSLSQPPKLHDVTSALLDG
IKAMASPPRFQCSLTILQKERLRLRELLRYSRTHSCYVAELVQSRGVLASKDDVSTLLRF
PIPYLEKVNASGQDARSQAGIKSRDLYLHYANFTEFVLVFSLVSPVLQPSISSSLENKPP
AFCKVTIQDGQVVLLHEVVHGPRAADGYAKRPLGRMLSVSGSWGDMQSRAQLSCELWSEL
AKQNVMPVPKECPVFTGNMGDNGLMSTPGICFPLNLAVSTCRSAEFLLLLVVTQQLTSMN
VNTETKRLTEWTRCGNKITNYKPDLSPCQGLAVSPKLECSGIVNMAHGSLNLLDSRRQIS
THTPDDGQRDLVVQRVYSLQGTEAQGKFSAPGADLGGAQKGHMALSVLLQFPETNDSGNK
NSEVGQGLRMARPGRFMEKETEAQPWKLSRAVWIMKPESARPGPPGGYSGNCRQFHALPH
LARFLRQPCTGSQLQPNPTLHMALDDLLRASFTCEEPLKEAASPSGCDGRTHRQEGRQWN
DLGPSWGPATSPPPGDDPALSRYSSGVTKKDPIALSRTLFAAALSFSLSPLSKMPNTLRL
SALTQEALPGHSVLRVAFVMFPELVSSVPFLGAAGHQQSLPSSWKASCSGPLVMASDSDV
KMLLNFVNLASSDIKAALDKSAPCRRSVDHRKYLQKQLKRFSQKYSRLPRGLPGRAAEPY
LKRGSEDRPRRLLLDLGPDSSPGGGGGCKEKVLRNPYREECLAKEQLPQRQHPEAAQPGQ
VPMRKRQLPASFWEEPRPTHSYHVGLEGGLGPREGPPYEGKKNCKGLEPLGPETTLVSMS
PRALAEKEPLKMPGVSLVGRVNAWSCCPFQYHGQPIYPGPLGALPQSPVPSLGLWRKSPA
FPGELAHLCKDVDGLGQKVCRPVVLKPIPTKPAVPPPIFNVFGYL

>gi568815584r:93834659_94056932|GENSCAN_predicted_CDS_1|2838_bp
gtctcagggcctccatgtggtatcaccacatggcctcaccagcactgtgacttcgggctt
cagggtattttggcaaataatgatggctcaggggtcccagagcaaacagtgaaagatgca
tggccttctctgagccagcctccaaaattacatgatgtcacttctgccctactggatggg
attaaagccatggcaagtccacccagattccagtgttctctgaccattttacagaaggaa
agactgaggctcagagagcttctgcgttattcaaggacacacagctgctacgtggcagag
ctggtgcagagtcgaggtgtgctggcatctaaagatgatgtttccactcttctcagattc
cctatcccttacctggaaaaggttaatgctagtggccaggatgctcggagccaggcaggc
ataaaatccagggatctctaccttcactatgcaaattttaccgaatttgtcctcgtgttc
tccctggtgtccccagtcctacaaccttctatttcatcctctttggagaataaacctcca
gccttctgtaaagtgacaatccaggatgggcaggtggttctgctccatgaagttgtccat
ggacccagggctgcagatggttatgccaagcgtccgctgggtaggatgctgagtgtgagt
ggctcatggggtgacatgcagagcagagcacagctcagctgtgagttatggtcagagttg
gccaagcagaatgtcatgccggtgccgaaggaatgtcccgtgtttaccgggaacatggga
gataatggcctcatgagtacccctggtatctgtttccctctgaatcttgctgtcagcaca
tgtcgaagtgctgagtttttgctgctgctggtggtgacacaacaactgaccagcatgaat
gttaacacagagaccaaaagactgacagaatggactcgttgtggcaataagataacaaat
tataaaccagacctaagcccatgccagggtcttgctgtgtcacccaagctggagtgcagt
ggcatcgtgaacatggctcatggcagcctcaacctcctggactcaagaaggcagatatca
actcacacacccgatgatggacagcgggacctcgtggtgcagcgtgtttacagcttacaa
ggcacagaagcacaaggaaaattctcagctcctggggctgatctgggaggagcccagaaa
ggccacatggccctctcggtgctgctgcaatttccagagacaaatgattctggcaataag
aactcagaagttggacaaggcctcagaatggctcgtcctggccgctttatggagaaagag
acagaggcccagccctggaagttgagccgggctgtttggattatgaagccagagtcggca
aggcctgggccacctggtggctactctgggaactgcaggcagttccatgctctgccccat
ctcgccagattcctgcggcaaccctgcactggttcccagcttcaaccaaaccccaccctc
cacatggccctggacgaccttcttcgtgcatccttcacttgtgaagagcccctcaaggag
gcggcatcaccttcaggatgtgatggacgcactcacaggcaggaggggaggcagtggaat
gatctggggccttcctggggtcctgccacatccccacctccaggagatgacccggcactg
agcaggtactccagcggggttacgaagaaggacccgatagccctgtcgcgaactctcttc
gccgctgcactgagcttctcactgtctccgctctccaaaatgcctaacactctccgtctc
agtgccttgactcaagaggcgctgccaggccactctgtcctccgcgtggcctttgttatg
ttccccgagctggtcagctcggtgcccttccttggagctgccggccaccagcagagccta
ccctcttcatggaaagcctcgtgcagtggccccctggtgatggcatccgacagtgatgtg
aagatgctgctgaacttcgtgaacctggcgtccagcgacatcaaggcagccctggataag
tccgcaccctgccgccgctccgtggaccatcgcaagtacctgcagaagcagctcaagcgc
ttctcccagaagtattcccggctcccgcggggccttcctggcagagctgctgagccctac
ctgaaaagggggtctgaggaccggcccaggaggctgctcctggatttgggccctgattcc
agccccggcgggggtgggggctgcaaggagaaggtgctgaggaacccctacagggaggaa
tgtcttgctaaggagcagctcccacagaggcagcatccagaagctgcccagcctggccag
gtgcccatgaggaaaagacagctgcccgcttccttctgggaagagccaaggcccacccac
agctaccatgtggggctggaggggggactgggccccagggagggacctccctatgagggt
aagaaaaattgcaagggcttggagcccctgggacctgagactaccctggtgtccatgtct
ccaagggccctggctgaaaaggagccgctcaagatgcctggggtctccttggtgggccgc
gtcaatgcctggagttgctgccccttccagtaccatggacagcccatctatccgggcccc
ctgggggcactgcctcagagtcctgtccccagcctgggcctttggaggaagagcccagcc
tttcccggggagctggcgcacctctgcaaggatgtggacggcctggggcagaaggtgtgc
aggcccgtggtgctgaaacccatccccaccaagccagccgtgcccccacccatcttcaat
gtctttggctacctctag

>gi568815584r:93834659_94056932|GENSCAN_predicted_peptide_2|695_aa
MATQISTRGSQCTIGQEEYSLYSSLSEDELVQMAIEQSLADKTRGPTTAEATASACTNRQ
PAHFYPWTRSTAPPESSPARAPMGLFQGVMQKYSSSLFKTSQLAPADPLIKAIKDGDEEA
LKTMIKEGKNLAEPNKEGWLPLHEAAYYGQVGCLKVLQRAYPGTIDQRTLQEETAVYLAT
CRGHLDCLLSLLQAGAEPDISNKSRETPLYKGPADHPCFPLSTACERKNAEAVKILVQHN
ADTNHRCNRGWTALHESVSRNDLEVMQILVSGGAKVESKNAYGITPLFVAAQSGQLEALR
FLAKYGADINTQASDNASALYEACKNEHEEVVEFLLSQGADANKTNKDGLLPLHIASKKG
NYRSGESESPGEKEPHCLFEKLPEQIRYSLSIAAGTIDLIQALVLQRRKPRIVQMLLPVT
SRTRIRRSGVSPLHLAAERNHDEVLEALLSARFDVNTPLAPERARLYEDRRSSALYFAVV
NNNVYATELLLQHGADPNRDVISPLLVAIRHGCLRTMQLLLDHGANIDAYIATHPTAFPA
TIMFAMKCLSLLKFLMDLGCDGEPCFSCLYGNGPHPPAPQPSSRFNDAPAADKEPSVVQF
CEFVSAPEVSRWAGPIIDVLLDYVGNVQLCSRLKEHIDSFEDWAVIKEKAEPPRPLAHLC
RLRVRKAIGKYRIKLLDTLPLPGRLIRYLKYENTQ

>gi568815584r:93834659_94056932|GENSCAN_predicted_CDS_2|2088_bp
atggccacgcagatcagcactcggggcagccagtgtaccattgggcaggaggagtacagc
ctgtacagcagcctgagcgaggatgaactggtgcagatggccatcgagcagagcctagcg
gacaagacaaggggcccaaccactgctgaggccaccgcgtctgcatgtaccaaccgccaa
cctgcccatttctacccatggaccaggtccactgcacctcctgagagttcgccggcccgg
gccccaatgggcttgttccaaggggtcatgcagaaatacagcagcagcttgttcaagacc
tcccagctggcgcctgcggaccccttgataaaggccatcaaggatggcgatgaagaggcc
ttgaagaccatgatcaaggaagggaagaatctcgcagagcccaacaaggagggctggctg
ccgctgcacgaggccgcatactatggccaggtgggctgcctgaaagtcctgcagcgagcg
tacccagggaccatcgaccagcgcaccctgcaggaggaaacagccgtttacttggcaacg
tgcaggggccacctggactgtctcctgtcactgctccaagcaggggcagagccggacatc
tccaacaaatcccgagagacaccgctctacaaaggccctgctgaccatccctgcttcccc
ctgagcacagcctgcgagcgcaagaacgcggaggccgtgaagattctggtgcagcacaat
gcagacaccaaccaccgctgcaaccgcggctggaccgctctgcacgagtctgtgtctcgc
aatgacctggaggtcatgcagatcctggtgagcggaggagccaaggtggaatccaagaac
gcctacggcatcacccccttgttcgtggccgcccagagtggacagttggaggccttgagg
ttcttagccaagtacggtgctgacatcaacacgcaggccagcgacaacgcgtctgccctc
tacgaggcctgcaagaatgagcatgaggaggtggtggagtttctgctgtcacagggtgcc
gacgccaacaagaccaacaaggacggcttgctcccgctgcacatcgcctccaagaagggc
aactacagatccggtgaatcagaatctccaggagagaaggagccccactgtctctttgaa
aagctccctgagcaaatcagatacagcctgtctatagctgcagggaccattgatctaatt
caggccctggttttacagagaaggaagccaaggatcgtgcagatgctgctgccggtgacc
agccgcacgcgcatacgccgtagcggcgtcagtccgctgcacctggcggccgagcgcaac
cacgacgaggtgctggaggcgctgctgagcgcgcgcttcgacgtgaacacgccgctggcc
cccgagcgcgcgcgcctctacgaagaccggcgcagctccgcgctgtacttcgcggtggtc
aacaacaacgtgtacgccaccgagctgctgctgcaacacggcgccgaccccaaccgcgac
gtcatcagccccttgctcgtggccatccgccacggctgcctgcgcacaatgcagctgctg
ctggaccacggcgcgaacatcgacgcctatatcgccacgcaccccaccgccttccccgcc
accatcatgttcgccatgaagtgcctgtcgctgctcaagttcctcatggacctgggctgc
gacggcgagccctgcttctcatgcctctacggcaacggcccgcacccgccggccccgcag
ccctccagcaggttcaacgacgcgcccgcggccgacaaggagcccagcgtggtgcagttc
tgtgagttcgtatctgccccagaggtgagccgctgggcggggcccatcatcgatgtcctc
ctggactacgtgggcaacgtgcagctctgctcgcggctgaaggaacacatcgacagcttt
gaggactgggccgtcatcaaggagaaggcagaacctccaagacctctggctcacctttgc
cgactgcgggttcgaaaggccattgggaaataccgtataaaactcctagacaccttgccg
ctcccaggcaggctgattagatacctgaaatacgagaacacccagtaa

>gi568815584r:93834659_94056932|GENSCAN_predicted_peptide_3|762_aa
MGKYMQIAESLFIASKTKVTSVLYNSRKYEREGKATSQFDYKRFPVERDGAGNAHLPALS
WDLGTRTAILRGRLYHPDVTDEELRPSWDLPELGDACGGEVMAAMDTGQRADPSNPGDKE
GDLQGLWQELYQLQAKQKKLKREVEKHKLFEDYLIKVLEKIPEGCTGWEEPEEVLVEATV
KHYGKLFTASQDTQKRLEAFCQMIQAVHRSLESLEEDHRALMLSLKIRLCQLQKKCYRKQ
EQWWQLKHSITYQKDIDFDTHTSSSYNDQLLGYMQMTITNMARQCCPSAHGVPKSMDLFS
KLDLIKQGISTKGGQTWWGNSKGFMTIWAIREGLMDNDPGFERQVVSDKWAFQKEQRRNI
LGKYSIFFTFLFLHHWAKLEEGRPVEDTKTLGKAPSGHLCQACAGHCDTDEKDKVPARCR
SSLPAGQDSWTQAVSELQVMPKRPKDIGNTWSSREGTCSRLNNGPNDVHVLIPTTCDYVT
VARGTWQVVHEDALLPCAPRAQISDTFSGISMYQCHTLRPLLSETSFNLISEKCDILSIL
RDHPENRIYRRKIEELSKRFTAIRKTKGDGNCFYRALGYSYLESLLGKSREIFKFKERVL
QTPNDLLAAGFEEHKFRNFFNALRAGRAQFYSVVELVEKDGSVSSLLKVFNDQSASDHIV
QFLRLLTSAFIRNRADFFRHFIDEEMDIKDFCTHEVEPMATECDHIQITALSQALSIALQ
VEYVDEMDTALNHHVFPEAATPSVYLLYKTSHYNILYAADKH

>gi568815584r:93834659_94056932|GENSCAN_predicted_CDS_3|2289_bp
atgggtaagtacatgcaaattgctgagtctctgttcattgcatccaaaacgaaagtcacc
agcgtgctgtataactcaagaaaatacgagagagagggaaaagcaacatcacagtttgac
tacaagcggtttccagtagaaagggatggggcaggcaacgcccatttaccagccctctcc
tgggatctgggcactcgcacggccatcctgcgagggaggctttatcatcctgatgttacg
gatgaggagctgagaccctcatgggacttgcccgaactcggtgatgcttgcggtggtgag
gtgatggcagccatggacacaggccagagagctgacccaagcaatcctggtgacaaggaa
ggggaccttcaagggctgtggcaggaactctaccagctccaggctaagcagaagaagctc
aagagagaagtcgagaagcacaagctttttgaagactatctgattaaggtccttgagaaa
atccccgagggctgcacgggatgggaggagccggaggaggtgctggtggaggccacggtg
aagcactacgggaagctcttcacagccagccaggacacgcagaagcgcctcgaggccttc
tgccagatgatccaggctgtccaccggagcctggagtctctggaggaggaccacagggct
ctcatgttgagcctcaagatccggctgtgtcagctgcagaagaagtgctaccgcaagcag
gagcagtggtggcagctgaagcacagcatcacttaccagaaggacattgactttgacaca
cacaccagcagcagctataatgatcagctgctcggctacatgcaaatgaccatcaccaac
atggcccggcagtgctgcccctctgcccacggcgtgcccaagagcatggatctcttctcc
aagctcgatctgattaagcaagggatcagcaccaaaggaggacagacatggtggggaaat
tcaaagggattcatgaccatctgggcaatccgggaaggcctcatggacaatgacccagga
tttgaaagacaggtggtatctgacaagtgggcattccagaaggagcaaagaaggaatatc
cttggaaaatacagcatcttctttactttcctgtttctacatcactgggctaagcttgag
gaagggcgccccgtggaagacacaaagacacttgggaaggcccccagtgggcacctttgt
caggcctgtgctgggcactgtgacactgatgagaaagacaaggtgcctgcccgctgtagg
agctcactgcctgctggccaagacagctggacacaagcagtgtctgagctacaggtcatg
cctaagaggcctaaggacattgggaacacatggagcagcagagaagggacatgtagcagg
ctgaacaatggccccaatgatgtccacgtcctaatccccacaacctgtgactatgttacc
gtggcgagagggacttggcaggtggtgcatgaagatgccctcctgccctgtgctccccga
gctcagatctctgacaccttctctggcatctccatgtaccagtgccacacactgcggcct
cttctgagtgaaacatctttcaacctaatatcagaaaaatgtgacattctatccattctt
cgggaccatcctgaaaacaggatttaccggaggaaaatcgaggaactcagcaaaaggttc
accgccatccgcaagaccaaaggggatgggaactgcttctacagggccttgggctattcc
tacctggagtccctgctggggaagagcagggagatcttcaagttcaaagaacgcgtactg
cagaccccaaatgaccttctggctgctggctttgaggagcacaagttcagaaacttcttc
aatgctctgagggccgggcgggcgcagttttacagtgtggtggaactggtagagaaggat
ggctcagtgtccagcctgctgaaggtgttcaacgaccagagtgcctcggaccacatcgtg
cagttcctgcgcctgctcacgtcggccttcatcaggaaccgagcagacttcttccggcac
ttcattgatgaggagatggacatcaaagacttctgcactcacgaagtagagcccatggcc
acggagtgtgaccacatccagatcacggcgttgtcgcaggccctgagcattgccctgcaa
gtggagtacgtggacgagatggataccgccctgaaccaccacgtgttccctgaggccgcc
accccttccgtttacctgctctataaaacatcccactacaacatcctttatgcagccgat
aaacattga

>gi568815584r:93834659_94056932|GENSCAN_predicted_peptide_4|196_aa
VPRTSEIYVHRSGRTARATNEGLSLMLIGPEDVINFKKIYKTLKKDEDIPLFPVQTKYMD
VVKERIRLARQIEKSEYRNFQACLHNSWIEQAAAALEIELEEDMYKGGKADQQEERRRQK
QMKVLKKELRHLLSQPLFTESQKTKYPTQSGKPPLLVSAPSKSESALSCLSKQKKKKTKK
PKEPQPEQPQPSTSAN

>gi568815584r:93834659_94056932|GENSCAN_predicted_CDS_4|591_bp
gtcccacgtacctcggagatttatgtccaccgaagtggtcgaactgctcgagctaccaat
gaaggcctcagtctgatgctcattgggcctgaggatgtgatcaactttaagaagatttac
aaaacgctcaagaaagatgaggatatcccactgttccccgtgcagacaaaatacatggat
gtggtcaaggagcgaatccgtttagctcgacagattgagaaatctgagtatcggaacttc
caggcttgcctgcacaactcttggattgagcaggcagcagctgccctggagattgagctg
gaagaagacatgtataagggaggaaaagctgaccagcaagaagaacgtcggagacaaaag
cagatgaaggttctgaagaaggagctgcgccacctgctgtcccagccactgtttacggag
agccagaaaaccaagtatcccactcagtctggcaagccgcccctgcttgtgtctgcccca
agtaagagcgagtctgctttgagctgtctctccaagcagaagaagaagaagacaaagaag
ccgaaggagccacagccggaacagccacagccaagtacaagtgcaaattaa