GENSCAN 1.0 Date run: 3-Nov-116 Time: 21:38:07 Sequence gi568815587r:111642053_111866361 : 224309 bp : 41.43% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 11397 11621 225 2 0 58 54 194 0.552 8.80 1.02 PlyA + 12026 12031 6 1.05 2.04 PlyA - 12104 12099 6 1.05 2.03 Term - 29334 28901 434 1 2 -95 47 395 0.137 11.57 2.02 Intr - 29770 29549 222 2 0 -117 66 301 0.112 4.58 2.01 Init - 30172 29869 304 1 1 53 32 320 0.081 18.79 2.00 Prom - 34061 34022 40 -6.05 3.00 Prom + 34325 34364 40 -4.55 3.01 Init + 39872 39920 49 1 1 44 110 29 0.392 1.86 3.02 Intr + 45949 46110 162 2 0 117 93 187 0.846 21.13 3.03 Intr + 58834 58958 125 2 2 62 19 162 0.668 6.08 3.04 Intr + 59400 59523 124 2 1 78 84 76 0.994 5.44 3.05 Intr + 61043 61371 329 0 2 84 88 319 0.823 25.99 3.06 Intr + 62935 63087 153 0 0 88 90 166 0.911 16.15 3.07 Intr + 70159 70323 165 0 0 79 102 115 0.455 11.24 3.08 Intr + 77723 77951 229 1 1 85 96 184 0.983 15.42 3.09 Intr + 78426 78710 285 1 0 81 84 223 0.999 17.59 3.10 Intr + 78847 79010 164 2 2 96 94 86 0.966 8.77 3.11 Intr + 79778 79888 111 1 0 84 94 139 0.996 13.76 3.12 Intr + 80613 80704 92 2 2 113 66 101 0.998 8.27 3.13 Intr + 81444 82058 615 0 0 29 79 666 0.348 50.34 3.14 Intr + 82702 82952 251 1 2 45 44 164 0.329 3.86 3.15 Intr + 85285 85427 143 2 2 105 13 73 0.115 0.65 3.16 Intr + 88249 88301 53 0 2 90 83 26 0.048 -0.91 3.17 Intr + 89941 90103 163 1 1 50 75 137 0.176 7.66 3.18 Intr + 91444 91744 301 0 1 55 59 145 0.019 3.68 3.19 Term + 95416 95894 479 2 2 101 48 232 0.821 14.52 3.20 PlyA + 97707 97712 6 1.05 4.13 PlyA - 98180 98175 6 -0.45 4.12 Term - 100613 100467 147 2 0 90 44 81 0.671 0.82 4.11 Intr - 101478 101324 155 1 2 62 101 100 0.960 7.57 4.10 Intr - 105962 105902 61 2 1 79 101 50 0.982 2.79 4.09 Intr - 110280 110107 174 0 0 56 105 215 0.931 19.21 4.08 Intr - 111525 111391 135 0 0 67 90 32 0.589 1.14 4.07 Intr - 113042 112935 108 2 0 66 32 127 0.463 4.46 4.06 Intr - 113398 113243 156 1 0 72 88 186 0.996 16.29 4.05 Intr - 117899 117752 148 1 1 82 97 191 0.998 18.72 4.04 Intr - 118999 118767 233 1 2 45 72 201 0.999 9.85 4.03 Intr - 122853 122753 101 1 2 93 105 64 0.994 7.51 4.02 Intr - 123332 123242 91 2 1 58 91 48 0.996 0.75 4.01 Init - 124309 124196 114 1 0 91 84 118 0.906 12.24 4.00 Prom - 137441 137402 40 -4.65 5.10 PlyA - 137503 137498 6 1.05 5.09 Term - 144468 144345 124 2 1 105 43 109 0.343 4.98 5.08 Intr - 167721 167591 131 0 2 73 77 47 0.003 0.67 5.07 Intr - 194242 194113 130 0 1 96 64 143 0.844 12.48 5.06 Intr - 195563 195416 148 0 1 131 108 126 0.992 17.27 5.05 Intr - 202671 202549 123 1 0 126 101 50 0.991 9.74 5.04 Intr - 211433 211328 106 2 1 93 91 79 0.954 7.57 5.03 Intr - 214688 214655 34 1 1 85 91 -3 0.402 -2.99 5.02 Intr - 218583 218495 89 0 2 103 111 76 0.816 9.25 5.01 Intr - 222226 222168 59 2 2 70 60 75 0.292 0.58 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 29254 28901 354 1 0 77 47 302 0.837 20.80 S.002 Init + 91306 91744 439 0 1 68 59 248 0.962 16.22 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587r:111642053_111866361|GENSCAN_predicted_peptide_1|74_aa KNSSNRRQEKRRSKKMDSEKGNWHFRRKEDYFRERFLKLRCNEGLSSSQGGDQIGYRYER AYCLAKKLSKDIKQ >gi568815587r:111642053_111866361|GENSCAN_predicted_CDS_1|225_bp aagaattcttccaacagaaggcaagaaaagagaagatccaagaagatggacagtgagaaa gggaattggcattttcgtaggaaagaggactacttcagagagaggttcctaaaattgagg tgtaatgaaggcttgtcttcctcacagggtggagatcagattggatacagatatgaacgt gcatattgccttgcaaagaagctttctaaggacattaaacagtga >gi568815587r:111642053_111866361|GENSCAN_predicted_peptide_2|319_aa MLRPVVWGGITAIMVNQSLLSSFKLEVDPNIQAVCNQEKEQIKTLDKKFASFINKLWFLE QQNKALETKWSLLQQQKMAGSNMDEMFESHINTLWWQLDTLEKENEFVFIKKDVDEAYMN KAELESHLEGLTDEINFLKQLYEGDPGAAVPDLGHICGSVHGQQLLPGHGQHHHQVCGEL AIKHTNAKLVELEAALQWANQDMVWELHEYQELMNINLALDIEITTYCKLLQGKESWLES GMQNMSIHRKTTSGYSGGLSWAYRGLTSSGLNYGLGSSFGSGGGSGSFSCPSSSRDLIVK KIKIHDGKLMSKSSDVPSK >gi568815587r:111642053_111866361|GENSCAN_predicted_CDS_2|960_bp atgctgaggccagtggtatgggggggcatcacagccattatggtaaaccagagcctgcta agctcttttaagctggaagtggaccccaacatccaggccgtgtgtaaccaggagaaggag cagatcaagacccttgacaagaagtttgcctccttcatcaacaagttgtggttcctggag cagcagaacaaggcacttgagaccaagtggagcctcctgcagcagcagaagatggctggg agcaacatggatgagatgttcgagagccacatcaacaccctttggtggcagctggacact ctggagaaggaaaacgaatttgtcttcatcaagaaggatgtggatgaagcttacatgaac aaagcagagctggagtctcacctggaagggttgactgatgagatcaacttcctcaagcag ctgtatgaaggagatccaggagctgctgtcccagatcttggacacatctgtggttctgtc catggacaacagctgctccctggacatggacagcatcatcaccaagtatgtggggagctg gccattaagcacaccaatgccaagctggtggagctggaggccgccctgcaatgggccaat caggacatggtgtgggagctgcatgagtaccaggagctcatgaacatcaatttagccctg gacatcgagatcactacctactgcaagctgctgcagggcaaggaaagctggctggagtct gggatgcagaatatgagtatccataggaagaccaccagtggctactcaggtgggctgagc tgggcctataggggtctcacaagctctggtctcaactatggcctgggctctagctttggt tctggtgggggttccggctccttcagctgccccagctcctccagggacctgattgtgaag aagatcaagatccatgatgggaagctgatgtctaagtcctctgatgtcccgtccaagtga >gi568815587r:111642053_111866361|GENSCAN_predicted_peptide_3|1330_aa MKTQALSKTNYVRNSKDYLANHGRLNESEARRKFWQILSAVDYCHGRKIVHRDLKAENLL LDNNMNIKIADFGFGNFFKSGELLATWCGSPPYAAPEVFEGQQYEGPQLDIWSMGVVLYV LVCGALPFDGPTLPILRQRVLEGRFRIPYFMSEVEDTKINTLVQSHMSQANNPEVQGDSC DFCNIVFSIDCEHLIRRMLVLDPSKRLTIAQIKEHKWMLIEVPVQRPVLYPQEQENEPSI GEFNEQVLRLMHSLGIDQQKTIESLQNKSYNHFAAIYFLLVERLKSHRSSFPVEQRLDGR QRRPSTIAEQTVAKAQTVGLPVTMHSPNMRLLRSALLPQASNVEAFSFPASGCQAEAAFM EEECVDTPKVNGCLLDPVPPVLVRKGCQSLPSNMMETSIDEGLETEGEAEEDPAHAFEAF QSTRSGQRRHTLSEVTNQLVVMPGAGKIFSMNDSPSLDSVDSEYDMGSVQRDLNFLEDNP SLKDIMLANQPSPRMTSPFISLRPTNPAMQALSSQKREVHNRSPVSFREGRRASDTSLTQ GIVAFRQHLQNLARTKGILELNKVQLLYEQIGPEADPNLAPAAPQLQDLASSCPQEEVSQ QQESVSTLPASVHPQLSPRQSLETQYLQHRLQKPSLLSKAQNTCQLYCKEPPRSLEQQLQ EHRLQQKRLFLQKQSQLQAYFNQMQIAESSYPQPSQQLPLPRQETPPPSQQAPPFSLTQP LSPVLEPSSEQMQYSPFLSQYQEMQLQPLPSTSGPRAAPPLPTQLQQQQPPPPPPPPPPR QPGAAPAPLQFSYQTCELPSAASPAPDYPTPCQYPVDGAQQSDLTGPDCPRSPGLQEAPS SYDPLALSELPGLFDCEMLDAVDPQHNGNRLHLHSEAGDTPLQKVPWVAECQNILRTPEV SHVELTGDPPPWRQGARNSRTHQEHQPWAREDRLFLQFLVDTAGLRAVGLQGKKKSVALS SSDIKNSQVFKPYAGVSGWDRQEPKTEASPCLLCPFPTLKICSVKEIQVVRPFHHGWTLF SSAAFVSRRLQQGGEGEQQGGLHPKPRASAQWMEPDSSACTSQVLHQDAGQVGWDSSGSS SSLPKSEMPRVGLTDKSDPEETVHCHPRATHIVGGGTQWNGGKAHQALSSKHATHSSRSA QPCLSLLSGIRQNCLPVATEPKCLAVPLCQQSTVGLGKWFLSARGTLGSPLSFTGNSSFL SLCATTRDTDSGAVTVPDQGTSLPSPPVLPSQDALSCASRVGCPAAAYKYMGHWAEKAGA AQGRAEVPFPRGPGKRHPIGPFGVHLAVPHILVTDGLQSPNHRKDLAPFRTHISLREQLW RQKWLGFPTQ >gi568815587r:111642053_111866361|GENSCAN_predicted_CDS_3|3993_bp atgaaaactcaggctctttccaaaaccaactatgtcagaaactctaaggactatcttgct aatcatggccggttaaatgagtctgaagccaggcgaaaattctggcaaatcctgtctgct gttgattattgtcatggtcggaagattgtgcaccgtgacctcaaagctgaaaatctcctg ctggataacaacatgaatatcaaaatagcagatttcggttttggaaatttctttaaaagt ggtgaactgctggcaacatggtgtggcagccccccttatgcagccccagaagtctttgaa gggcagcagtatgaaggaccacagctggacatctggagtatgggagttgttctttatgtc cttgtctgtggagctctgccctttgatggaccgactcttccaattttgaggcagagggtt ctggaaggaagattccggattccgtatttcatgtcagaagtagaggatacaaagattaac acccttgtacaaagtcacatgagtcaagcaaataaccctgaagtgcaaggtgattcttgt gacttttgtaacattgtgttttctatagattgcgagcaccttatccgaaggatgttggtc ctagacccatccaaacggctaaccatagcccaaatcaaggagcataaatggatgctcata gaagttcctgtccagagacctgttctctatccacaagagcaagaaaatgagccatccatc ggggagtttaatgagcaggttctgcgactgatgcacagccttggaatagatcagcagaaa accattgagtctttgcagaacaagagctataaccactttgctgccatttatttcttgttg gtggagcgcctgaaatcacatcggagcagtttcccagtggagcagagacttgatggccgc cagcgtcggcctagcaccattgctgagcaaacagttgccaaggcacagactgtggggctc ccagtgaccatgcattcaccgaacatgaggctgctgcgatctgccctcctcccccaggca tccaacgtggaggccttttcatttccagcatctggctgtcaggcggaagctgcattcatg gaagaagagtgtgtggacactccaaaggtcaatggctgtctgcttgaccctgtgcctcct gtcctggtgcggaagggatgccagtcactgcccagcaacatgatggagacctccattgac gaagggctggagacagaaggagaggccgaggaagaccccgctcatgcctttgaggcattt cagtccacacgcagcgggcagagacggcacactctgtcagaagtgaccaatcaactggtc gtgatgcctggggcagggaaaattttctccatgaatgacagcccctcccttgacagtgtg gactctgagtatgatatggggtctgttcagagggacctgaactttctggaagacaaccct tcccttaaggacatcatgttagccaatcagccttcaccccgcatgacatctcccttcata agcctgagacctaccaacccagccatgcaggctctgagctcccagaaacgagaggtccac aacaggtctccagtgagcttcagagagggccgcagagcatcagatacctccctcacccag ggaattgtagcatttagacaacatcttcagaatctggctagaaccaaaggaattctagag ttgaacaaagtgcagttgttgtatgaacaaataggaccggaggcagaccctaacctggcg ccggcggctcctcagctccaggaccttgctagcagctgccctcaggaagaagtttctcag cagcaggaaagcgtctccactctccctgccagcgtgcatccccagctgtccccacggcag agcctggagacccagtacctgcagcacagactccagaagcccagccttctgtcaaaggcc cagaacacctgtcagctttattgcaaagaaccaccgcggagccttgagcagcagctgcag gaacataggctccagcagaagcgactctttcttcagaagcagtctcaactgcaggcctat tttaatcagatgcagatagcagagagctcctacccacagccaagtcagcagctgcccctt ccccgccaggagactccaccgccttctcagcaggccccaccgttcagcctgacccagccc ctgagccccgtcctggagccttcctccgagcagatgcaatacagccctttcctcagccag taccaagagatgcagcttcagcccctgccctccacttccggtccccgggctgctcctcct ctgcccacgcagctacagcagcagcagccgccaccgccaccaccccctccaccaccacga cagccaggagctgccccagcccccttacagttctcctatcagacttgtgagctgccaagc gctgcttcccctgcgccagactatcccactccctgtcagtatcctgtggatggagcccag cagagcgacctaacggggccagactgtcccagaagcccaggactgcaagaggccccctcc agctacgacccactagccctctctgagctacctggactctttgattgtgaaatgctagac gctgtggatccacaacacaacggaaaccgccttcatctccattcggaagcaggtgacaca ccccttcagaaggtgccctgggttgccgagtgtcagaatatactcaggactccagaggtg tcacacgtggaactgacaggagacccgccaccgtggaggcagggggcaagaaactcaaga acgcatcaagagcaccagccctgggccagggaagacaggctcttcctgcagtttctcgtg gacactgctggcttgcgggcagtcggtctccagggaaagaaaaagtcagtggccctttct tcctcagatatcaagaactcccaagtgtttaaaccgtatgctggagtcagtggttgggac agacaggagcccaagactgaagccagcccttgcctcttgtgtcccttcccaactctgaag atttgctcagtcaaggaaattcaagtggtgagacctttccaccatgggtggacactcttc agttctgcagcctttgtgagtcgaaggctccagcagggtggggaaggagagcagcaggga ggcctgcaccccaaacccagggcctctgcccagtggatggaaccagacagcagtgcctgc acttcccaagttctccatcaagacgcaggacaggtcggctgggacagttctggctccagc tcctcactccccaaaagtgaaatgccccgagtggggctgactgacaaatcagaccctgag gagactgttcattgtcaccccagggccacccacatagttgggggtgggacacaatggaat ggaggaaaagcccaccaagccctttcctccaagcacgccacacatagctcccggagcgca cagccttgcctgtctcttctgtctgggatccgccagaactgccttccagtcgctacagag cccaaatgcttagcagtgcccctgtgccagcaaagcactgtgggtcttgggaagtggttc ttgtcagcccgagggacactgggttctccactgtccttcacaggaaattctagcttcctc agcctttgtgccaccactagagacacagacagtggcgctgtaactgtccctgaccagggc acatccctcccaagcccacctgtgcttcccagccaggatgccctcagttgtgccagcaga gttgggtgtccagcagcagcctacaagtatatggggcactgggcagagaaagctggggca gcccagggtcgtgctgaggtgcctttccctcggggccctggaaagcgccaccccatcggg ccttttggtgtccacctggctgttccccacatcctggtcactgatggtctccagagcccc aaccacaggaaagacctggctcctttcagaactcatatcagtttaagagaacaactctgg agacagaaatggcttggctttccgacgcaatga >gi568815587r:111642053_111866361|GENSCAN_predicted_peptide_4|540_aa MAGASELGTGPGAAGGDGDDSLYPIAVLIDELRNEDVQLRLNSIKKLSTIALALGVERTR SELLPFLTDTIYDEDEVLLALAEQLGNFTGLVGGPDFAHCLLPPLENLATVEETVVRDKA VESLRQISQEHTPVALEAYFVPLVKRLASGDWFTSRTSACGLFSVCYPRASNAVKAEIRQ QFRSLCSDDTPMVRRAAASKLGEFAKVLELDSVKSEIVPLFTSLASDEQDSVRLLAVEAC VSIAQLLSQDDLETLVMPTLRQAAEDKSWRVRYMVADRFSELQKAMGPKITLNDLIPAFQ NLLKDCEAEVRAAAAHKELVSDTNQHVKSALASVIMGLSTILGKENTIEHLLPLFLAQLK DECPDVRLNIISNLDCVNEVIGIRQLSQSLLPAIVELAEDAKWRVRLAIIEYMPLLAGQL GVEFFDEKLNSLCMAWLVDHVYAIREAATNNLMKLVQKFGTEWAQNTIVPKVLVMANDPN YLHRMTTLFCINALSEACGQEITTKQMLPIVLKMAGDQVANVRFNVAKSLQKIGPILDTK >gi568815587r:111642053_111866361|GENSCAN_predicted_CDS_4|1623_bp atggcgggcgcatcagagctcgggaccggcccaggagcagcgggtggagatggagatgat tcgctatacccgatcgcggttttaatcgacgagctccgcaatgaagacgtgcagctccga ctcaacagtattaagaagttatcaacaattgccctagcacttggagtagaaaggacccga agtgaattgttgccatttcttacagatacaatttatgatgaagatgaggtactattagct cttgctgagcagctgggaaatttcactggcctagtgggaggtcctgactttgcccactgt ctgctgcctcctttggaaaatctggcaactgtggaagagactgttgttcgtgacaaggct gtggagtccctgagacagatctcccaggagcatactcctgttgctctggaagcttatttt gtacctctggtgaaacgcttagcaagtggggattggttcacctctcgcacatctgcatgt ggtttgttcagcgtttgctatcccagggcatcaaatgctgttaaagcagaaatcagacag caattccgttccttgtgctcagatgacacaccaatggtacgacgtgctgctgcttccaaa ttgggtgaatttgcaaaagttttggaattagacagtgtgaaaagtgaaattgttccactg ttcactagtctagcttcagatgaacaggattcagtgcgcctccttgctgtggaagcttgt gtcagtattgcccagttattgtctcaggatgaccttgagactttggtgatgcctacactt cgacaagcagcagaagataaatcttggcgcgttcgctatatggtggctgacagattttca gagctccagaaagccatgggtcctaaaatcaccctaaatgacctcatccccgcctttcag aacctacttaaagactgtgaagctgaagtccgggcagctgctgcccacaaagaattagta tccgataccaatcaacatgtcaaatcggctctagcttctgtaattatgggattgtctact attttgggcaaagaaaataccattgaacatcttctacctcttttcttagctcagttaaag gatgagtgtcctgacgttcgtttgaatatcatctccaatttggattgtgtaaatgaagtg attggaatccgtcagctctctcagtctctccttcctgccatagtggagctggcagaagat gccaaatggagggtccgcctggccatcattgagtatatgccgctgctggcaggccagctg ggtgtggaattctttgatgaaaagctgaattctttatgtatggcttggctcgtggaccat gtatacgccatccgagaagctgccaccaacaacctcatgaaactagttcagaagtttggt acagagtgggcccaaaatactattgttcccaaagtgttagtaatggcaaatgatcctaat tacttgcatagaatgaccactttattctgcattaatgcactgtctgaggcctgtggtcag gaaataactactaagcaaatgctgcccatcgtattaaaaatggcaggagaccaagtagca aatgttcgcttcaatgtggccaaatctctacaaaagattggaccaattctagataccaag taa >gi568815587r:111642053_111866361|GENSCAN_predicted_peptide_5|314_aa XATLEEPDMVHVRAGPVRGLGCVQEVWVARESNDASLLGSQHWHVLLIISVWYLNFLPVP QVPVVVIDSYYYGKLVIAPLNIVLYNVFTPHGPDLYGTEPWYFYLINGFLNFNVAFALAL LVLPLTSLMEYLLQRFHGYHGPLDLYPEFYRIATDPTIHTVPEGRPVNVCVGKEWYRFPS SFLLPDNWQLQFIPSEFRGQLPKPFAEGPLATRIVPTDMNDQNLEEPSRYIDISKCHYLV DLDTMRETPREPKYSSNKEEWISLAYRPFLDASRSSKLLRAFYVPFLSDQYTVYVNYTIL KPRKAKQIRKKSGG >gi568815587r:111642053_111866361|GENSCAN_predicted_CDS_5|945_bp ncggcaacgctggaggagccagacatggtgcacgtgagggccgggccagttcgtgggttg ggctgtgtgcaagaagtttgggttgcacgtgagtcgaatgatgctagccttcttggttct cagcactggcatgttttgctcatcatcagtgtgtggtaccttaatttcttacctgtacca caggtgcctgtggtggtcattgacagctactattatgggaagttggtgattgcaccactc aacattgttttgtataatgtctttactcctcatggacctgatctttatggtacagaaccc tggtatttctatttaattaatggatttctgaatttcaatgtagcctttgctttggctctc ctagtcctaccactgacttctcttatggaatacctgctgcagagatttcatggatatcac gggccccttgatttgtatccagaattttaccgaattgctacagacccaaccatccacact gtcccagaaggcagacctgtgaatgtctgtgtgggaaaagagtggtatcgatttcccagc agcttccttcttcctgacaattggcagcttcagttcattccatcagagttcagaggtcag ttaccaaaaccttttgcagaaggacctctggccacccggattgttcctactgacatgaat gaccagaatctagaagagccatccagatatattgatatcagtaaatgccattatttagtg gatttggacaccatgagagaaacaccccgggagccaaaatattcatccaataaagaagaa tggatcagcttggcctatagaccattccttgatgcttctagatcttcaaagctgctgcgg gcattctatgtccccttcctgtcagatcagtatacagtgtacgtaaactacaccatcctc aaaccccggaaagcaaagcaaatcaggaagaaaagtggaggttag