GENSCAN 1.0    Date run: 5-Nov-116    Time: 10:16:54

Sequence gi568815588r:119068483_119278790 : 210308 bp : 46.80% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.05 Intr -   2603   2404  200  0  2   49   91    85 0.395   3.97
 1.04 Intr -   4571   4408  164  1  2   79  116   -20 0.378  -0.48
 1.03 Intr -   5095   4959  137  1  2   18   87    71 0.391  -0.63
 1.02 Intr -   5455   5265  191  2  2   88   87    60 0.654   5.20
 1.01 Init -  12194  12146   49  2  1  104  113    37 0.905   8.92
 1.00 Prom -  12825  12786   40                              -1.36

 2.00 Prom +  15455  15494   40                              -2.96
 2.01 Init +  35568  35723  156  2  0   65   29   176 0.877   7.31
 2.02 Term +  36282  36485  204  2  0   87   42   168 0.930   9.47
 2.03 PlyA +  36560  36565    6                              -0.45

 3.00 Prom +  37138  37177   40                              -6.06
 3.01 Init +  39109  39157   49  0  1   86   89    27 0.910   1.71
 3.02 Intr +  39486  39682  197  1  2   95   86    69 0.840   6.53
 3.03 Intr +  43367  43446   80  1  2   70   48    26 0.290  -4.85
 3.04 Intr +  49037  49185  149  2  2   93  116   135 0.901  16.68
 3.05 Intr +  51859  51970  112  2  1   47   69    43 0.730  -2.26
 3.06 Intr +  54987  55087  101  0  2   78   74   165 0.991  13.85
 3.07 Intr +  61033  61140  108  2  0   70   76    79 0.715   5.16
 3.08 Intr +  64033  64127   95  2  2   49   93    67 0.656   2.98
 3.09 Term +  67989  68165  177  2  0   58   42   112 0.825   1.29
 3.10 PlyA +  69273  69278    6                               1.05

 4.03 PlyA -  71724  71719    6                               1.05
 4.02 Term -  72703  72619   85  0  1   91   50    67 0.837   0.33
 4.01 Init -  74733  74678   56  0  2   77  111    41 0.903   4.24
 4.00 Prom -  76677  76638   40                              -2.26

 5.16 PlyA -  77159  77154    6                               1.05
 5.15 Term -  77871  77776   96  0  0  112   35    40 0.452  -0.93
 5.14 Intr -  78873  78676  198  0  0   46   30   118 0.235   1.35
 5.13 Intr -  86695  86580  116  2  2   74  116    99 0.999  11.47
 5.12 Intr -  88274  88196   79  2  1  149   97     1 0.993   6.22
 5.11 Intr -  89445  89389   57  0  0   76   83    65 0.642   3.88
 5.10 Intr -  91271  91246   26  0  2  114  101    -4 0.970   1.24
 5.09 Intr -  92487  92433   55  0  1   62  109    83 0.968   6.35
 5.08 Intr -  92599  92573   27  1  0   87   89    18 0.508   0.11
 5.07 Intr -  95714  95649   66  2  0   95   81    24 0.070   1.50
 5.06 Intr -  97171  97055  117  1  0   -5   42   145 0.206   1.26
 5.05 Intr - 100860 100695  166  2  1   -4   90   170 0.976   7.96
 5.04 Intr - 104003 103900  104  2  2   21  116    77 0.984   2.77
 5.03 Intr - 105390 105255  136  2  1   92   86    65 0.985   7.27
 5.02 Intr - 106078 105969  110  1  2   57   78    71 0.386   2.08
 5.01 Init - 110308 110273   36  1  0   86  109    87 0.817   8.78
 5.00 Prom - 115516 115477   40                              -4.66

 6.00 Prom + 118469 118508   40                              -3.76
 6.01 Init + 124966 124996   31  0  1   98   38    29 0.034  -1.20
 6.02 Intr + 130771 130890  120  2  0   56   78    64 0.093   2.67
 6.03 Intr + 139341 139487  147  1  0   36  116   213 0.760  19.11
 6.04 Term + 141681 141817  137  1  2   93   39    37 0.535  -2.52
 6.05 PlyA + 142499 142504    6                               1.05

 7.05 PlyA - 142672 142667    6                               1.05
 7.04 Term - 143289 143184  106  2  1  106   48   102 0.751   5.88
 7.03 Intr - 162502 162414   89  1  2  126   68    57 0.340   6.27
 7.02 Intr - 172817 172655  163  1  1   80   82    79 0.497   6.48
 7.01 Init - 174036 173987   50  0  2  103   80    34 0.892   4.53
 7.00 Prom - 174429 174390   40                              -4.06

 8.05 PlyA - 175705 175700    6                               1.05
 8.04 Term - 178662 178448  215  1  2  -10   41   221 0.305   4.99
 8.03 Intr - 183683 183612   72  0  0   68   37    99 0.165   2.18
 8.02 Intr - 204380 204331   50  1  2  118   71    55 0.612   5.12
 8.01 Init - 208771 208704   68  1  2   93   86    24 0.398   3.25

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  94062  94018   45  0  0   93  105    38 0.874   5.20

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815588r:119068483_119278790|GENSCAN_predicted_peptide_1|247_aa
MPAYFQRPENALKRANEFLEVGKKQPALDVLYDVMKSKKHRTWQKIHEPIMLKYLELCVD
LRKSHLAKEGLYQYKNICQQVNIKSLEDVVRAYLKMAEEKTEAAKEESQQMVLDIEDLDN
IQTPESVLLSAVSGEDTQDRTDRLLLTPWVKFLWESYRQCLDLLRNNSRVERLYHDIAQQ
AFKFCLQYTRKAEFRKLCDNLRMHLSQIQRHHNQSTAINLNNPESQSMHLETRLVQLDSA
ISMELWQ

>gi568815588r:119068483_119278790|GENSCAN_predicted_CDS_1|741_bp
atgccggcctattttcagaggccggaaaatgccctcaaacgcgccaacgaatttcttgag
gttggcaaaaagcagcctgctctggatgttctttatgatgttatgaaaagtaaaaaacat
agaacatggcaaaagatacacgaaccaattatgttgaaatacttggaactttgcgtggat
cttcgcaagagccacttggcaaaggaggggttataccagtataagaacatttgtcaacag
gtgaacataaaatctctggaggatgttgttagggcatatttgaaaatggcagaggaaaaa
actgaagctgctaaagaagaatctcagcagatggtcttagatatagaggatctagataat
attcaaactcctgagagtgttctcctaagtgctgtaagtggtgaagacactcaggatcgt
actgacagattacttttaactccatgggttaaattcctgtgggagtcttacaggcagtgt
ttggaccttcttagaaacaattctagagtagagcgcctgtaccatgatattgcccagcaa
gctttcaaattctgcctccaatacacgcgtaaggctgaattccgtaaactgtgtgacaat
ttgagaatgcacttatcgcagattcagcgccaccataaccaaagtacggcaatcaatctt
aataatccagagagccagtccatgcatttggaaaccagacttgttcagctggacagtgct
atcagcatggaattgtggcag

>gi568815588r:119068483_119278790|GENSCAN_predicted_peptide_2|119_aa
MGAGPTAGGAGFGCEAGPAGGSQSCAPRRRKMAAAEVADTQLMLGVGLIGEDLSPRTEGP
WETSDAHFTEEQMEAQAVIVHGHTIGRTEPGVGPSQCIPEPPTPTSGELLHHVIELVHK

>gi568815588r:119068483_119278790|GENSCAN_predicted_CDS_2|360_bp
atgggggcgggccctacggcgggcggggcggggttcgggtgcgaggccggtcctgcaggc
ggcagccagagctgcgcgccgcggcggcggaagatggctgcggccgaggtggcggacact
cagctgatgcttggagtcgggctgatcggtgaggacctctcaccaaggaccgaggggccg
tgggaaacgtctgacgcccacttcacggaggagcaaatggaggcccaggccgtcattgtg
cacggacacaccataggccgtacagagccgggagtgggaccctcgcagtgtatcccagag
cccccaacccccaccagtggagagctcctgcatcatgtgatcgagcttgtgcataaataa
>gi568815588r:119068483_119278790|GENSCAN_predicted_peptide_3|355_aa
MGFLQVGQAGLEVPTSEKDTNGEVLWVWCYPSTTATLRNLLLRKCCLTDENKLLHPFVFG
QYRRTWFYITTIEVPDSSILKKVTHFSIVLTAKDFNPEKYAAFTRILCRMYLKHGSPVKM
MESYIAVLTKGICQSEENGSFLSKDFDARKAYLAGSIKDIVSQFGMETVILHTALMLKKR
IVVYHPKIEAVQEFTRTLPALVWHRQDWTILHSYVHLNADELEALQMCTGYVAGFVDLEV
SNRPDLYDVFVNLAESEITIAPLAKEAMAMGKLHKEMGQLIVQSAEDPEKSESHVIQDIA
LKTREIFTNLAPFSEVSADGEKRVLNLEALKQKRFPPATENFLYHLAAAEQMLKI

>gi568815588r:119068483_119278790|GENSCAN_predicted_CDS_3|1068_bp
atggggtttctccaggttggtcaggctggtctcgaagtcccgacctcagaaaaggacaca
aatggagaagttctgtgggtgtggtgttatccttccacgacagccacattaaggaacctg
ctgctgagaaaatgctgccttacagatgaaaacaaacttctccatccctttgtctttggt
cagtacagaagaacatggttttatatcacaacaattgaagttccagattcttccattttg
aaaaaggtgactcatttttctattgtcctgaccgccaaagattttaacccagagaagtat
gctgccttcactaggatattgtgtagaatgtacctgaaacatgggagcccagttaaaatg
atggagagttatattgcagttctcacaaaggggatatgccagagtgaagaaaacggctct
ttccttagtaaggattttgatgcccgaaaggcctacctggctggctccatcaaagacatt
gtatctcagtttggaatggaaactgttatcttacacacagcactgatgctaaagaaaaga
attgtggtgtatcaccccaagatagaagcggtccaggagttcaccaggactctgcctgcc
ctggtgtggcaccgacaggactggaccatccttcactcttacgtgcacctcaacgccgat
gagctggaagccctgcagatgtgcacaggttacgtcgctggatttgtagacttggaggtg
agcaacagaccagacctctatgatgtgtttgtgaatctggcagagagtgagattaccatt
gctccccttgcaaaagaggccatggcaatgggcaaactgcacaaagaaatgggtcagcta
attgttcagtctgcagaagatccagagaaatcagagagccacgttatacaggatattgct
ctaaaaacaagagaaatctttaccaacctagcaccgttttcagaagtttcggctgatgga
gaaaagagagtccttaatttggaggcgctaaagcaaaaacgatttccaccagcaacagaa
aacttcctttatcatctagcagcagccgaacaaatgctgaaaatctga

>gi568815588r:119068483_119278790|GENSCAN_predicted_peptide_4|46_aa
MLGARWQGLHSQAKEQGSRFGLENFATGLLGTPEVSVLTRRLQIPY

>gi568815588r:119068483_119278790|GENSCAN_predicted_CDS_4|141_bp
atgctgggggccaggtggcagggcctgcacagccaggccaaggagcagggcagcaggttc
ggtttagagaactttgccacaggtcttctggggaccccagaggtgtctgtgctgacaagg
cgacttcagattccatactga

>gi568815588r:119068483_119278790|GENSCAN_predicted_peptide_5|462_aa
MAAAVGRLLRASHAPYFKGTAVVNGEFKDLSLDDFKGKYLVLFFYPLDFTFVCPTEIVAF
SDKANEFHDVNCEVVAVSVDSHFSHLAWINTPRKNGGLGHMNIALLSDLTKQISRDYGVL
LEGSGLALRGLFIIDPNGVIKHLSVNDLPVGRSVEETLRLVKAFQYVETHGEVCPANWTP
DSPTRKMSLEQEEETQPGRLLGRRDAVPAFIEPNVRFWITERQSFIRRFLQWTELLDPTN
VFISVIQEAWKRSLATVHPDSSNLIPKLFRPAAFLPFMAPTVFLCAYMAAFNSINGNRSY
VIPQFVQMKYGLTGPWIKRLLPVIFLVQASGMNVYMSRSLESIKGIAVMDKEGNVLGHSR
IAGTKKPEMAYEAEDLGIHQGSCGHQYEKPNLQELRPEGRRWGFRAGTGKDCSPWQLEQK
LPCWARLPVILDPVFQEKPRVIVDFETVLYCPGNGTDGAIFF

>gi568815588r:119068483_119278790|GENSCAN_predicted_CDS_5|1389_bp
atggcggctgctgtaggacggttgctccgagcgtcgcatgcaccctattttaagggtaca
gccgttgtcaatggagagttcaaagacctaagccttgatgactttaaggggaaatatttg
gtgcttttcttctatcctttggatttcacctttgtgtgtcctacagaaattgttgctttt
agtgacaaagctaacgaatttcacgacgtgaactgtgaagttgtcgcagtctcagtggat
tcccactttagccatcttgcctggataaatacaccaagaaagaatggtggtttgggccac
atgaacatcgcactcttgtcagacttaactaagcagatttcccgagactacggtgtgctg
ttagaaggttctggtcttgcactaagaggtctcttcataattgaccccaatggagtcatc
aagcatttgagcgtcaacgatctcccagtgggccgaagcgtggaagaaaccctccgcttg
gtgaaggcgttccagtatgtagaaacacatggagaagtctgcccagcgaactggacaccg
gattctcctacgcgcaaaatgtccctggaacaggaggaggaaacgcaacctgggcggctc
ctaggacgcagagacgccgtccccgccttcattgagcccaacgtgcgcttctggatcacc
gagcgccaatcctttattcgacgatttcttcaatggacagaattattagatcctacaaat
gtgttcatttcagttatacaagaagcttggaagcggagtcttgcaacagtgcatcccgac
agcagcaacctgatccccaagctttttcgacctgcagcgttcctgcctttcatggcaccc
acggttttcctctgtgcctacatggcagcgttcaacagcatcaatggaaacagaagttac
gtaatccctcagtttgtccagatgaagtatggcctgactggcccttggattaaaagactc
ttacctgtgatcttcctcgtgcaagccagtggaatgaatgtctacatgtcccgaagtctt
gaatccattaaggggattgcggtcatggacaaggaaggcaatgtcctgggtcattccaga
attgctgggacaaagaagccagaaatggcctacgaggcggaggacctgggcatccatcag
ggttcctgtggccaccagtatgagaaacctaacctgcaggagctcagacccgaggggaga
cgttggggctttagggccggaacgggcaaggattgcagtccatggcagcttgaacagaag
ctcccctgctgggcgcgcctccctgtcattctcgacccagtatttcaggaaaaacccagg
gtcattgtggattttgaaactgtcttgtactgtcctggcaatgggactgatggtgccatt
ttcttttag
>gi568815588r:119068483_119278790|GENSCAN_predicted_peptide_6|144_aa
MAFGKQRKQEGRSNKAPSMKQGALTRHHICWCPDLAFSSLQNYEQSISIVSAAAARAAAA
AAAAPQALTAPPAGSVADRRLSMELENIVANTVLLKAREGFFNFAVRVWSSSHGEKIALD
DALEMERACGSRERLTREQEKLII

>gi568815588r:119068483_119278790|GENSCAN_predicted_CDS_6|435_bp
atggcatttggaaaacagcgaaagcaggaaggacgtagcaacaaggcaccatccatgaag
cagggagctctcaccagacaccatatctgctggtgccctgatcttgcattttccagcctc
cagaactacgagcaatccatttctattgtttcagcggcggcagcccgagcagcggcagca
gcagcggcagcaccccaggcgctgacagccccgccggccggctccgttgctgaccgccga
ctgtcaatggagctggaaaacatcgtggccaacacggtcttgctgaaagccagggaaggc
tttttcaattttgctgtgagagtttggtcgtcatctcatggtgagaaaatagctcttgat
gatgccttggaaatggagagggcctgcggaagcagggagagactgacacgtgaacaagaa
aaattaatcatttaa

>gi568815588r:119068483_119278790|GENSCAN_predicted_peptide_7|135_aa
MMPGQLLDAGEHCQPVRKPQSMSTGTTEPTPHVAFALLHLILTEAPGIDDGTLPTSQTRK
LRPEEEGYLLQNPGSPHNTSKNHCPFGPTASAARSGVFKYRPRGANLDDQEDPAKSQSYL
KKRFLCILMFGNCDK

>gi568815588r:119068483_119278790|GENSCAN_predicted_CDS_7|408_bp
atgatgccagggcagctcttggatgcaggagagcactgtcagccagtcaggaaaccgcag
tctatgagcacagggaccacagagccaacccctcacgtggcctttgctcttctccatttg
attctcacagaagccccggggatagatgacggcaccctgcccacgtcacagacgaggaag
ctgaggccggaggaggagggctatctgctacagaaccctggaagcccccacaatacttcc
aagaaccactgcccatttggccctacagcttctgctgccagatctggtgtcttcaagtac
aggccccgaggtgctaatcttgatgatcaggaagatccagccaagtcacagagttacctg
aaaaaacggttcctgtgtattctaatgtttggcaattgtgacaaatga

>gi568815588r:119068483_119278790|GENSCAN_predicted_peptide_8|134_aa
MVVCGLSWGSGRNDQGVGTRHTRSINARTSDTDNCEYLTGWSCYSHSCPYRHEHATQASS
KRDERSGVGRPRTDSHSDAELQAIAKHHLITTAESNGVQNPILGPGHGSDPGIPDSFSDL
DRTQEALSEALCAS

>gi568815588r:119068483_119278790|GENSCAN_predicted_CDS_8|405_bp
atggtggtctgtgggctctcttgggggtctggccgaaatgatcaaggtgtcggcacccga
cacacaaggtctatcaatgcgagaacaagtgacacagacaactgtgaatatctgacagga
tggtcctgctactcacacagctgcccctatcggcacgagcacgccacgcaggccagcagc
aagagggatgagaggtctggagttggacgccctaggacggacagtcactcagatgccgag
ctccaggccattgccaagcaccacctcatcacaactgcagaaagcaacggggtacaaaat
cccatcctcgggccagggcacggatctgaccccggcatcccagattccttctcagatctg
gatagaacccaagaggctctcagcgaggccctctgtgccagttaa