GENSCAN 1.0 Date run: 7-Nov-116 Time: 19:14:41 Sequence gi568815590r:51720354_51961151 : 240798 bp : 38.89% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 28 23 6 1.05 1.03 Term - 1804 1689 116 2 2 20 43 175 0.377 3.95 1.02 Intr - 3397 3322 76 1 1 73 39 123 0.563 4.07 1.01 Init - 6070 6011 60 1 0 105 69 15 0.368 2.60 1.00 Prom - 7960 7921 40 -2.85 2.02 PlyA - 10744 10739 6 1.05 2.01 Sngl - 11335 11006 330 1 0 35 53 218 0.568 8.77 2.00 Prom - 21026 20987 40 -4.85 3.00 Prom + 26100 26139 40 -4.85 3.01 Init + 32532 32673 142 2 1 45 7 153 0.485 3.25 3.02 Intr + 33976 34094 119 2 2 67 85 122 0.670 9.06 3.03 Intr + 46578 46611 34 2 1 107 35 36 0.006 -2.92 3.04 Intr + 52293 52406 114 1 0 74 77 84 0.261 5.40 3.05 Term + 56107 56213 107 2 2 94 40 69 0.398 0.29 3.06 PlyA + 57046 57051 6 1.05 4.04 PlyA - 58206 58201 6 1.05 4.03 Term - 59071 58914 158 2 2 69 44 71 0.253 -2.09 4.02 Intr - 59403 59187 217 0 1 71 95 45 0.513 0.65 4.01 Init - 61720 61535 186 1 0 53 81 121 0.582 7.01 4.00 Prom - 69136 69097 40 -5.45 5.05 PlyA - 69549 69544 6 1.05 5.04 Term - 70327 70167 161 2 2 98 48 92 0.256 3.32 5.03 Intr - 72889 72730 160 1 1 62 60 107 0.169 3.94 5.02 Intr - 75598 75526 73 0 1 58 89 51 0.119 0.69 5.01 Init - 88991 88828 164 2 2 97 84 133 0.400 11.05 5.00 Prom - 99456 99417 40 -7.05 6.07 PlyA - 99837 99832 6 1.05 6.06 Term - 100365 99998 368 1 2 83 32 285 0.996 16.18 6.05 Intr - 111214 111091 124 1 1 84 94 102 0.993 9.64 6.04 Intr - 113336 113165 172 1 1 86 93 65 0.701 5.82 6.03 Intr - 125410 125308 103 2 1 53 97 111 0.966 6.81 6.02 Intr - 136949 136851 99 0 0 83 99 81 0.951 7.86 6.01 Init - 140798 140492 307 2 1 62 59 268 0.945 18.80 6.00 Prom - 153955 153916 40 -4.95 7.00 Prom + 158828 158867 40 -5.65 7.01 Init + 163246 163297 52 0 1 36 95 25 0.143 -0.63 7.02 Intr + 166072 166205 134 2 2 44 49 93 0.155 0.44 7.03 Term + 178402 179058 657 0 0 101 49 348 0.976 25.16 7.04 PlyA + 180744 180749 6 1.05 8.00 Prom + 185833 185872 40 -4.15 8.01 Init + 187137 187194 58 2 1 65 80 -5 0.290 -2.01 8.02 Intr + 194071 194213 143 2 2 75 78 155 0.873 12.35 8.03 Intr + 204344 204606 263 1 2 91 78 47 0.051 -0.64 8.04 Term + 214829 214952 124 2 1 45 43 174 0.736 5.48 8.05 PlyA + 215031 215036 6 1.05 9.05 PlyA - 215611 215606 6 1.05 9.04 Term - 222110 221932 179 0 2 57 55 165 0.813 6.97 9.03 Intr - 222646 222536 111 2 0 78 78 101 0.803 7.53 9.02 Intr - 226503 226431 73 0 1 76 86 57 0.259 2.36 9.01 Init - 229361 229290 72 2 0 72 29 71 0.244 0.72 9.00 Prom - 233191 233152 40 -5.45 10.03 PlyA - 233526 233521 6 1.05 10.02 Term - 234135 234037 99 0 0 103 43 101 0.852 4.25 10.01 Intr - 237816 237736 81 0 0 75 94 32 0.623 1.42 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 94417 94605 189 0 0 58 38 181 0.877 4.96 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590r:51720354_51961151|GENSCAN_predicted_peptide_1|83_aa MGLSMNSISLGFTKPKVKWDETAPPRSEELCRPAGAPAAQGPPSAATIMNQGKLSRLQAP VCIGEKATACGRKKVVRRTAAAD >gi568815590r:51720354_51961151|GENSCAN_predicted_CDS_1|252_bp atgggcctttcaatgaattccatttctcttgggtttaccaagccaaaggtgaagtgggat gaaacagccccacctcgctcagaggaactttgtcgacctgctggtgctcccgcagcccag ggcccaccttcggcagcaacaatcatgaaccagggaaagctcagccgactgcaggcacca gtgtgcattggtgagaaagcaactgcttgcggaaggaagaaggtggtccgtagaacagca gcagcagattag >gi568815590r:51720354_51961151|GENSCAN_predicted_peptide_2|109_aa MYENAWVSRHRCAAGQSPNGEPLLGQCKRECGMGTPTQSPSGAVRRGPPSSGPQNGSSTD SLHYVPGKAADTQRQFLKATRRGLYPAKPQGWSCLRPWKSTSCISVTWM >gi568815590r:51720354_51961151|GENSCAN_predicted_CDS_2|330_bp atgtatgaaaatgcctgggtgtccaggcataggtgtgctgcagggcagagccctaatgga gaacctctgctaggacagtgcaaaagggaatgtggaatgggaacccccacacagagtcct agtggagctgtgagaagagggccaccgtcctccggaccccaaaatggtagttccacggat agcttgcactatgtgcctggaaaagctgcagacactcaacgccagttcctgaaagcaacc aggagggggctgtaccctgcaaagccacaggggtggagctgcctaagaccatggaaatcc acctcttgcatcagtgtgacctggatgtga >gi568815590r:51720354_51961151|GENSCAN_predicted_peptide_3|171_aa MSIHGAYPWAQMSTNGVAVMSQIKTKVSARILGKGFVATANMRLPPAGAVDPIAGGSEST QKKPTTGGSPDEKPAQGQGKSHGFWLKNYDHVYVGALDGWGDPVSLGLYEVCVEGGKEEE AEDMGLRQGSMGLSCKGQPTLREIEKNLREIALNYRGLVSPNNPDKVSILD >gi568815590r:51720354_51961151|GENSCAN_predicted_CDS_3|516_bp atgagcatccatggggcatacccatgggcacaaatgtctacaaatggggtggctgtgatg tctcagatcaagaccaaggtgtctgccagaatcctggggaaaggcttcgtggcgactgcc aacatgcggctgcctccagccggggctgttgatcccattgcgggtggttctgagagcaca caaaagaaacccaccactggaggcagccccgatgagaagcctgcccaagggcaggggaaa agccatggcttctggctgaagaactatgatcatgtgtatgttggtgcccttgatggctgg ggtgacccagtgtctctagggttgtatgaagtctgtgtggaaggtgggaaggaggaagag gctgaagacatggggctgagacaaggtagcatgggtctgtcctgcaaaggccagccgaca cttagagaaatagaaaagaacctacgtgaaatagcattgaattatcgggggctggtttcc cccaataaccctgacaaagtctctatccttgactaa >gi568815590r:51720354_51961151|GENSCAN_predicted_peptide_4|186_aa MGREQPPYCVLSKFLTYTVVNCVAAGHLNTCGEEAIAKDELEVTMRRMGRYSGQEQPWPK LALKRDEDSQLLCGSLLVTKNMHNCQSGIHEIEFMGTHTWWTRTGSCVPSALPVSAHPFF LLPGPNFSGHRLSPGVPDIWDFCDRLFGLDGSSHSDTLETHIGAHRDFASPSDHDMCGLP LHTALA >gi568815590r:51720354_51961151|GENSCAN_predicted_CDS_4|561_bp atgggcagagaacagccaccctactgtgttttgtccaaattcctgacctacactgtggtt aactgtgtagcagcaggtcatttgaacacatgtggtgaggaggccatagctaaggatgaa ctggaagtcaccatgaggaggatggggcggtattctgggcaggagcagccttggccaaag ctggcgctgaaaagggatgaagacagccaactattatgtggcagccttctggttaccaag aacatgcacaattgtcagagcggaatccatgagatagaatttatgggcacacatacttgg tggaccagaactgggagctgcgtgccatcagcactccctgtctctgctcatcctttcttt ctgctgcctgggcctaacttctctgggcacaggctttctccaggggttcccgacatctgg gatttttgtgacaggttgtttggattggatggcagttctcactcagataccctggaaact cacataggtgcccacagggacttcgcctcgccgtcagatcatgatatgtgtggccttccc ttgcacacagccttggcctag >gi568815590r:51720354_51961151|GENSCAN_predicted_peptide_5|185_aa MEPRLFCWTTLFLLAGWCLPGLPCPSRCLCFKSTVRCMHLMLDHIPQVPQQTTVLPTFDH LRVHDVSLQGRGDQVNYPQSDPSFIGLLWFPGGPLQTWVPRASVISGGYRTAKMAAAPSP GLSVPEGHQPDAGPSLAHQGGQCRRELRLRHGPTCSLGKKGILESLCLIEKIMLPLVAPG CQKRK >gi568815590r:51720354_51961151|GENSCAN_predicted_CDS_5|558_bp atggagcccagactgttctgctggaccactctctttctcctggccgggtggtgcctgcca gggttgccctgccccagccggtgcctttgctttaagagcaccgtccgctgcatgcacttg atgctggaccacattcctcaggtaccacagcagaccacagttctaccaacctttgatcat ttgcgggtgcacgatgtctctctccagggtcggggtgaccaggttaattacccacagtca gacccctctttcatagggctgctgtggtttcctgggggtccactccagacctgggtccct cgtgcaagtgtcatcagtggaggttacagaacagcaaagatggctgccgctccttcccct gggctgtctgtcccagaggggcaccaacctgatgccggcccttccctcgcccatcaggga ggacagtgtcgcagagagctgcgactccggcatgggcccacgtgtagcctagggaagaaa gggatcctggaatctctgtgcctcatagaaaaaataatgctgcccttagttgctcctgga tgccagaagagaaaataa >gi568815590r:51720354_51961151|GENSCAN_predicted_peptide_6|390_aa MGGAVSAGEDNDDLIDNLKEAQYIRTERVEQAFRAIDRGDYYLEGYRDNAYKDLAWKHGN IHLSAPCIYSEVMEALKLQPGLSFLNLGSGTGYLSTMVGLILVNRSDSGETASAIQHCGC HYLAATTFIPKGEIIGPFGINHGIELHSDVVEYAKEKLESFIKNSDSFDKFEFCEPAFVV GNCLQIASDSHQYDRIYCGAGVQKDHENYMKILLKVGGILVMPIEDQLTQIMRTGQNTWE SKNILAVSFAPLVQPSKNDNGKPDSVGLPPCAVRNLQDLARIYIRRTLRNFINDEMQAKG IPQRAPPKRKRKRVKQRINTYVFVGNQLIPQPLDSEEDEKMEEDNKEEEEKDHNEAMKPE EPPQNLLREKIMKLPLPESLKAYLTYFRDK >gi568815590r:51720354_51961151|GENSCAN_predicted_CDS_6|1173_bp atgggaggagctgtgagtgctggggaagataatgatgacttaattgataatttaaaagaa gctcagtatattcgtactgaaagagtggagcaagccttcagagcgattgatcgtggagat tactatttggaaggctacagagacaatgcttacaaagacttagcctggaagcatggaaac atccacttgtcagcaccttgcatttattctgaagttatggaagcattgaaacttcaacca ggattgtcttttcttaacctgggaagtggaaccggatatttaagtacaatggtgggctta attttagttaataggagtgactctggggagactgctagtgccatccagcactgtggatgc cactaccttgctgctacaacattcattcctaaaggtgaaataattggtccttttggaata aatcatgggattgagcttcattcagatgtggtggaatatgccaaggaaaaactggagagc ttcatcaaaaatagtgatagctttgataaatttgagttctgtgaacctgcatttgttgtt ggtaattgcctccagatagcttctgacagtcatcagtatgatcgaatttattgtggagct ggagtgcagaaagaccatgaaaactacatgaaaatattactaaaagttggaggcatatta gtcatgcctatagaggatcagttaacacagattatgcgaactggacagaacacttgggaa agtaaaaatatccttgctgtttcatttgctccacttgtgcaaccaagtaagaatgataat ggcaaaccagattctgtgggactccctccctgtgctgtcaggaatctacaggacttggct cgtatttacattcgacgcacacttagaaatttcataaatgatgagatgcaggccaagggg attcctcaaagggctccacccaaaaggaaaagaaagagagttaaacagagaattaacact tacgtatttgtgggtaatcagcttattcctcagcctctagacagtgaagaggatgaaaaa atggaagaggataacaaagaagaggaggaaaaagatcacaatgaagcaatgaagccagag gagccacctcaaaatttactgagagaaaaaatcatgaagctgcccctccctgaatcttta aaagcttacttgacatattttagagacaaataa >gi568815590r:51720354_51961151|GENSCAN_predicted_peptide_7|280_aa MPDIFKTCSESQTENNQDYCNELQDDHTCSYPNFNGIPICHFDSSAQYTTFINKESSSKN PQRLSFLAPDLPALKSPCPRLFGCRPPGPLASKRIPVARPALARGTRTPHDPEPPLAAAL PVARAAARRPGARTRRGPQQPDAATTTITTRTPPPSGEAGPGPATGAARASRTPEPPGSG HAQRTTGTGAAPRGRQSQQGRETPPSGPAPLRARSAPKPKQREALTGQKTMAVGEITREH GLHLSSPPPSPRSRTGAGVQAFSGLRATACRCLAAQSPLH >gi568815590r:51720354_51961151|GENSCAN_predicted_CDS_7|843_bp atgcccgatatttttaaaacgtgttcagaaagtcaaactgaaaacaaccaagattactgc aatgaacttcaagatgatcatacctgtagttatcccaatttcaatgggattcctatatgc cattttgattcatctgcccagtatacaacttttatcaataaagaaagcagctccaaaaat ccacagcgcctcagtttcctggcccccgacctgcccgcccttaagtccccctgcccccga ctcttcggctgccgtccccctggccctctggcctccaagcgcatcccagtcgcccgcccg gccctagcacgcgggactcgcacaccccacgaccccgagcccccactggcggcggcgtta cctgtggcgcgggcagcggcgcgcaggccaggcgctaggactcggcggggtccgcagcag ccagacgccgctaccaccacaataacaacacggacgccaccgccgagtggagaggcgggg cccggacccgcgactggagcagccagagcctcccggactccggagccgccggggtccggg catgcgcagagaaccacgggcacaggggcagctccccgcggacgccaaagtcaacaaggc cgggagacgcccccatctggccccgcccccctcagggctcgctccgcccccaagcccaaa caacgcgaggccttgacaggccagaagacgatggcggtaggggagatcacgcgagagcat ggcctccacctctcctccccaccgccatctccacgttctcgcaccggcgctggggtgcag gctttcagtggtctcagggctacggcttgtcggtgcctcgccgcacagtcccctttgcat tga >gi568815590r:51720354_51961151|GENSCAN_predicted_peptide_8|195_aa MAHITSVHVHMAHTACRGTVFPDDMGICDDMGICDMGIPDDMGIWLCVPTQISSQIVIPV CPRRDQMSLNPLSLPKHRLKLFSEVPLSAQSPDLPKKKTIISDFFPEFSLTELFTETPGI LSRTCCSMYRMPITDSMSIAREEGFIGCSVEENRSNITGIYPNHCDANPTKAPDGKASIS GIQFGDLPCEEDEAI >gi568815590r:51720354_51961151|GENSCAN_predicted_CDS_8|588_bp atggcacacatcacttctgtccacgttcatatggctcacactgcctgcagggggacagta ttccctgatgatatgggaatctgtgatgatatgggaatctgtgatatgggaatccctgat gatatgggaatatggctctgtgtccccacccaaatctcatctcaaattgtgatccccgtg tgtccgaggagggaccaaatgtctctcaatcctctgtctctcccaaaacacaggctaaag ttgttctctgaagttcccttgtctgcccaaagtccagacctaccaaagaagaaaacaatt atctctgatttcttccctgagttttcattaactgaactttttactgaaacaccagggatt ctgtctaggacttgctgctccatgtacagaatgccaatcactgactcaatgagtattgcc agggaagaaggctttattgggtgctcagtggaggagaacaggagtaacatcacggggatt tacccaaaccactgtgatgccaatccaaccaaagctcctgatggaaaagcttcaatctca ggaattcagtttggcgacttgccctgtgaagaagatgaagctatatag >gi568815590r:51720354_51961151|GENSCAN_predicted_peptide_9|144_aa MKKLYSHIDPITLWESSNWLLEFGHHGSLELGRRKQQHKKEGGKEVREGPQRFAVSGEGT RDNTISGAVGAMKAEKEVPRQGVLRDLYAFSPTYLPFASSFFSESSEGEGEVFSCGPYMA KHHALSGMHSASERLAAMAQDATL >gi568815590r:51720354_51961151|GENSCAN_predicted_CDS_9|435_bp atgaagaagctgtattctcatattgatcccattactctgtgggagtcttcaaactggctg ctggaatttgggcaccatgggagtttagagctgggaaggagaaagcagcaacacaagaaa gaaggtggtaaggaagtgagagaaggacctcaaagattcgcagtgagtggagaggggacc agggacaatacaatttcaggagcagtaggggccatgaaagcagagaaggaagtgcccagg caaggcgtcctacgagatctatatgccttttctccaacatatctgccttttgcgagttca ttttttagtgaatcttcagagggtgaaggagaagttttttcctgtggcccatatatggcc aagcatcatgccctgtcaggcatgcacagcgcatctgagcgactcgcagcaatggctcag gatgccacactctga >gi568815590r:51720354_51961151|GENSCAN_predicted_peptide_10|59_aa VTPLNNVGVRGLTPTLADFVIWPHGGQAVAAFRDQRPLQAWSKITQAAAFSMALNSMYT >gi568815590r:51720354_51961151|GENSCAN_predicted_CDS_10|180_bp gtcactcctttgaacaatgtaggagttagaggactaactcctacacttgcggattttgtc atctggcctcacggtggccaggctgttgcagccttcagagatcagcgtcctctgcaagcc tggagcaagattactcaggcagcagcgttttctatggcgctaaattccatgtacacatag