GENSCAN 1.0 Date run: 8-Nov-116 Time: 14:15:25

Sequence gi568815594r:143897539_144240635 : 343097 bp : 37.76% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.03 PlyA -     20     15    6                               1.05
 1.02 Term -  10074  10007   68  1  2   83   42    80 0.529   0.02
 1.01 Init -  14515  13978  538  1  1   58   70   320 0.615  22.27
 1.00 Prom -  19287  19248   40                              -3.65

 2.04 PlyA -  20052  20047    6                               1.05
 2.03 Term -  26590  26456  135  1  0   77   34   137 0.939   4.24
 2.02 Intr -  29030  28906  125  0  2   74   36    39 0.202  -3.32
 2.01 Init -  33055  32872  184  1  1   56   86   125 0.673   8.43
 2.00 Prom -  35071  35032   40                              -4.55

 3.00 Prom +  35501  35540   40                              -5.65
 3.01 Init +  35555  35973  419  1  2   81   53   194 0.462  11.15
 3.02 Intr +  54916  55045  130  1  1   73   75    52 0.000   2.18
 3.03 Intr +  69550  69641   92  0  2   84   95    79 0.497   6.07
 3.04 Term +  84033  84321  289  0  1   68   39   217 0.900   8.66
 3.05 PlyA +  84717  84722    6                               1.05

 4.06 PlyA -  85759  85754    6                               1.05
 4.05 Term -  86228  86016  213  2  0   79   43   104 0.142   1.25
 4.04 Intr - 101911 101873   39  1  0   68   91    56 0.075   1.40
 4.03 Intr - 103069 102846  224  2  2   24  -19   196 0.011  -0.58
 4.02 Intr - 112929 112806  124  0  1   65   69    58 0.041   0.84
 4.01 Init - 119374 119264  111  1  0   33  107    90 0.114   5.66
 4.00 Prom - 120680 120641   40                              -6.25

 5.13 PlyA - 121132 121127    6                               1.05
 5.12 Term - 123851 123793   59  0  2   79   45    81 0.060  -0.23
 5.11 Intr - 127824 127786   39  1  0  121   93    31 0.050   4.18
 5.10 Intr - 150609 150505  105  1  0   91   46   133 0.529   8.67
 5.09 Intr - 162948 162804  145  0  1   33  106   146 0.542   9.83
 5.08 Intr - 165702 165623   80  1  2   64  106    56 0.256   3.35
 5.07 Intr - 192223 192113  111  2  0   53   83    75 0.206   2.93
 5.06 Intr - 193482 193318  165  1  0   44   96    66 0.130   2.01
 5.05 Intr - 201324 201246   79  0  1   62   98    54 0.035   2.01
 5.04 Intr - 219401 219316   86  0  2   99  100    40 0.864   4.92
 5.03 Intr - 222243 222148   96  1  0   87   96   115 0.066  11.26
 5.02 Intr - 235119 234996  124  0  1   71   69    60 0.147   1.64
 5.01 Init - 240733 240623  111  1  0   33  111    64 0.310   3.46
 5.00 Prom - 250874 250835   40                              -2.75

 6.02 PlyA - 251634 251629    6                               1.05
 6.01 Sngl - 253131 252322  810  0  0   70   44   352 0.945  24.53
 6.00 Prom - 254078 254039   40                              -6.15

 7.03 PlyA - 254247 254242    6                               1.05
 7.02 Term - 255421 255158  264  1  0   66   44   250 0.691  12.92
 7.01 Init - 256503 256465   39  0  0  111   49    61 0.393   4.74
 7.00 Prom - 262283 262244   40                              -0.95

 8.03 PlyA - 262400 262395    6                               1.05
 8.02 Term - 266656 266206  451  0  1   34   48   263 0.726  10.57
 8.01 Init - 275701 275652   50  1  2   73   87    36 0.206   2.47
 8.00 Prom - 276169 276130   40                              -4.45

 9.07 PlyA - 276309 276304    6                               1.05
 9.06 Term - 281922 281696  227  1  2  119   44    76 0.335   2.26
 9.05 Intr - 298350 298074  277  0  1   45   85   112 0.391   2.77
 9.04 Intr - 301055 300742  314  0  2   23   51   285 0.160  13.28
 9.03 Intr - 308534 308483   52  2  1   83  115    23 0.009   2.06
 9.02 Intr - 332100 331947  154  2  1   37  105    74 0.074   3.15
 9.01 Init - 336656 336601   56  2  2   35  115    37 0.577   1.94
 9.00 Prom - 338716 338677   40                              -3.65

10.02 PlyA - 339092 339087    6                               1.05
10.01 Term - 341865 341459  407  1  2   66   49   186 0.692   6.86

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 187357 187437   81  0  0  120   51   119 0.946   8.11
S.002 Init - 267098 266935  164  2  2   52   70    89 0.868   2.75
S.003 Init - 303229 303160   70  1  1   86   78    29 0.881   2.96

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815594r:143897539_144240635|GENSCAN_predicted_peptide_1|201_aa
MLSRQKTKNEVSKPAEVQGKYVKKETSPLLRNLMPSFIQHGPKIPGRTDICLPDSSPNAF
STSGDGVVSRNQSFLRTPIQRTPHKIMRRESNRLSAPSYLARSLADVPREYGSSQSFVTE
VSFAVENGDSGSRYYYSDNFVDGQRKRPLGDRAHEDYRYYEYNHDLFQRMPQNQGRHASA
MDLHTGGNLKLGYDIGYPQDI

>gi568815594r:143897539_144240635|GENSCAN_predicted_CDS_1|606_bp
atgctgtcccgacagaaaaccaaaaacgaagtgtccaagccggccgaggtgcaggggaag
tacgtgaagaaggagacgtcgcctctgcttcggaatcttatgccttcattcatccagcat
ggtccaaaaattccaggacgaactgatatctgtcttccagattcaagccctaatgccttt
tcaacttctggagatggagtagtttcaagaaaccagagtttccttagaactccaattcaa
agaacacctcataaaataatgagaagagaaagcaacagattatctgcaccttcttatctt
gccagaagtctagcagatgtccctagagagtatggttcttctcagtcatttgtaacggaa
gttagttttgctgttgaaaatggagactctggttcccgatattattattcagacaatttt
gttgatggtcagagaaagcggccacttggagatcgtgcacatgaagactacagatattat
gaatacaaccatgatctcttccaaagaatgccacagaatcaggggaggcatgcttcagca
atggacctccatactggagggaacctgaagctaggatatgacataggttacccgcaggat
atataa

>gi568815594r:143897539_144240635|GENSCAN_predicted_peptide_2|147_aa
MGRGWNRLEGSEVHRKVKKSLELPRDLLNGCDQNADSEMDSEVQAEVVSDGDEELIGNWS
KDVAGIRMQNCLVGVIWENWSKVHFDMRECHILDAISNSFEISDRNILRNVFRQQDVVER
TLALEPEIHSDNMTPGTDRETLDKFMK

>gi568815594r:143897539_144240635|GENSCAN_predicted_CDS_2|444_bp
atgggcagaggctggaacagattggaaggatcagaagtacacaggaaggtgaaaaaaagt
ttggaacttcctagagacttgttgaatggttgtgaccaaaatgctgatagtgaaatggac
agtgaagtccaggctgaggtagtttcagatggagatgaggaactgattgggaactggagt
aaagatgtagcaggcatcaggatgcaaaactgtcttgtgggtgtgatttgggaaaactgg
tccaaagtgcattttgacatgagagaatgtcatattctagatgctattagtaactctttt
gaaatctcagatagaaatattttaagaaatgtgttcagacagcaggatgtagtagaaaga
acacttgctttggaaccagaaatccacagtgacaatatgacacctggcactgaccgagaa
accctagacaagttcatgaaatga

>gi568815594r:143897539_144240635|GENSCAN_predicted_peptide_3|309_aa
MGKDFMSKTPKAMATNAKIDKWDLIKLKSFCTAKETTIRVNRQPKEWEKIFATYSSDKGL
ISRIYDELKRIYKKKTNNPIKKWAKDMNRHFSKEDIYAANRHMKKCSSSLAIREMQIKTT
MRYHLTPVRMAIIKKSGNNRPRDLEEKNGFMVWVQGSPALCSLRSWYPASQLLQLQLWLK
GAKLIEEYSIQKNTSSAKKTQSCREPFHPQSGPRLGPCCGLSVCGSPQNVYIEALPHDVA
VFGDVASKEVTEVKRGDKGRDLIQQDNVIIRRDTRELTPSLSTMQGHSEKMAVYKPGRES
SPEPNPAGL

>gi568815594r:143897539_144240635|GENSCAN_predicted_CDS_3|930_bp
atgggcaaggacttcatgtctaaaacaccaaaagcaatggcaacaaatgccaaaattgac
aaatgggatctaatcaaactaaagagcttctgcacagcaaaagaaactaccatcagagtg
aacaggcaacctaaagaatgggagaaaatttttgcaacctactcatctgacaaagggcta
atatccagaatctacgatgaactcaaacgaatttacaagaagaaaacaaacaaccccatc
aaaaagtgggcaaaggatatgaacagacacttctcaaaagaagacatttatgcagccaac
agacacatgaaaaaatgctcatcatcactggccatcagagaaatgcaaatcaaaaccaca
atgagataccatctcacaccagttagaatggcgatcattaaaaagtcaggaaacaacagg
cccagagacctagaagagaaaaatggtttcatggtctgggtccagggctcccctgctctg
tgcagcctcaggtcatggtaccctgcatcccagctactccagctccagctgtggctaaaa
ggggccaagcttatagaagagtacagcatacagaaaaatacatcttctgcaaagaaaaca
cagtcatgtagagaacctttccatccccagtctggacccaggcttgggccttgttgtgga
ctaagtgtctgtggctccccacaaaatgtatatattgaagccctaccccatgatgtggct
gtatttggagatgtagcctctaaggaagtaactgaagttaaaagaggtgataagggtaga
gacctgatccaacaggataatgtcattataagaagagacaccagagagcttaccccctct
ctttctaccatgcaaggacacagcgagaagatggctgtctacaagccaggaagagagtcc
tcaccagaaccaaaccctgctggactttga

>gi568815594r:143897539_144240635|GENSCAN_predicted_peptide_4|236_aa
MTFMLNVEQGDGANPENSGRRGQKAEGRARAKILKQERQDFALWPKLDCNGTVIAYFSLE
LLDSSDSPASASQVAETSVFRGEEEILELLLGGMWENLSFMIRCMSTQSPHSCSLSQLLL
YADKHKRDTCPAHTANEVSEISVTTVSPPEKKNEKRDNLSIVSLYQEYKGLASIPQVKAI
GRPIPAFQFPMESAEAQLCLLWNPSSLSTQPYFLPQKTLNELPDFNLHLRFCCLHI

>gi568815594r:143897539_144240635|GENSCAN_predicted_CDS_4|711_bp
atgacatttatgctaaacgtggaacaaggagatggagccaaccctgaaaattctgggaga
agaggacagaaggcagagggaagagcaagagcaaaaattctgaaacaggagagacaagat
tttgctctgtggcctaagctggactgcaacggcacagtcatagcttacttcagtcttgaa
ctgctggattcaagtgattctccagcttctgcctctcaagtagctgagacttcagttttc
aggggagaagaagaaattctggagctgctgctgggagggatgtgggagaatttgtctttc
atgatacgctgtatgtccacgcagtcacctcattcttgttccctttctcaacttctctta
tatgcagataagcacaaacgggacacatgtccagctcatactgctaatgaagtttcagaa
atttctgttacaactgtttcccctccagaaaagaaaaacgagaaacgggacaacttgtcc
atcgtttcactgtaccaggaatataaaggcctggcctccataccccaagtcaaggcaatc
ggaaggcccatccctgctttccagttccccatggagtcggcagaggcccaactgtgcctg
ctttggaacccatcttctctctctacccaaccctacttccttcctcagaagactttgaat
gaacttcctgacttcaatctccatctcagattctgctgcctgcacatctga

>gi568815594r:143897539_144240635|GENSCAN_predicted_peptide_5|399_aa
MTFMLNVEQGDGANPENSGKRGQKAEGRARAKILKQQRKDFALWPKLDCNGTVIAYFSLE
LLDSSDSPASASQVAETSDTHKRDTYAATPRAHEVSEISVRTVYPPEEETEITLIIFGVM
AGVIGTILLISYGIRRLIKDTADLTEMAGSEISEGPQDAHTSERTGEFPDPQLAGHATEV
WFLCLATRSSNTLREGEHADRQVQELEKVLLGSRPVVGSKGNDTKQTTKDMKKMNFRHRT
SVAQKSDCRERGIKQNGALTQKSKSKGSSETRPGSSTYEPVKSKCKQGSGKIPGGEVALC
AAQKLRPHHGITKPPDLQGCAKIQNECLTAKVEKLSSTKPVSDAIKLGDRCSSDDKKMLP
TVIMVSNGKSGIPSSFGSIAAMDLHTGGNLKLGYDIGYP

>gi568815594r:143897539_144240635|GENSCAN_predicted_CDS_5|1200_bp
atgacatttatgctaaatgtggaacaaggagacggagccaaccctgaaaattctgggaaa
agaggacagaaggcagagggaagagcaagagcaaaaattctgaaacagcagagaaaagat
tttgctctgtggcctaagctggactgcaacggcacagtcatagcttacttcagtcttgaa
ctgctggattcaagtgattctccagcttctgcctctcaagtagctgagacttcagatacg
cacaaacgggacacatatgcagccactcctagagctcatgaagtttcagaaatttctgtt
agaactgtttaccctccagaagaggaaaccgagataacactcattatttttggggtgatg
gctggtgttattggaacgatcctcttaatttcttacggtattcgccgactgataaaggac
acagcagatcttactgagatggctgggtcagaaatctctgagggaccccaggatgcccac
acctcagaaaggacaggggagttccctgacccccaacttgcaggacatgcaacagaggtg
tggtttctctgtttggccaccaggagctcaaacaccttacgggagggagagcatgcagat
aggcaggtgcaagagctggagaaagtgcttttgggctccagacccgtggtaggatctaag
ggaaatgacaccaagcaaacaacaaaagatatgaaaaaaatgaatttcagacaccggaca
tcagtagcacagaagagtgattgccgagagagaggaatcaaacaaaatggagcattaacc
caaaagtccaagtccaaaggctcatctgagacaagaccaggttcttccacctatgagcct
gtaaaatcaaaatgcaaacaaggaagtggcaagatcccaggaggagaagtggctctgtgt
gcagctcagaaactgagaccacaccatggtatcacaaagccaccggacctccagggctgt
gctaaaatacagaatgagtgtcttactgctaaagtggaaaaattgtcttccacgaaacca
gtctctgatgccataaagcttggggaccgctgctctagtgatgataagaagatgctgccc
actgtgataatggtcagtaatggtaaatcagggattccttcttcatttggatccattgct
gcaatggacctccatactggagggaacctgaagctaggatatgacataggttacccgtag

>gi568815594r:143897539_144240635|GENSCAN_predicted_peptide_6|269_aa
MDKFLHTYTVPRLNQEFESLNRPITGSEIEAIINSLPTKKSPGPHGFTAKFYQRYEEELV
PLLLKLFQSIEKEGILPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAKILNKILAN
RIQQYIKKLIRHDQVGFISGMQGWFNIQKSINVIQHINRTKDKNYMIISIDAEKAFDKIQ
QPFMLKTLNKLGIDGTHLKIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLFNI
VLEVLARAIRQEKEIKGIELGSQIVPVCR

>gi568815594r:143897539_144240635|GENSCAN_predicted_CDS_6|810_bp
atggataaattcctccacacatacaccgtcccaagactaaaccaggaatttgaatctctg
aatagaccaataacaggctctgaaattgaggcaataattaatagcttaccaaccaaaaag
agtccaggaccacatggattcacagccaaattctaccagagatacgaggaggagctggta
cctttacttctgaaactattccaatcaatagaaaaagagggaatcctccctaactcattt
tatgaggccagcatcatcctgataccaaagcctggcagagacacaacaaaaaaagagaat
tttagaccaatatccttgatgaacattgatgcaaaaatcctcaataaaatactggcaaac
cgaatccagcagtacatcaaaaagcttatccgccatgatcaagtgggcttcatctctggg
atgcaaggctggttcaacatacaaaaatcaataaacgtaatccagcatataaacagaacc
aaagacaaaaactacatgattatctcaatagatgcagaaaaggcctttgacaaaattcaa
caacccttcatgctgaaaactctcaataaattaggtattgatgggacacatctcaaaata
agagctatctatgacaaacccacagccaatatcatactgaatggacaaaaactggaagca
ttccctttgaaaactggcacaagacagggatgccctctctcaccactcctattcaacata
gtgttggaagttctggccagggcaatcaggcaggagaaggaaataaagggcattgaatta
ggaagtcaaattgtccctgtttgcagatga

>gi568815594r:143897539_144240635|GENSCAN_predicted_peptide_7|100_aa
MAEQEQLQSTAPSERSSSPATEQSWMENDFDELREEGFRRSNYSELKEEVQTNGKEVKNR
EKKLDEWITKITNAEKSLKDLMDLKTKARELCDECTSHSS

>gi568815594r:143897539_144240635|GENSCAN_predicted_CDS_7|303_bp
atggccgaacaagaacagctccagtctacagctcccagcgaacgcagctcctcaccagca
acggaacaaagctggatggagaatgactttgatgagttgagagaagaaggcttcagaaga
tcaaactactctgagctaaaggaggaagttcaaaccaatggcaaagaagttaaaaaccgt
gaaaaaaaattagacgaatggataaccaaaataaccaatgcagagaagtccttaaaggac
ctgatggatctgaaaaccaaggcacgagaactatgtgacgaatgcacaagccacagtagc
tga

>gi568815594r:143897539_144240635|GENSCAN_predicted_peptide_8|166_aa
MSIFPDPQRYQCIVVPRGSSAVRGTSTCRDEKEPTQELWHLKWPEYLMSSKYTNSPTRVL
NQAELAEMTEVEFRLWIGMKTIEIQENGKSQSKETKNNNNMTQELTDKITSIKKSLTHLI
ELKNTLQEFPNAITSINSRTDQAEERILELEDWLSEITQTKIKKKE

>gi568815594r:143897539_144240635|GENSCAN_predicted_CDS_8|501_bp
atgagcatttttccagatccacaaagataccaatgcatcgtggtccccagaggcagctca
gcagttagaggaacatccacatgcagagatgagaaagaaccaacacaagaactctggcac
ctcaaatggccagagtatcttatgtcctccaaatacactaattctccaacaagggttctt
aaccaggctgagttagctgaaatgacagaagtagaattcagactatggataggaatgaag
actattgagattcaggagaatggcaaaagccaatccaaggaaactaagaataacaataat
atgacgcaggagctgacagacaaaataaccagtataaaaaagagcctaacacatctgata
gagctgaaaaacacactacaagaatttcccaatgcaatcacaagtattaacagcagaaca
gaccaagctgaggaaaggatcttggaacttgaagactggctctctgaaataacacagaca
aaaataaagaaaaaagaatga

>gi568815594r:143897539_144240635|GENSCAN_predicted_peptide_9|359_aa
MLSDLRLAIKGYSLLSGPSFVRQGTTDKTEESKGMSRQLLTHSCTDAREMKQQAGEGDRV
QNGQRVYSYELLEDGDWQEKSRRFWSPDDFEGFKTSVEKVTPNVVEIARELKLEVEHEGV
TELLQSYHKTWTDEKLILINEQRKWSLEMEFMSDEDAVNPVEMTTMDFEYYIVDKAPVEF
ETIDINFERSSNAPPPTLGITIPHEIWEGTQIQTISTGKPKTSHDPLYCDIHIIVVVWNG
TYSISEVCLQYVQAPRVSNQLTKLSYSLWPWFPRVIPTYPEKREGPIATASVSAVSGQIC
LEGEESSHSIPPLSEHVFYSIAGLLFYLLPGNPIPAPTDFIVDVVQAAGLKANSDSSKV

>gi568815594r:143897539_144240635|GENSCAN_predicted_CDS_9|1080_bp
atgctttcagacctgagacttgccattaagggctatagccttctctctggcccaagcttt
gtgcgacaagggactacagacaagacagaagaaagtaagggcatgagcagacagctcctc
actcactcttgcactgatgctagggagatgaagcaacaagctggggagggagacagagtt
cagaatggccagagggtctatagttatgagctactagaggatggagactggcaggagaaa
agcagaaggttctggagtccagatgactttgaggggttcaagacttcagtggagaaagtc
actcccaatgtggtggaaatagcaagagaactaaaattagaagtggaacatgaaggtgtg
actgaattgctgcaatcttatcataaaacctggacagatgagaagttgattcttatcaat
gagcaaagaaaatggtctcttgagatggaatttatgtctgatgaagatgctgtgaaccct
gttgaaatgacaacaatggatttcgaatattacatagttgataaagcaccagtagagttt
gaaacgattgacattaattttgaaagaagttctaatgctcctcctccaacactggggatc
acaattccacatgagatttgggaggggacacaaatccaaactatatctactgggaaacca
aaaacttcacatgacccactttattgtgatattcacattattgtggtggtctggaatgga
acctacagtatctctgaggtatgcctgcagtatgttcaggcacccagagtcagcaatcag
ctcacaaagttgtcctattctctgtggccctggttccccagggtgatccctacatatcct
gagaagcgtgaaggccccattgctactgcctcagtatcagcagtatcaggacaaatatgc
cttgagggagaagaatcatcccattccattccacctctctcagaacatgtcttctacagt
atagctgggcttttattttaccttctaccaggaaaccccatacctgcccccacagatttc
atagtagatgtggtacaggctgcaggactgaaagcaaactcagacagcagcaaagtgtag

>gi568815594r:143897539_144240635|GENSCAN_predicted_peptide_10|135_aa
XLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISNFSKASG
YKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLK
EIKEDTNGRTFHAHG

>gi568815594r:143897539_144240635|GENSCAN_predicted_CDS_10|408_bp
ntgttggaagttctggccagggcaatcaggcaggagaaggaaataaagggtattcaatta
ggaaaagaggaagtcaaattgtccctgtttgcagacgacatgattgtatatctagaaaac
cccattgtctcagcccaaaatctccttaagctgataagcaacttcagcaaagcctcagga
tacaaaatcaatgtacaaaaatcacaagcattcttatacaccaataacagacaaacagag
agccaaatcatgagtgaactcccattcacaattgcttcaaagagaataaaatacctagga
atccaacttacaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaag
gaaataaaagaggatacaaatggaagaacattccatgctcatgggtag