GENSCAN 1.0	Date run: 8-Nov-116	Time: 09:42:51

Sequence gi568815594r:144014689_144240635 : 225947 bp : 37.62% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.13 PlyA -   1124   1119    6                                 1.05
 1.12 Term -   6701   6643   59  0  2   79   45    81 0.058  -0.23
 1.11 Intr -  10674  10636   39  1  0  121   93    31 0.049   4.18
 1.10 Intr -  33459  33355  105  1  0   91   46   133 0.529   8.67
 1.09 Intr -  45798  45654  145  0  1   33  106   146 0.542   9.83
 1.08 Intr -  48552  48473   80  1  2   64  106    56 0.256   3.35
 1.07 Intr -  75073  74963  111  2  0   53   83    75 0.206   2.93
 1.06 Intr -  76332  76168  165  1  0   44   96    66 0.130   2.01
 1.05 Intr -  84174  84096   79  0  1   62   98    54 0.035   2.01
 1.04 Intr - 102251 102166   86  0  2   99  100    40 0.864   4.92
 1.03 Intr - 105093 104998   96  1  0   87   96   115 0.066  11.26
 1.02 Intr - 117969 117846  124  0  1   71   69    60 0.147   1.64
 1.01 Init - 123583 123473  111  1  0   33  111    64 0.310   3.46
 1.00 Prom - 133724 133685   40                                -2.75

 2.02 PlyA - 134484 134479    6                                 1.05
 2.01 Sngl - 135981 135172  810  0  0   70   44   352 0.945  24.53
 2.00 Prom - 136928 136889   40                                -6.15

 3.03 PlyA - 137097 137092    6                                 1.05
 3.02 Term - 138271 138008  264  1  0   66   44   250 0.691  12.92
 3.01 Init - 139353 139315   39  0  0  111   49    61 0.393   4.74
 3.00 Prom - 145133 145094   40                                -0.95

 4.03 PlyA - 145250 145245    6                                 1.05
 4.02 Term - 149506 149056  451  0  1   34   48   263 0.726  10.57
 4.01 Init - 158551 158502   50  1  2   73   87    36 0.206   2.47
 4.00 Prom - 159019 158980   40                                -4.45

 5.07 PlyA - 159159 159154    6                                 1.05
 5.06 Term - 164772 164546  227  1  2  119   44    76 0.335   2.26
 5.05 Intr - 181200 180924  277  0  1   45   85   112 0.391   2.77
 5.04 Intr - 183905 183592  314  0  2   23   51   285 0.160  13.28
 5.03 Intr - 191384 191333   52  2  1   83  115    23 0.009   2.06
 5.02 Intr - 214950 214797  154  2  1   37  105    74 0.074   3.15
 5.01 Init - 219506 219451   56  2  2   35  115    37 0.577   1.94
 5.00 Prom - 221566 221527   40                                -3.65

 6.02 PlyA - 221942 221937    6                                 1.05
 6.01 Term - 224715 224309  407  1  2   66   49   186 0.692   6.86
Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  70207  70287   81  0  0  120   51   119 0.946   8.11
S.002 Init - 149948 149785  164  2  2   52   70    89 0.868   2.75
S.003 Init - 186079 186010   70  1  1   86   78    29 0.881   2.96

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815594r:144014689_144240635|GENSCAN_predicted_peptide_1|399_aa
MTFMLNVEQGDGANPENSGKRGQKAEGRARAKILKQQRKDFALWPKLDCNGTVIAYFSLE
LLDSSDSPASASQVAETSDTHKRDTYAATPRAHEVSEISVRTVYPPEEETEITLIIFGVM
AGVIGTILLISYGIRRLIKDTADLTEMAGSEISEGPQDAHTSERTGEFPDPQLAGHATEV
WFLCLATRSSNTLREGEHADRQVQELEKVLLGSRPVVGSKGNDTKQTTKDMKKMNFRHRT
SVAQKSDCRERGIKQNGALTQKSKSKGSSETRPGSSTYEPVKSKCKQGSGKIPGGEVALC
AAQKLRPHHGITKPPDLQGCAKIQNECLTAKVEKLSSTKPVSDAIKLGDRCSSDDKKMLP
TVIMVSNGKSGIPSSFGSIAAMDLHTGGNLKLGYDIGYP

>gi568815594r:144014689_144240635|GENSCAN_predicted_CDS_1|1200_bp
atgacatttatgctaaatgtggaacaaggagacggagccaaccctgaaaattctgggaaa
agaggacagaaggcagagggaagagcaagagcaaaaattctgaaacagcagagaaaagat
tttgctctgtggcctaagctggactgcaacggcacagtcatagcttacttcagtcttgaa
ctgctggattcaagtgattctccagcttctgcctctcaagtagctgagacttcagatacg
cacaaacgggacacatatgcagccactcctagagctcatgaagtttcagaaatttctgtt
agaactgtttaccctccagaagaggaaaccgagataacactcattatttttggggtgatg
gctggtgttattggaacgatcctcttaatttcttacggtattcgccgactgataaaggac
acagcagatcttactgagatggctgggtcagaaatctctgagggaccccaggatgcccac
acctcagaaaggacaggggagttccctgacccccaacttgcaggacatgcaacagaggtg
tggtttctctgtttggccaccaggagctcaaacaccttacgggagggagagcatgcagat
aggcaggtgcaagagctggagaaagtgcttttgggctccagacccgtggtaggatctaag
ggaaatgacaccaagcaaacaacaaaagatatgaaaaaaatgaatttcagacaccggaca
tcagtagcacagaagagtgattgccgagagagaggaatcaaacaaaatggagcattaacc
caaaagtccaagtccaaaggctcatctgagacaagaccaggttcttccacctatgagcct
gtaaaatcaaaatgcaaacaaggaagtggcaagatcccaggaggagaagtggctctgtgt
gcagctcagaaactgagaccacaccatggtatcacaaagccaccggacctccagggctgt
gctaaaatacagaatgagtgtcttactgctaaagtggaaaaattgtcttccacgaaacca
gtctctgatgccataaagcttggggaccgctgctctagtgatgataagaagatgctgccc
actgtgataatggtcagtaatggtaaatcagggattccttcttcatttggatccattgct
gcaatggacctccatactggagggaacctgaagctaggatatgacataggttacccgtag

>gi568815594r:144014689_144240635|GENSCAN_predicted_peptide_2|269_aa
MDKFLHTYTVPRLNQEFESLNRPITGSEIEAIINSLPTKKSPGPHGFTAKFYQRYEEELV
PLLLKLFQSIEKEGILPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAKILNKILAN
RIQQYIKKLIRHDQVGFISGMQGWFNIQKSINVIQHINRTKDKNYMIISIDAEKAFDKIQ
QPFMLKTLNKLGIDGTHLKIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLFNI
VLEVLARAIRQEKEIKGIELGSQIVPVCR

>gi568815594r:144014689_144240635|GENSCAN_predicted_CDS_2|810_bp
atggataaattcctccacacatacaccgtcccaagactaaaccaggaatttgaatctctg
aatagaccaataacaggctctgaaattgaggcaataattaatagcttaccaaccaaaaag
agtccaggaccacatggattcacagccaaattctaccagagatacgaggaggagctggta
cctttacttctgaaactattccaatcaatagaaaaagagggaatcctccctaactcattt
tatgaggccagcatcatcctgataccaaagcctggcagagacacaacaaaaaaagagaat
tttagaccaatatccttgatgaacattgatgcaaaaatcctcaataaaatactggcaaac
cgaatccagcagtacatcaaaaagcttatccgccatgatcaagtgggcttcatctctggg
atgcaaggctggttcaacatacaaaaatcaataaacgtaatccagcatataaacagaacc
aaagacaaaaactacatgattatctcaatagatgcagaaaaggcctttgacaaaattcaa
caacccttcatgctgaaaactctcaataaattaggtattgatgggacacatctcaaaata
agagctatctatgacaaacccacagccaatatcatactgaatggacaaaaactggaagca
ttccctttgaaaactggcacaagacagggatgccctctctcaccactcctattcaacata
gtgttggaagttctggccagggcaatcaggcaggagaaggaaataaagggcattgaatta
ggaagtcaaattgtccctgtttgcagatga

>gi568815594r:144014689_144240635|GENSCAN_predicted_peptide_3|100_aa
MAEQEQLQSTAPSERSSSPATEQSWMENDFDELREEGFRRSNYSELKEEVQTNGKEVKNR
EKKLDEWITKITNAEKSLKDLMDLKTKARELCDECTSHSS

>gi568815594r:144014689_144240635|GENSCAN_predicted_CDS_3|303_bp
atggccgaacaagaacagctccagtctacagctcccagcgaacgcagctcctcaccagca
acggaacaaagctggatggagaatgactttgatgagttgagagaagaaggcttcagaaga
tcaaactactctgagctaaaggaggaagttcaaaccaatggcaaagaagttaaaaaccgt
gaaaaaaaattagacgaatggataaccaaaataaccaatgcagagaagtccttaaaggac
ctgatggatctgaaaaccaaggcacgagaactatgtgacgaatgcacaagccacagtagc
tga

>gi568815594r:144014689_144240635|GENSCAN_predicted_peptide_4|166_aa
MSIFPDPQRYQCIVVPRGSSAVRGTSTCRDEKEPTQELWHLKWPEYLMSSKYTNSPTRVL
NQAELAEMTEVEFRLWIGMKTIEIQENGKSQSKETKNNNNMTQELTDKITSIKKSLTHLI
ELKNTLQEFPNAITSINSRTDQAEERILELEDWLSEITQTKIKKKE

>gi568815594r:144014689_144240635|GENSCAN_predicted_CDS_4|501_bp
atgagcatttttccagatccacaaagataccaatgcatcgtggtccccagaggcagctca
gcagttagaggaacatccacatgcagagatgagaaagaaccaacacaagaactctggcac
ctcaaatggccagagtatcttatgtcctccaaatacactaattctccaacaagggttctt
aaccaggctgagttagctgaaatgacagaagtagaattcagactatggataggaatgaag
actattgagattcaggagaatggcaaaagccaatccaaggaaactaagaataacaataat
atgacgcaggagctgacagacaaaataaccagtataaaaaagagcctaacacatctgata
gagctgaaaaacacactacaagaatttcccaatgcaatcacaagtattaacagcagaaca
gaccaagctgaggaaaggatcttggaacttgaagactggctctctgaaataacacagaca
aaaataaagaaaaaagaatga

>gi568815594r:144014689_144240635|GENSCAN_predicted_peptide_5|359_aa
MLSDLRLAIKGYSLLSGPSFVRQGTTDKTEESKGMSRQLLTHSCTDAREMKQQAGEGDRV
QNGQRVYSYELLEDGDWQEKSRRFWSPDDFEGFKTSVEKVTPNVVEIARELKLEVEHEGV
TELLQSYHKTWTDEKLILINEQRKWSLEMEFMSDEDAVNPVEMTTMDFEYYIVDKAPVEF
ETIDINFERSSNAPPPTLGITIPHEIWEGTQIQTISTGKPKTSHDPLYCDIHIIVVVWNG
TYSISEVCLQYVQAPRVSNQLTKLSYSLWPWFPRVIPTYPEKREGPIATASVSAVSGQIC
LEGEESSHSIPPLSEHVFYSIAGLLFYLLPGNPIPAPTDFIVDVVQAAGLKANSDSSKV

>gi568815594r:144014689_144240635|GENSCAN_predicted_CDS_5|1080_bp
atgctttcagacctgagacttgccattaagggctatagccttctctctggcccaagcttt
gtgcgacaagggactacagacaagacagaagaaagtaagggcatgagcagacagctcctc
actcactcttgcactgatgctagggagatgaagcaacaagctggggagggagacagagtt
cagaatggccagagggtctatagttatgagctactagaggatggagactggcaggagaaa
agcagaaggttctggagtccagatgactttgaggggttcaagacttcagtggagaaagtc
actcccaatgtggtggaaatagcaagagaactaaaattagaagtggaacatgaaggtgtg
actgaattgctgcaatcttatcataaaacctggacagatgagaagttgattcttatcaat
gagcaaagaaaatggtctcttgagatggaatttatgtctgatgaagatgctgtgaaccct
gttgaaatgacaacaatggatttcgaatattacatagttgataaagcaccagtagagttt
gaaacgattgacattaattttgaaagaagttctaatgctcctcctccaacactggggatc
acaattccacatgagatttgggaggggacacaaatccaaactatatctactgggaaacca
aaaacttcacatgacccactttattgtgatattcacattattgtggtggtctggaatgga
acctacagtatctctgaggtatgcctgcagtatgttcaggcacccagagtcagcaatcag
ctcacaaagttgtcctattctctgtggccctggttccccagggtgatccctacatatcct
gagaagcgtgaaggccccattgctactgcctcagtatcagcagtatcaggacaaatatgc
cttgagggagaagaatcatcccattccattccacctctctcagaacatgtcttctacagt
atagctgggcttttattttaccttctaccaggaaaccccatacctgcccccacagatttc
atagtagatgtggtacaggctgcaggactgaaagcaaactcagacagcagcaaagtgtag

>gi568815594r:144014689_144240635|GENSCAN_predicted_peptide_6|135_aa
XLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISNFSKASG
YKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLK
EIKEDTNGRTFHAHG

>gi568815594r:144014689_144240635|GENSCAN_predicted_CDS_6|408_bp
ntgttggaagttctggccagggcaatcaggcaggagaaggaaataaagggtattcaatta
ggaaaagaggaagtcaaattgtccctgtttgcagacgacatgattgtatatctagaaaac
cccattgtctcagcccaaaatctccttaagctgataagcaacttcagcaaagcctcagga
tacaaaatcaatgtacaaaaatcacaagcattcttatacaccaataacagacaaacagag
agccaaatcatgagtgaactcccattcacaattgcttcaaagagaataaaatacctagga
atccaacttacaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaag
gaaataaaagaggatacaaatggaagaacattccatgctcatgggtag