GENSCAN 1.0 Date run: 3-Nov-116 Time: 21:30:48

Sequence gi568815586r:16451225_16700860 : 249636 bp : 35.90% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  17707  17772   66  1  0  128   70    58 0.030   5.10
 1.02 Term +  35678  35735   58  2  1   84   49    61 0.109  -2.02
 1.03 PlyA +  36016  36021    6                               1.05

 2.00 Prom +  40840  40879   40                              -3.05
 2.01 Init +  50809  50892   84  0  0   57   82    57 0.210   2.87
 2.02 Term +  62188  63147  960  0  0   85   42   819 0.052  68.06
 2.03 PlyA +  64138  64143    6                               1.05

 3.07 PlyA -  64840  64835    6                               1.05
 3.06 Term -  76705  76550  156  1  0  112   42   110 0.833   5.75
 3.05 Intr -  86296  86034  263  2  2    8   39   150 0.407  -1.62
 3.04 Intr -  87215  87002  214  2  1  -28   57   227 0.067   5.37
 3.03 Intr - 100103 100007   97  1  1   47   57   112 0.020   3.09
 3.02 Intr - 109314 109189  126  2  0   54   99    60 0.047   2.57
 3.01 Init - 126544 126534   11  1  2   80   87     2 0.006  -0.76
 3.00 Prom - 128674 128635   40                              -2.75

 4.06 PlyA - 128917 128912    6                               1.05
 4.05 Term - 133681 133382  300  1  0   54   41   189 0.704   5.04
 4.04 Intr - 149644 149490  155  2  2  128   64   178 0.582  18.27
 4.03 Intr - 187640 187535  106  2  1   75   56    46 0.027  -1.03
 4.02 Intr - 188360 188265   96  2  0   78   66    44 0.153   0.49
 4.01 Init - 189945 189835  111  0  0  100   68    75 0.235   6.96
 4.00 Prom - 203204 203165   40                              -0.95

 5.04 PlyA - 203319 203314    6                               1.05
 5.03 Term - 207886 207703  184  0  1   56   42   173 0.979   5.53
 5.02 Intr - 214027 213862  166  2  1   92   68    58 0.893   2.30
 5.01 Init - 224718 224694   25  0  1   86  123    18 0.732   4.85
 5.00 Prom - 226631 226592   40                              -5.75

 6.00 Prom + 234469 234508   40                              -4.75
 6.01 Init + 244023 244107   85  2  1   62   49    14 0.325  -4.07
 6.02 Term + 244251 244411  161  1  2  -30   45   295 0.814  10.52
 6.03 PlyA + 244660 244665    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init - 152043 151995   49  0  1   88   64     4 0.828  -0.84

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586r:16451225_16700860|GENSCAN_predicted_peptide_1|41_aa
XCSDHPASCTVDEKLLGDGRQTRTRITVLDHIRVLFQAATV

>gi568815586r:16451225_16700860|GENSCAN_predicted_CDS_1|126_bp
nngtgtagtgatcatccagcgtcatgcacagtagatgagaaactcctaggggatggtaga
cagacaaggactcgcattacagttcttgatcacatcagagtgctcttccaggccgcgact
gtttaa

>gi568815586r:16451225_16700860|GENSCAN_predicted_peptide_2|347_aa
MSTAQIRTNKNHVLLFILYDCRRQMNSKLRRLRDRRRLPPGLASDAPARASSWWTHVEMG
PPDPILGVTEGFKRDTNSKKMNLGVSAYRDHNGKPYSIHKAEAQIAAKNLDEDYLSIGRL
AEFCKASAEVALGENSEVSKSGRFVTVQTISGTSALRIRASFLQRFLKFSQDVFLPKPTW
GSRTPIFRDTGMQLQGYRYYDRKTCGFDFTDTVEDISKTPEQSLLLLHACSHNPTGVDPG
PKQQKEIATVVKKTKISLHSSTWPTKALPVAMVTRMPGLCATLPIIRHCRCQSYTKNMVL
HSEGVAGFTMVCKDADEAKRVESHLKILMCPMYSNPPLNGTRLLLPF

>gi568815586r:16451225_16700860|GENSCAN_predicted_CDS_2|1044_bp
atgtctacagctcaaatccgaacaaataaaaaccatgttttgctgtttattttatatgat
tgtcgaagacaaatgaattccaagctgcgtcgtctccgggatcgccgccgccttccaccc
gggctcgcctctgatgcccctgccagagccagctcctggtggacccatgtggagatggga
ccaccagatcccatccttggagtcaccgaaggctttaagagggacaccaatagcaaaaag
atgaatctgggagttagtgcctacagggaccacaacggaaagccttacagcatccacaag
gcagaggcccagattgcagccaaaaatttggacgaggactacctctccattgggcggctg
gctgaattttgcaaggcatctgcagaagtagccttgggcgagaatagcgaagtctcgaaa
agtggccggtttgtcactgtgcagaccatttctggaactagtgccttaaggatcagagct
agttttctgcaaagatttcttaagttcagccaagatgtctttctgcccaaaccaacctgg
ggaagtcgcacacccatcttcagggatactggcatgcagctacaaggttatcgatactat
gaccgcaagacttgcggttttgacttcacagacactgtggaggacatttcaaaaacacca
gagcaaagtcttcttcttctgcatgcttgctcccataatcccacaggagtggaccctggt
cccaagcagcagaaggaaatagcaacagtggtgaagaaaacaaaaatctctttgcactct
tcgacatggcctaccaaggctttgccagtggcaatggtaacaaggatgcctgggctgtgc
gccactctgccaattatacgccattgtcgctgccaatcatacaccaagaacatggtctta
cacagtgagggtgtggcaggcttcaccatggtttgcaaagacgcagatgaagccaagagg
gtagagtcacatttgaagatcttgatgtgtcccatgtattccaaccctcccctcaatggg
acccgtttgcttctaccattctga

>gi568815586r:16451225_16700860|GENSCAN_predicted_peptide_3|288_aa
MEPLLFGVTGNCAACSKLIPAFEMVMRAKDNVYHLDCFACQLCNQRFCVGDKFFLKNNMI
LCQTDYEEGLMKEGYAPQIPENVEATLELGNRKKLEEFGGLRRQEMWESLEPPRDLLNGF
DKNDDSDLNNKGQAEVVSDGDEELGNWSKVRKGNVGLVPPHRVHTGAPPSGAVRRGPPCS
RSQNGRSHSLHHATGKATDTQNQPMKAARRGAVPCKAIGVELPKTMGTYLLHQHDLDVDD
NSTFSDYLVQCLEKLNKGAYALIALGQTIKATSVSRISNHSKDSMQGV

>gi568815586r:16451225_16700860|GENSCAN_predicted_CDS_3|867_bp
atggagccattgctctttggtgtaacgggaaactgcgctgcctgtagtaagctcatccct
gcctttgagatggtgatgcgtgccaaggacaatgtttaccacctggactgctttgcatgt
cagctttgtaatcagagattttgtgttggagacaaatttttcctaaagaataacatgatc
ctttgccagacggactacgaggaaggtttaatgaaagaaggttatgcaccccagatacct
gagaatgtggaagcgactttggaactgggtaacaggaagaagttggaagagtttggaggg
ctcagaagacaggaaatgtgggaaagtttagagcctcctagagacttgttgaatggcttt
gacaaaaatgatgatagtgatttgaacaataagggccaggctgaggtggtctcagatgga
gatgaggaacttgggaactggagcaaagtgcggaagggaaatgtggggttggtgccccca
cacagagtccatactggggcacctcctagtggagctgtgagaagagggccaccatgctcc
agatcccagaatggtagatcccacagcttgcaccatgctactggaaaagccacagatact
caaaaccagcccatgaaagcagccaggagaggggctgtaccctgcaaagccataggggtg
gagttgcccaagactatgggaacctacctcttgcatcagcatgacctggatgttgatgat
aatagcaccttctctgactaccttgtacagtgcttagaaaaattaaataagggggcgtat
gcgctgattgctctgggccagacaataaaggctacttcagtttccaggattagcaaccac
agcaaagactccatgcaaggtgtataa

>gi568815586r:16451225_16700860|GENSCAN_predicted_peptide_4|255_aa
MAKQDTARAVASEGVNPNTWGLPCGVEPEHAQKSRIECWDYSREPLRLADGRWFYKGEFP
CTGSLACHHCSTQKHITLNCSTSAKNDHKIQDSCSLLLQEPGYLGIQMLSVQPDTKPKGC
AGCNRKIKDRYLLKALDKYWHEDCLKCACCDCRLGEFAKVFENISSHPEFLKRGSEPFIQ
CIPVTLLLQKSPGSFQPNKESWHCFVRREAARPDKRNLLLFLELHGSHCNMFGLKWDIFW
DPHCAKAKTSEAIHL

>gi568815586r:16451225_16700860|GENSCAN_predicted_CDS_4|768_bp
atggctaaacaagatacagctcgggctgtggcttcagagggtgtaaaccccaacacttgg
gggcttccatgtggtgttgagcctgagcatgcacagaaatcgagaattgagtgctgggat
tacagccgtgagcccctgcgcctggccgatggccgatggttttataaaggggaattcccc
tgcacaggctctcttgcctgccaccattgcagtacacaaaagcacatcactctgaactgt
tccacctctgcaaaaaatgaccacaaaattcaagactcctgttctttactgctgcaagag
ccagggtatctaggtatacaaatgctctcagtccagccagacaccaagccgaaaggttgt
gctggctgcaaccgaaagatcaaggaccggtatcttctaaaggcactggacaaatactgg
catgaagactgcctgaagtgtgcctgctgtgactgtcgcttgggagagttcgctaaggtg
tttgaaaatatttcctcgcatcctgaattcttgaagagaggatcagagcccttcatccaa
tgtataccagttacactcctgcttcagaaatctcctgggagctttcagcctaacaaggag
tcttggcactgctttgtgaggcgagaagcagcccgtcctgataaaagaaatctcttgctg
tttcttgaacttcatggcagccattgcaatatgtttggacttaagtgggatattttctgg
gaccctcactgtgccaaagctaagacatctgaggccatccacctctaa

>gi568815586r:16451225_16700860|GENSCAN_predicted_peptide_5|124_aa
MEVTEMSFVTNSISLGVITKNISKLPNGLSKTKSPVVESHRSIGFSFHLHTFPTCRIFVE
ELGSWGGNVTPELQWAAQEYQVMIYSQHSSGRGAHTFRALRARTVKKHRGATQVTKNLSA
DDYA

>gi568815586r:16451225_16700860|GENSCAN_predicted_CDS_5|375_bp
atggaagtaacagagatgtccttcgttaccaatagcatctccctaggtgtgatcaccaaa
aatatctccaaactgccaaatggcctgtcaaagacaaaatcacctgtggttgagagccac
cgatctataggcttttccttccatcttcatacttttcccacttgcagaatatttgttgaa
gagcttgggagttggggaggaaacgttaccccagagctgcagtgggcagcccaggagtac
caagtcatgatctacagccagcactcaagtgggagaggagcccacactttcagagcactg
agagctagaactgtgaagaaacacaggggagctacacaagtgaccaagaatctatcagct
gatgactatgcctaa

>gi568815586r:16451225_16700860|GENSCAN_predicted_peptide_6|81_aa
MVAVIEVGSRMVVARGWQKGKIGRCWLKVKEEEEEEEEEEEEEEEEEEEEEGRGEEEEGG
RREEEEKEGEGNRKEEMTWGR

>gi568815586r:16451225_16700860|GENSCAN_predicted_CDS_6|246_bp
atggtcgctgtcatagaagtagggagtagaatggtggttgccagaggctggcaaaagggg
aaaattggtagatgttggttaaaggtaaaagaagaggaagaggaagaagaggaagaagag
gaagaggaagaagaagaagaagaggaagaagaaggaagaggagaggaggaggaagggggg
aggagggaggaggaggagaaggaaggagaaggaaacagaaaggaggaaatgacttggggc
aggtga