GENSCAN 1.0    Date run: 11-May-117    Time: 11:22:39

Sequence gi568815588r:102730705_102937361 : 206657 bp : 45.67% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   1019   1079   61  2  1   61   88    50 0.450   1.04
 1.02 Intr +   1448   1514   67  1  1  101   43    45 0.586  -0.22
 1.03 Intr +   2155   2204   50  2  2  113   91    33 0.873   4.50
 1.04 Term +   5158   5343  186  0  0  118   43    67 0.810   2.69
 1.05 PlyA +   6451   6456    6                               1.05

 2.00 Prom +   6633   6672   40                              -4.36
 2.01 Init +  13377  13439   63  2  0   86  110   123 0.578  13.46
 2.02 Intr +  32937  33070  134  2  2   50  106    25 0.067  -0.06
 2.03 Intr +  34889  34985   97  2  1   11   97    77 0.133   0.91
 2.04 Intr +  67289  67391  103  1  1   97   78    78 0.061   7.45
 2.05 Intr +  79189  79350  162  2  0   96   51   199 0.548  16.95
 2.06 Term +  81891  82627  737  1  2   97   43   919 0.999  81.93
 2.07 PlyA +  83769  83774    6                               1.05

 3.10 PlyA -  84961  84956    6                               1.05
 3.09 Term -  91683  91672   12  0  0  134   45    -2 0.303  -1.70
 3.08 Intr - 100281 100043  239  1  2  107   91   262 0.671  25.73
 3.07 Intr - 100907 100804  104  1  2   28  105   107 0.869   6.22
 3.06 Intr - 101976 101807  170  0  2   91   69   302 0.994  27.34
 3.05 Intr - 102504 102289  216  0  0   70   69   293 0.999  24.30
 3.04 Intr - 103418 103332   87  2  0   75  100     8 0.645   0.87
 3.03 Intr - 104310 104081  230  1  2  114  100   285 0.996  29.89
 3.02 Intr - 104688 104550  139  0  1   29   70   267 0.996  19.04
 3.01 Init - 106657 106361  297  1  0   98  115   214 0.915  20.32
 3.00 Prom - 106746 106707   40                              -3.56

 4.10 PlyA - 107583 107578    6                               1.05
 4.09 Term - 110531 110275  257  0  2   72   43   114 0.414   1.05
 4.08 Intr - 111750 111312  439  0  1   67   92   184 0.690   9.79
 4.07 Intr - 111906 111823   84  0  0   56   93    82 0.603   5.52
 4.06 Intr - 113092 112951  142  0  1   25   76   114 0.321   4.26
 4.05 Intr - 114719 114442  278  2  2  -31   -6   341 0.141   9.11
 4.04 Intr - 123890 123690  201  2  0   59   85   154 0.230  11.68
 4.03 Intr - 127058 126973   86  0  2   59  127    24 0.429   2.94
 4.02 Intr - 129994 129836  159  2  0   80   32    88 0.272   2.36
 4.01 Init - 130438 130408   31  1  1  114   73   -20 0.363  -1.00
 4.00 Prom - 134068 134029   40                              -5.76

 5.00 Prom + 134184 134223   40                              -5.06
 5.01 Init + 138722 138889  168  1  0   74   79   218 0.564  17.03
 5.02 Intr + 139380 139507  128  2  2   34   75   203 0.992  13.08
 5.03 Intr + 141744 141894  151  0  1   52   94   110 0.996   8.16
 5.04 Intr + 142393 142529  137  0  2   91   25   134 0.986   6.77
 5.05 Intr + 143888 143957   70  2  1  102  100    51 0.967   6.88
 5.06 Intr + 144139 144189   51  0  0   78   63    57 0.651   1.30
 5.07 Intr + 146250 146331   82  2  1   38   61    12 0.784  -7.29
 5.08 Intr + 147675 147806  132  1  0  113  103    28 0.930   7.42
 5.09 Intr + 148145 148287  143  0  2   60   94    70 0.935   4.87
 5.10 Intr + 159840 159974  135  2  0   68   96    84 0.834   7.96
 5.11 Term + 169889 169996  108  1  0   84   35    68 0.427  -0.39
 5.12 PlyA + 170470 170475    6                               1.05

 6.00 Prom + 177364 177403   40                              -5.16
 6.01 Init + 187855 189397 1543  0  1   85   81  3234 0.990 312.98

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  72228  72215   14  0  2   59  115    22 0.807   1.59
S.002 Init +  78165  78234   70  2  1   86   52    39 0.883   1.31

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815588r:102730705_102937361|GENSCAN_predicted_peptide_1|121_aa
XELIKGICVKDRNENEIGHSRRAAAIGITQVVISRITMSAPGMILLPVIMERLEKLHFMQ
PHLHGASGVWAFPTEMVSAVHSLLGLIDHAASALPDLPAVQKETGKGQEGQHLSEDKCPS
A
>gi568815588r:102730705_102937361|GENSCAN_predicted_CDS_1|366_bp
nnggagctcataaagggaatctgcgtgaaggacaggaatgaaaatgagattggtcattcc
cggagagctgcggccataggcatcacccaagtagttatttctcggatcaccatgtcagct
cctgggatgatcttgctgccagtcatcatggaaaggcttgagaaattgcacttcatgcag
cctcatcttcatggtgccagtggcgtgtgggcttttcccacagaaatggtatctgctgtt
cattccttacttggtttaattgaccatgctgccagcgcgctccctgatctgcctgctgtc
cagaaggagacagggaaggggcaggaagggcagcacctgtctgaggacaagtgtccatca
gcctaa
>gi568815588r:102730705_102937361|GENSCAN_predicted_peptide_2|431_aa
MALLLLQALPSPLSARAEPPQPETLSPIQGHFTFSDLQYIHLGSRDSNILLPAVFQLIEG
AAKMDGACFVQGIVLGPVRDMEVNVSHLTPKLLTLEEKDKEACVGTNNQSYICDTGHCCG
QSQCCNYYYELWWFWLVWTIIIILSCCCVCHHRRAKHRLQAQQRQHEINLIAYREAHNYS
ALPFYFRFLPNYLLPPYEEVVNRPPTPPPPYSAFQLQQQQLLPPQCGPAGGSPPGIDPTR
GSQGAQSSPLSEPSRSSTRPPSIADPDPSDLPVDRAATKAPGMEPSGSVAGLGELDPGAF
LDKDAECREELLKDDSSEHGAPDSKEKTPGRHRRFTGDSGIEVCVCNRGHHDDDLKEFNT
LIDDALDGPLDFCDSCHVRPPGDEEEGLCQSSEEQAREPGHPHLPRPPACLLLNTINEQD
SPNSQSSSSPS
>gi568815588r:102730705_102937361|GENSCAN_predicted_CDS_2|1296_bp
atggcgctcctgctcctccaggcgctgcccagccccttgtcagccagggctgaacccccg
cagccagaaacccttagtccaatccaaggtcactttacattttcagacctccagtatatt
catctgggaagcagggatagcaatatcctgttaccagctgtattccagctgatagaagga
gcagctaaaatggatggggcctgctttgtgcaaggcatcgtgcttggtcctgtgagggat
atggaagtgaacgtttcacatcttacccctaagttgcttactttagaggagaaggataag
gaagcctgtgtgggtaccaacaatcaaagctacatctgtgacacaggacactgctgtgga
cagtctcagtgctgcaactactactatgaactctggtggttctggctggtgtggaccatc
atcatcatcctgagctgctgctgtgtttgccaccaccgccgagccaagcaccgccttcag
gcccagcagcggcaacatgaaatcaacctgatcgcttaccgagaagcccacaattactca
gcgctgccattttatttcaggtttttgccaaactatttactacctccttatgaggaagtg
gtgaaccgacctccaactcctcccccaccatacagtgccttccagctacagcagcagcag
ctgctgcctccacagtgtggccctgcaggtggcagtcccccgggcatcgatcccaccagg
ggatcccagggggcacagagcagccccttgtctgagcccagcagaagcagcacaagaccc
ccaagcatcgctgaccctgatccctctgacctaccagttgaccgagcagccaccaaagcc
ccagggatggagcccagtggctctgtggctggcctgggggagctggacccgggggccttc
ctggacaaagatgcagaatgtagggaggagctgctgaaagatgacagctctgaacacggc
gcacccgacagcaaagagaagacgcctgggagacatcgccgcttcacaggtgactcgggc
attgaagtgtgtgtgtgcaaccggggccaccatgacgatgacctcaaagagttcaacaca
ctcatcgatgatgctctggatgggcccctggacttctgcgacagctgccatgtgcggccc
cctggtgatgaggaggaaggcctctgtcagtcctctgaggagcaggctcgagagcctggg
cacccgcacctgccacggccgcccgcatgcctgctgctgaacaccatcaacgagcaggac
tctcccaactcccagagcagcagctcccccagctag
>gi568815588r:102730705_102937361|GENSCAN_predicted_peptide_3|497_aa
MWELVALLLLTLAYLFWPKRRCPGAKYPKSLLSLPLVGSLPFLPRHGHMHNNFFKLQKKY
GPIYSVRMGTKTTVIVGHHQLAKEVLIKKGKDFSGRPQMATLDIASNNRKGIAFADSGAH
WQLHRRLAMATFALFKDGDQKLEKIICQEISTLCDMLATHNGQSIDISFPVFVAVTNVIS
LICFNTSYKNGDPELNVIQNYNEGIIDNLSKDSLVDLVPWLKIFPNKTLEKLKSHVKIRN
DLLNKILENYKEKFRSDSITNMLDTLMQAKMNSDNGNAGPDQDSELLSDNHILTTIGDIF
GAGVETTTSVVKWTLAFLLHNPQVKKKLYEEIDQNVGFSRTPTISDRNRLLLLEATIREV
LRLRPVAPMLIPHKANVDSSIGEFAVDKGTEVIINLWALHHNEKEWHQPDQFMPERFLNP
AGTQLISPSVSYLPFGAGPRSCIGEILARQELFLIMAWLLQRFDLEVPDDGQLPSLEGIP
KVVFLIDSFKVKIKSQG
>gi568815588r:102730705_102937361|GENSCAN_predicted_CDS_3|1494_bp
atgtgggagctcgtggctctcttgctgcttaccctagcttatttgttttggcccaagaga
aggtgccctggtgccaagtaccccaagagcctcctgtccctgcccctggtgggcagcctg
ccattcctccccagacacggccatatgcataacaacttcttcaagctgcagaaaaaatat
ggccccatctattcggttcgtatgggcaccaagactacagtgattgtcggccaccaccag
ctggccaaggaggtgcttattaagaagggcaaggacttctctgggcggcctcaaatggca
actctagacatcgcgtccaacaaccgtaagggtatcgccttcgctgactctggcgcacac
tggcagctgcatcgaaggctggcgatggccacctttgccctgttcaaggatggcgatcag
aagctggagaagatcatttgtcaggaaatcagtacattgtgtgatatgctggccacccac
aacggacagtccatagacatctcctttcctgtcttcgtggcggtaaccaatgtcatctcc
ttgatctgcttcaatacctcctacaagaatggggaccctgagttgaatgtcatacagaat
tacaatgaaggcatcatagacaacctgagcaaagacagcctggtggacctagtcccctgg
ttgaagattttccccaacaaaaccctggaaaaattaaagagccatgttaaaatacgaaat
gatctgctgaataaaatacttgaaaattacaaggagaaattccggagtgactctatcacc
aacatgctggacacactgatgcaagccaagatgaactcagataatggcaatgctggccca
gatcaagactcagagctgctttcagataaccacattctcaccaccataggggacatcttt
ggggctggcgtggagaccaccacctctgtggttaaatggaccctggccttcctgctgcac
aatcctcaggtgaagaagaagctctacgaggagattgaccagaatgtgggtttcagccgc
acaccaactatcagtgaccgtaaccgtctcctcctgctggaggccaccatccgagaggtg
cttcgcctcaggcccgtggcccctatgctcatcccccacaaggccaacgttgactccagc
atcggtgagtttgctgtggacaagggcacagaagttatcatcaatctgtgggcgctgcat
cacaatgagaaggagtggcaccagccggatcagttcatgcctgagcgtttcttgaatcca
gcggggacccagctcatctcaccgtcagtaagctatttgcccttcggagcaggacctcgc
tcctgtataggtgagatcctggcccgccaggagctcttcctcatcatggcctggctgctg
cagaggttcgacctggaggtgccagatgatgggcagctgccctccctggaaggcatcccc
aaggtggtctttctgatcgactctttcaaagtgaagatcaagagccagggctga
>gi568815588r:102730705_102937361|GENSCAN_predicted_peptide_4|558_aa
MPSPNHIFLRVSGIFAIEKVILGHPHLYSHSGLHRKVLKHKVHESRPGIMILQQIVAGGG
ILTEYHTLQVTITLFSQFSADGFGGCSQSFATRLHTPECNTRLREEDLLIASTSQPTIPK
MQRDLLPTATKDCDSPDYQAGRVHFQLTSELREPFSTCLDAAIVGYKDLPSIQATIPGKT
FINVKPAEVDVLVGKDWSSFFVNEVTLGGQKCSVIWDLLLQDGELTYLYTKSTDGAPTFN
VTVTMTARMLVLAVEEHCPLFPEKGTLDVELWDRVGAKFRELVPTGNYVPVTVWGDWALA
QLTGSNNYSDTTAQLGFDALTVEQVTKGHNELYPDFLAKLQDAVEKSVSDEHAQGILLRM
LAFENANHECKMAMHSVQQQNLPDREVLPEYIKYEGIGSDTKLFCAHKAILWARAMKDGN
QTGSTDSFLGACYNCGQLGHTRKNCTVKNLKAAKPAQKTQPNAPATVCPCCCKVSPHIDL
PATQNYSCWAYVPFPPLIQPLTWMDAPAEIYTNDSVWMPGATDDRCPTQPGKEGTAFNVP
TVINTLLCASDMHLVVSI
>gi568815588r:102730705_102937361|GENSCAN_predicted_CDS_4|1677_bp
atgcccagccccaaccatatatttctaagagtctcaggcatctttgccatcgagaaagtc
atccttggccacccccacctttactcccactctggcctccacaggaaagtgctgaagcac
aaagtccatgaaagcagaccagggataatgattctgcagcaaatagttgctggagggggc
atactcactgagtaccacactctacaggtgaccataacattattcagtcagttctctgct
gacggatttggaggctgttcccagtcttttgccacgagactgcacaccccagaatgcaac
acgcgcctccgcgaagaagacctactcatagcaagtacttcacagcctacaattcccaag
atgcaacgcgacctccttcccacagcaacgaaagactgcgactccccggactatcaggcc
gggagagtccacttccagctcacctcggagctccgggagcctttcagcacctgcttggat
gcagccatcgtgggctacaaggacttgccttccatccaggccaccatccctgggaaaacc
ttcatcaacgtcaagccagctgaggttgatgtcctggttggcaaagactggtcaagtttt
tttgtgaatgaggtgacacttgggggccagaaatgttctgtgatctgggacctactgctg
caggatggggaactgacctatctttataccaagagcaccgatggagcccccaccttcaat
gtcactgtcaccatgactgctaggatgctagtgttggcagtggaagaacattgtcctttg
tttcctgaaaagggaacattagacgtggaactatgggatcgtgttggtgcaaaattccgg
gaactggtcccaacaggaaattatgttcccgtcactgtttggggtgactgggccttggct
cagctcacaggctccaataattactctgacactactgcccaattaggctttgatgctctc
accgtggaacaagtaacaaagggtcacaatgaattatatcctgattttttagctaaatta
caagatgctgttgaaaaatctgtctctgatgagcacgctcaaggtattcttcttcgaatg
ttagcttttgagaatgcgaaccatgagtgtaaaatggccatgcattccgtccaacaacaa
aatttacctgatcgtgaggtgttgcctgagtatattaaatatgaaggcattggatcagac
acaaagctattctgtgcccacaaagctattctgtgggcacgggccatgaaggacggcaat
caaactggctcgactgattcttttcttggagcctgctataattgtggtcaacttggtcat
actcgaaaaaattgcacagttaaaaacttaaaggcagccaagccggctcaaaaaacacag
ccaaatgctcctgctactgtttgcccttgttgttgtaaagtttcccctcacattgattta
cctgctacacaaaattattcttgctgggcttatgtgccttttcctccacttattcaacct
ctcacctggatggatgctcctgcggaaatctatactaatgatagtgtgtggatgcctgga
gctacagatgaccgttgccccactcaaccaggaaaagaaggcactgcatttaatgttcct
acggttataaataccctcctctgtgcctcggacatgcaccttgttgtatccatctag
>gi568815588r:102730705_102937361|GENSCAN_predicted_peptide_5|434_aa
MGGRSKPASLGGASAQELAAGARRPESQAEETVSARPESQAEETVSARPESQAEETTYYG
QVLKRSADLQTNGCVTTARPVPKHIREALQNVHEEVALRYYGCGLVIPEHLENCWILDLG
SGSGRDCYVLSQLVGEKGHVTGIDMTKGQVEVAEKYLDYHMEKYGFQASNVTFIHGYIEK
LGEAGIKNESHDIVVSNCVINLVPDKQQVLQEAYRVLKVAKGSKSEVTVVTECVKHGGEL
YFSDVYTSLELPEEIRTHKVLWGECLGGALYWKELAVLAQKIGFCPPRLVTANLITIQNK
ELERVIGDCRFVSATFRLFKHSKTGPTKRCQVIYNGGITGHEKELMFDANFTFKEGEIVE
VDEETAAILKNSRFAQDFLIRPIGEKLPTSGGCSALELKDIITDPFKLAEESDSMKSRCV
PDAAGGCCGTKKSC
>gi568815588r:102730705_102937361|GENSCAN_predicted_CDS_5|1305_bp
atgggcgggcggagcaagcctgccagcctgggcggggcctcggcacaggagctggctgcg
ggagcccgccgtcctgagtcgcaggccgaggagacagtgagtgcgcgccctgagtcgcag
gccgaggagacagtgagtgcgcgccctgagtcgcaggccgaggagacaacctactacggg
caggtgctgaagagatcggcagacctccagaccaacggctgtgtcaccacagccaggccg
gtccccaagcacatccgggaagccttgcaaaatgtacacgaagaagtagccctaagatat
tatggctgtggtctggtgatccctgagcatctagaaaactgctggattttggatctgggt
agtggaagtggcagagattgctatgtacttagccagctggttggtgaaaaaggacacgtg
actggaatagacatgaccaaaggccaggtggaagtggctgaaaagtatcttgactatcac
atggaaaaatatggcttccaggcatctaatgtgacttttattcatggctacattgagaag
ttgggagaggctggaatcaagaatgagagccatgatattgttgtatcaaactgtgttatt
aaccttgtgcctgataaacaacaagtgcttcaggaggcatatcgggtgctgaaggttgca
aaaggtagtaaatccgaggtgacagttgtcactgaatgtgttaagcatggtggggagtta
tatttcagtgacgtctatacgagccttgaactgccagaagaaatcaggacacacaaagtt
ttatggggtgagtgtctgggtggtgctttatactggaaggaacttgctgtccttgctcaa
aaaattgggttctgccctccacgtttggtcactgccaatctcattacaattcaaaacaag
gaactggaaagagttatcggtgactgtcgttttgtttctgcaacatttcgcctcttcaaa
cactctaagacaggaccaaccaagagatgccaagttatttacaatggaggaattacagga
catgaaaaagaactaatgtttgatgccaattttacatttaaggaaggtgaaattgttgaa
gtggatgaagaaacagcagctatcttgaagaattcaagatttgctcaagattttctgatc
agaccaattggagagaagttgccaacatctggaggctgttctgctttggagttaaaggat
ataatcacagatccatttaagcttgcagaagagtctgacagtatgaagtccagatgtgtc
cctgatgctgctggaggctgctgtggcacaaagaaaagctgctaa
>gi568815588r:102730705_102937361|GENSCAN_predicted_peptide_6|515_aa
MAARRSLSARGRGILQAAAGRLLPLLLLSCCCGAGGCAAVGENEETVIIGLRLEDTNDVS
FMEGGALRVSERTRVKLRVYGQNINNETWSRIAFTEHERRRHSPGERGLGGPAPPEPDSG
PQRCGIRTSDIIILPHIILNRRTSGIIEIEIKPLRKMEKSKSYYLCTSLSTPALGAGGSG
STGGAVGGKGGSGVAGLPPPPWAETTWIYHDGEDTKMIVGEEKKFLLPFWLQVIFISLLL
CLSGMFSGLNLGLMALDPMELRIVQNCGTEKEKNYAKRIEPVRRQGNYLLCSLLLGNVLV
NTTLTILLDDIAGSGLVAVVVSTIGIVIFGEIVPQAICSRHGLAVGANTIFLTKFFMMMT
FPASYPVSKLLDCVLGQEIGTVYNREKLLEMLRVTDPYNDLVKEELNIIQGALELRTKTV
EDVMTPLRDCFMITGEAILDFNTMSEIMESGYTRIPVFEGERSNIVDLLFVKDLAFVDPD
DCTPLKTITKFYNHPLHFVFNDTKLDAMLEEFKKX
>gi568815588r:102730705_102937361|GENSCAN_predicted_CDS_6|1545_bp
atggcggcgcgccgcagcctcagcgctcgcggccgggggatcctgcaggcggctgcgggg
cggctgctgccgctgctcctgctgagctgctgctgcggtgcgggcggctgcgcagcggtg
ggcgagaatgaggagacggtgatcatcgggctgcgactggaggacacgaacgacgtgtcg
ttcatggaagggggggcgctgcgggtgagcgaacggacccgggtcaagctgcgggtgtac
gggcagaacatcaataacgagacgtggtcccgcatcgccttcaccgagcacgagcggcgg
cgccacagcccgggggagcgcgggctggggggccccgcgccgccagagccggacagcggc
ccccagcgatgcggcatccgcacctcagacatcatcatcttgccccacatcattctcaac
cgccgcacctcgggcatcatcgagatcgagatcaaaccgctacgcaagatggagaagagc
aagtcctattacctgtgcacgtcgctctccacgcccgccctgggcgccggcggctcgggg
tccacgggtggcgccgtcgggggcaagggtggctcgggggtggccgggctcccgccgccc
ccgtgggccgagaccacctggatttaccacgacggcgaggacaccaagatgatcgtaggc
gaagagaagaagttcctgctgcccttctggctgcaggtgatcttcatttcgctgctgctg
tgcctgtcgggcatgttcagcggcctcaacctggggctcatggccctggacccgatggag
ctgcgcatcgtgcagaactgcggcacggagaaggagaagaattacgccaagcgcatcgag
ccggtgcgcaggcagggcaactacctgctgtgctcactgctgctgggcaacgtgctggtc
aacaccacgctcaccatcctgctcgacgacatcgccggctcgggcctcgtggccgtggta
gtctccaccatcggtatcgtcatcttcggagagatcgtgccccaggccatctgctcccgg
catggcctggctgtgggggccaacaccatcttcctcaccaagtttttcatgatgatgacc
ttccccgcttcctacccggtcagcaagctgctggactgcgtcctgggccaggagataggc
accgtctataaccgggaaaaactgctggagatgctccgggtcaccgatccctacaacgac
ctcgttaaggaggagctgaacatcatccaaggggcgctggagctccgcaccaagacggtg
gaggacgtgatgaccccactccgggactgcttcatgatcaccggcgaagccatcctggac
ttcaacaccatgtctgagatcatggagagcggctacacccgcattccagtgtttgaaggg
gagcgctccaatatcgtggacctgctgtttgtcaaagacttggccttcgtggatcccgat
gactgtacccccctgaaaaccatcaccaaattttataaccaccccttgcactttgttttc
aatgacaccaagttggacgctatgctggaagaatttaagaaagnn