GENSCAN 1.0 Date run: 7-Nov-116 Time: 17:11:55

Sequence gi568815593r:97063064_97283191 : 220128 bp : 38.06% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  21521  21631  111  1  0   69  101    56 0.828   5.26
 1.02 Intr +  23232  23309   78  2  0   92   69    33 0.579   0.63
 1.03 Term +  26765  26875  111  1  0  132   49    92 0.940   7.28
 1.04 PlyA +  27007  27012    6                               1.05

 2.09 PlyA -  27613  27608    6                               1.05
 2.08 Term -  31972  31685  288  1  0   94   43   331 0.985  23.59
 2.07 Intr -  33824  33747   78  2  0   41  110    62 0.732   2.53
 2.06 Intr -  42222  42127   96  0  0   84   82   105 0.996   8.79
 2.05 Intr -  44437  44297  141  1  0   97   75   109 0.930  10.13
 2.04 Intr -  58977  58879   99  0  0   -6   67   126 0.313   0.49
 2.03 Intr -  60348  60295   54  0  0   75   79    52 0.425   1.26
 2.02 Intr -  61566  61403  164  1  2  136  110    59 0.539  11.67
 2.01 Init -  79513  79432   82  1  1   95   93    45 0.578   6.89
 2.00 Prom -  97064  97025   40                              -2.85

 3.12 PlyA -  97825  97820    6                               1.05
 3.11 Term - 100162  99998  165  1  0   55   34   188 0.996   7.03
 3.10 Intr - 102084 101988   97  2  1   47  100    62 0.923   2.39
 3.09 Intr - 104928 104404  525  2  0  106   80   592 0.978  51.33
 3.08 Intr - 105789 105697   93  2  0  116   72    34 0.871   2.76
 3.07 Intr - 108334 108143  192  0  0   50   76   187 0.999  11.39
 3.06 Intr - 110200 110112   89  1  2   59   93    74 0.999   2.85
 3.05 Intr - 114228 114053  176  1  2   70   63    86 0.999   3.04
 3.04 Intr - 114785 114669  117  0  0   51  111   114 0.998   9.52
 3.03 Intr - 116130 115992  139  0  1   88  106    55 0.974   6.42
 3.02 Intr - 120239 120063  177  2  0    4   95   219 0.966  13.39
 3.01 Init - 120906 120808   99  0  0   72   94   111 0.998  10.41
 3.00 Prom - 142525 142486   40                              -4.05

 4.02 PlyA - 142836 142831    6                               1.05
 4.01 Sngl - 144058 143450  609  1  0   44   38   288 0.681  15.34
 4.00 Prom - 155897 155858   40                              -3.65

 5.03 PlyA - 156656 156651    6                               1.05
 5.02 Term - 157442 156832  611  0  2   67   34   255 0.299  11.67
 5.01 Init - 171501 171423   79  0  1   95   75    22 0.398   2.87
 5.00 Prom - 171678 171639   40                              -3.65

 6.02 PlyA - 171961 171956    6                              -0.45
 6.01 Sngl - 172981 172571  411  1  0   60   44   200 0.580   8.84
 6.00 Prom - 173496 173457   40                              -8.15

 7.06 PlyA - 173614 173609    6                               1.05
 7.05 Term - 174658 173724  935  2  2   37   35   474 0.193  28.29
 7.04 Intr - 176575 176438  138  2  0   -5   94   136 0.223   4.41
 7.03 Intr - 177247 177186   62  0  2   33   72    83 0.141  -1.34
 7.02 Intr - 182621 182493  129  1  0   21  103    71 0.037   0.79
 7.01 Init - 186856 186522  335  1  2   93   46   219 0.055  15.02
 7.00 Prom - 206445 206406   40                              -0.95

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl -  94893  94642  252  0  0   75   43   145 0.925   3.54
S.002 Intr + 130279 130523  245  1  2   74  110   101 0.837   7.12
S.003 Term + 191976 192176  201  2  0   71   48   148 0.964   5.51

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593r:97063064_97283191|GENSCAN_predicted_peptide_1|99_aa
MVCPEFVPSDVRTFLEFLPYGAFVVSLASRVKLQTFMAIYFRSCESYCQCCDTCNAWDEA
AHVALVKIGSEKRLSGCHFHVTRMLLFLVVLTQTLSKVL

>gi568815593r:97063064_97283191|GENSCAN_predicted_CDS_1|300_bp
atggtgtgtccagagtttgttccttctgacgttcggacgtttttggagtttcttccttat
ggtgcgtttgtggtctcactggcttcacgagtgaagctgcagaccttcatggccatttat
tttagatcctgtgaatcctactgtcaatgttgtgacacctgcaatgcctgggatgaagct
gcccatgtggccctggtgaaaataggctctgagaagaggcttagtggctgccattttcat
gtcacccgaatgttgctgtttctagtggtactaactcagacactttcaaaagttttgtag

>gi568815593r:97063064_97283191|GENSCAN_predicted_peptide_2|333_aa
MDRTLESLRHIIAQVLPHRDPALVFKDLNVVSMLQEFWESKQQQKAAFPSEGVVVYESLP
APGPPFVSYVTLPGGSCFGNFQWSALALATGWETQEEVERTRMRGSEGERKAAIEEEALQ
PTCACRSIIEHPKCCLSRAEARRDAAKVALINSLFNELPSRRITKEFIMESVQEAVASTS
GTLDDADDPSTSVGAYHYMLESNMGKTMLEFQELMTIFQLLHWNGSLKALRETKCSRQEV
ISYYSQYSLDEKMRSHMALDWIMKERDSPGIVSQELRMALRQLEEARKAGQELRFYKEKK
EILSLALTQICSDPDTSSPSDDQLSLTALCGYH

>gi568815593r:97063064_97283191|GENSCAN_predicted_CDS_2|1002_bp
atggacagaaccttggaatctctgagacacatcattgcccaagtcttgcctcacagagat
ccggctctagtcttcaaagacttgaacgttgtgtcaatgttacaggaattttgggaaagc
aagcagcagcagaaggctgcattcccaagtgaaggtgtggtggtctatgagtcactgcca
gctcctgggcctccctttgtgagttacgtgaccctcccagggggaagctgttttggcaac
tttcagtggtcagccctggctttggccactgggtgggaaacacaggaggaagtggagcgg
acacggatgagaggcagcgaaggggaaagaaaggcagccattgaggaagaggctctccaa
cccacttgtgcctgcaggtccattattgagcatcctaagtgctgcttaagtagagccgag
gccaggcgggatgcagctaaagtggccctgatcaactccctcttcaatgagctgccctct
cgcaggatcaccaaggaattcattatggaaagtgttcaggaagcagtagcctccaccagc
ggcaccttagatgatgcagatgaccccagcaccagtgttggggcctatcactacatgctg
gagtcaaacatggggaagactatgctggagtttcaggagctgatgaccattttccaacta
ttgcactggaatggaagcctaaaagcccttcgtgaaacaaagtgttcccgacaggaagtc
atctcctactattctcagtattctctagatgaaaagatgcgcagccacatggccctggac
tggatcatgaaggagcgggactcaccaggaattgtctctcaagagctacgaatggccctg
aggcagttggaggaagccaggaaagcaggacaagaactacggttttacaaagaaaagaaa
gaaatattgagcttagccctgactcagatctgcagtgaccctgacacttcctcacccagt
gatgatcagctgagccttacggccctgtgtggctatcactag

>gi568815593r:97063064_97283191|GENSCAN_predicted_peptide_3|622_aa
MGNYSKILKSLLGPRSPTRVGQPELSKSPLPLERVREPSGAVKRLEAGITTCLTLCGCRR
RGIWVRTGAAMGKVNVAKLRYMSRDDFRVLTAVEMGMKNHEIVPGSLIASIASLKHGGCN
KVLRELVKHKLIAWERTKTVQGYRLTNAGYDYLALKTLSSRQVVESVGNQMGVGKESDIY
IVANEEGQQFALKLHRLGRTSFRNLKNKRDYHKHRHNVSWLYLSRLSAMKEFAYMKALYE
RKFPVPKPIDYNRHAVVMELINGYPLCQIHHVEDPASVYDEAMELIVKLANHGLIHGDFN
EFNLILDESDHITMIDFPQMVSTSHPNAEWYFDRDVKCIKDFFMKRFSYESELFPTFKDI
RREDTLDVEVSASGYTKEMQADDELLHPLGPDDKNIETKEGSEFSFSDGEVAEKAEVYGS
ENESERNCLEESEGCYCRSSGDPEQIKEDSLSEESADARSFEMTEFNQALEEIKGQVVEN
NSVTEFSEEKNRTENYNRQDGQRVQGGVPAGSDEYEDECPHLIALSSLNREFRPFRDEEN
VGAMNQYRTRTLSITSSGSAVSCSTIPPELVKQKVKRQLTKQQKSAVRRRLQKGEANIFT
KQRRENMQNIKSSLEAASFWGE

>gi568815593r:97063064_97283191|GENSCAN_predicted_CDS_3|1869_bp
atgggtaactattctaaaatccttaaatcgctcctgggccctcgaagtcctacccgcgta
ggtcagcccgaactatcaaaatccccacttcctctggagcgcgtgcgcgagccatctggc
gctgtaaagcgcttagaggctggaataacgacctgccttacgctttgcggctgtcgtcgg
agaggcatctgggttcggactggggccgccatggggaaagtgaatgtggccaagttgcgt
tacatgagccgagatgacttcagggtcttgaccgcggttgaaatgggcatgaagaaccat
gaaattgttcccggcagtttgattgcttctatagccagccttaaacatggtggctgtaat
aaagttttaagagaattagtgaaacataaactcatagcttgggagcgtaccaaaactgtc
cagggctatcggttgacaaatgcaggatatgattacctagctttgaaaacactttcttct
aggcaagtagttgagtctgttggaaaccagatgggtgttggcaaagaatcagatatttac
attgttgcaaatgaagaaggacaacaatttgcattaaagcttcacagactaggaagaacc
tcgtttcgaaatttgaaaaacaaacgcgattatcataaacataggcacaatgtgtcatgg
ctatatttatctcgtctctctgccatgaaggaatttgcctatatgaaggcattgtatgag
aggaaatttccagttccaaagccaattgattacaatcgtcatgcagtggtcatggaactc
ataaatggttatccactatgtcagatacaccatgttgaagatcctgcatcagtatatgat
gaagctatggaactaattgtcaaacttgcaaatcatgggctgattcatggagattttaat
gaatttaatctcattttggatgaaagtgaccatatcaccatgattgattttccacagatg
gtttcaacttctcatcccaatgctgagtggtattttgacagagatgttaaatgcattaaa
gatttctttatgaaacgtttcagctacgaaagtgagctttttccaacttttaaggatatc
aggagagaagacactcttgatgtggaggtttctgccagtggctacacaaaggaaatgcag
gcagatgatgaactgcttcatccattaggtccagatgataaaaatattgaaacaaaagag
ggatctgaattctcattttcagatggagaagtggcagaaaaagcagaggtttacgggtca
gaaaatgaaagtgaacggaactgtctagaagaatcagagggctgctattgcagatcatct
ggagaccctgaacaaataaaggaagacagtttatcagaagagagtgctgatgcacggagt
tttgaaatgactgaattcaatcaagctttagaagaaataaaagggcaggttgttgaaaac
aactctgtaactgaattttctgaggagaaaaacagaactgaaaattacaacaggcaagat
ggtcagagagttcaaggaggagtccctgctggctctgacgagtatgaagatgaatgccct
catctaattgccttgtcgtcattaaatagagaattcaggcctttcagagatgaagaaaat
gtgggagctatgaatcagtatagaacaagaactctgagtatcacttcttcaggcagtgct
gtaagctgttcaacaattcctccagaactggtgaaacagaaggtgaaacgtcagttgaca
aaacagcaaaaatcagctgtcagacgtcgattgcagaaaggagaagcaaatatatttacc
aagcaacgtagggaaaacatgcaaaatatcaaatcaagtttggaagcagccagcttttgg
ggagaataa

>gi568815593r:97063064_97283191|GENSCAN_predicted_peptide_4|202_aa
MKLDPHFSPYRKTNSRYIKDLNLRPETIKILEDNIRKTLLDIGLGKEFMTKNPKANAIKT
KTNRWYLIKLKSFFMGKGAVSRVNRKPTEWEKIFTVYTYDKGLISRIYNELKQISNEKIN
NPIKKWGKDMNRQFSKEDIQIPKKHEKMLNITNEQRNLNQNHNAIPHYSCKIGHNQKNQK
IVDVGMDAENREHFYAAGGNVN

>gi568815593r:97063064_97283191|GENSCAN_predicted_CDS_4|609_bp
atgaaattggatcctcatttctcaccttatagaaaaaccaactcaagatacattaaggac
ttaaacctaagacctgaaaccataaaaatcctagaagataacattagaaaaacccttcta
gacattggcttaggcaaagagttcatgaccaagaacccaaaagcaaatgcaataaaaaca
aagacaaataggtggtacttaattaaactaaagagctttttcatgggaaaaggagcagtc
agcagagtaaacagaaaacccacagagtgggagaaaatcttcacagtctatacatacgac
aaaggactaatatccagaatctacaatgaactcaaacaaatcagcaacgaaaaaataaac
aatcccatcaaaaagtggggtaaggacatgaatagacagttctcaaaagaagatatacaa
atacccaaaaaacatgaaaaaatgctcaacatcaccaatgaacagagaaatctaaatcaa
aatcacaatgcgataccacactactcctgcaagattggccataatcaaaaaaatcaaaaa
atagtagatgttggcatggatgcggagaacagggaacacttctacgctgctggtgggaat
gtaaactag

>gi568815593r:97063064_97283191|GENSCAN_predicted_peptide_5|229_aa
MDEAGNHHSKQLLQGQKTKDCMFSLTVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDM
IIYPENPIVSAQNLLKLISNFSKVSGYKIKVQKSQAFLYTNNRQAESQIMSELPFTIASK
RIKYLGIQLTRDVKDLFKENYKALLKEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIYK
FNAIPIKLPMTFFTELEENYFKVHMEPKKSPHCQVNPKPKEQSWSHNAT

>gi568815593r:97063064_97283191|GENSCAN_predicted_CDS_5|690_bp
atggatgaagctggaaaccatcattctaagcaactattgcaaggacagaaaaccaaagac
tgcatgttctcactcacagtgttggaagttctggccagggcaatcaggcaggagaaggaa
ataaagggtattcaattaggaaaagaggaagtcaaattgtctctgtttgcagatgacatg
attatatatccagaaaaccccatcgtctcagcccaaaatctccttaagctaataagcaac
ttcagcaaagtctcaggatataaaatcaaagtgcaaaaatcacaagcattcttatacacc
aataacagacaagcagagagccaaattatgagtgaactcccattcacaattgcttcaaag
agaataaaatacctaggaatccaacttacaagggatgtgaaggacctcttcaaggagaac
tacaaagcactgctcaaggaaataaaagaggatacaaacaaatggaagaacattccatgc
tcatgggtaggaagaatcaatatcgtgaaaatggccatactgcccaaggtaatttataaa
ttcaatgccattcccatcaagctaccaatgactttcttcacagaattggaagaaaactac
tttaaagttcatatggaaccaaaaaagagcccgcattgccaagtcaatcctaagccaaaa
gaacaaagctggagccataacgctacctga

>gi568815593r:97063064_97283191|GENSCAN_predicted_peptide_6|136_aa
MSELPFTIVSKRIKYLGIQLTRDVKDLFKENYKALLNEIKEDTNKWKNIPCLWVGRINIV
KMAILLKVIYKFNAIPIKPPMTFFTELEKTTLKFIWNQKRARIAKTILSKKNKAGGITLP
DFKLYYKATVTKTAWY

>gi568815593r:97063064_97283191|GENSCAN_predicted_CDS_6|411_bp
atgagtgaactcccattcacaattgtttcaaagagaataaaatacctaggaatccaactt
acaagggatgtgaaggacctcttcaaggagaactacaaagcactgctcaatgaaataaaa
gaggatacaaacaaatggaagaacattccatgcttatgggtaggaagaatcaatatcgtg
aaaatggccatactgctcaaggtaatttataaattcaatgccatccccatcaagccacca
atgactttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaaga
gcccgcattgccaagacaatcctaagcaaaaagaacaaagctggaggcatcacactacct
gacttcaaactatactacaaggctacagtaactaaaacagcatggtactga

>gi568815593r:97063064_97283191|GENSCAN_predicted_peptide_7|532_aa
MEVTQPRVPEPMLAVEDVCVGKDLEQNAEVQARYEDHPHGGAFWHGVSWQSPFEWGGHSH
RGSGLQCSNESICARVVTWCEISACASVKARGVVVRDRAGGEGTAKQGDGLLKACKRDTF
SDKARALISEGFRSDSKGLNLSWEIMSQTMFEASMDCQTVAAEQWVQRTVHEPKQDKTSR
GTIRQQHSRFTTIRCSADTAVADTQANRVWSGPLANSNRPAAPHHTYSKIDHIVGSKALL
SKCKRTEIITNCLSDHSAIKLELRIKNLTQNRSTTWKLNNLLLNDYWVHNEMKAEIKMFF
ETNENKDTTYQNPWDAFKAVCRGKFIALNAHKRKQERSKIDTLTSQLKELEKQGQTHSKA
SRRQEITKIKAELKEIETQRTLQEINESRSWFFERINKIDRPLARLLKKKREKNQIDTIK
NDKGDMTTDPTEIQTTIREYYKHLYANKLENLEEMDKFLDTYTIPRLNQEEVESLNRPIT
GSEIVAIINSLPTKRSPRTRWIHSRILPEVQGGTGTVPSETIPINRKRGNPP

>gi568815593r:97063064_97283191|GENSCAN_predicted_CDS_7|1599_bp
atggaggtgacccagcccagggtgccagagcccatgctggctgtggaggatgtctgtgtg
ggaaaagacctcgagcagaatgctgaagtccaagcaaggtacgaagatcatcctcatgga
ggggcattctggcacggggtatcatggcagagcccatttgagtggggcggacattcccac
agagggtctgggctccagtgcagtaatgagagcatctgtgcaagagtggtgacctggtgt
gagatcagtgcctgtgctagtgtgaaagcaagaggagtggtagtcagagacagagcaggg
ggagaaggaactgccaaacagggagatggcctactgaaggcttgcaaacgtgacacgttc
agtgacaaagcaagggctctaattagcgaagggttccggtctgacagtaaagggcttaac
ctttcttgggaaataatgagccagacaatgtttgaagcctccatggattgccagacagtg
gctgcagaacagtgggtgcagcgcactgtgcacgagccgaagcaggacaaaacttccaga
ggaacgatcagacagcagcattcgcggttcacaacaatccgctgttctgcagacactgct
gttgctgatacccaggcaaacagggtctggagtggacctctagcaaactccaacagacct
gcagcaccacaccacacctattccaaaattgaccacatagttggaagtaaagctctcctc
agcaaatgtaaaagaacagaaattataacaaactgtctctcagaccacagtgcaatcaaa
ctagaactcaggattaagaatctcactcaaaaccgctcaactacatggaaactgaacaac
ctgctcctgaatgactactgggtacataacgaaatgaaggcagaaataaagatgttcttt
gaaaccaacgagaacaaagacacaacataccagaatccctgggacgcattcaaagcagtg
tgtagagggaaatttatagcactaaatgcccacaagagaaagcaggaaagatctaaaatt
gacaccctaacatcacaattaaaagaactagaaaagcaagggcaaacacattcaaaagct
agcagaaggcaagaaataactaaaatcaaagcagaactgaaggaaatagagacacaaaga
acccttcaagaaattaatgaatccaggagctggttttttgaaaggatcaacaaaattgat
agaccgctagcaagactactaaagaagaaaagagagaagaatcaaatagacacaataaaa
aatgataaaggggatatgaccactgatcccacagaaatacaaactaccatcagagaatac
tacaaacacctctacgcaaataaactagaaaatctagaagaaatggataaattccttgac
acatacaccatcccaagactaaaccaggaagaagttgaatctctgaataggccaataaca
ggctctgaaattgtagcaataatcaatagcttaccaaccaaaaggagtcccaggaccaga
tggattcacagccgaattctaccagaggtacaaggaggaactggtaccgttccttctgaa
actattccaattaacagaaaaagagggaatcctccctaa