GENSCAN 1.0 Date run: 2-Nov-116 Time: 22:11:18 Sequence gi568815591r:99828853_100029620 : 200768 bp : 44.18% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 9676 9725 50 0 2 68 100 18 0.468 1.42 1.02 Intr + 15291 15390 100 0 1 83 66 119 0.719 9.31 1.03 Intr + 27981 28047 67 2 1 98 86 19 0.769 1.18 1.04 Intr + 30978 31138 161 1 2 96 116 172 0.999 20.51 1.05 Intr + 32761 32987 227 0 2 100 82 161 0.792 13.48 1.06 Intr + 34685 34847 163 2 1 13 82 104 0.606 2.28 1.07 Term + 35683 35700 18 0 0 84 33 28 0.199 -4.78 1.08 PlyA + 36262 36267 6 1.05 2.02 PlyA - 36991 36986 6 1.05 2.01 Sngl - 48181 47210 972 1 0 68 40 642 0.942 54.15 2.00 Prom - 61188 61149 40 -2.76 3.00 Prom + 67670 67709 40 -5.96 3.01 Init + 68252 68498 247 1 1 68 100 167 0.850 13.66 3.02 Term + 70598 70695 98 0 2 76 40 50 0.560 -2.87 3.03 PlyA + 71639 71644 6 1.05 4.11 PlyA - 71668 71663 6 1.05 4.10 Term - 72198 71909 290 1 2 13 39 104 0.064 -6.46 4.09 Intr - 74463 74366 98 2 2 58 105 66 0.216 4.95 4.08 Intr - 74746 74724 23 1 2 113 85 14 0.451 -0.06 4.07 Intr - 79960 79730 231 1 0 17 94 264 0.838 17.87 4.06 Intr - 80808 80713 96 0 0 84 90 81 0.873 8.11 4.05 Intr - 87927 87850 78 0 0 85 75 43 0.719 2.45 4.04 Intr - 90558 90157 402 0 0 90 88 655 0.603 60.32 4.03 Intr - 100769 100475 295 1 1 98 10 335 0.181 23.71 4.02 Intr - 117644 117512 133 0 1 59 69 82 0.006 3.10 4.01 Init - 127475 127427 49 2 1 86 58 18 0.006 -2.29 4.00 Prom - 130428 130389 40 -7.36 5.05 PlyA - 137906 137901 6 1.05 5.04 Term - 138434 138151 284 0 2 99 49 298 0.989 22.59 5.03 Intr - 139578 139303 276 1 0 -11 73 272 0.162 13.29 5.02 Intr - 143154 142894 261 1 0 126 73 246 0.990 24.36 5.01 Init - 147168 147093 76 0 1 72 101 160 0.936 14.95 5.00 Prom - 148333 148294 40 -5.56 6.00 Prom + 149729 149768 40 -5.26 6.01 Init + 151921 151996 76 0 1 72 101 158 0.735 14.75 6.02 Intr + 154292 154552 261 0 0 116 73 291 0.925 27.86 6.03 Intr + 158406 158685 280 1 1 104 91 296 0.990 28.04 6.04 Intr + 163544 163841 298 2 1 78 10 289 0.139 16.98 6.05 Intr + 168727 168934 208 0 1 7 75 156 0.556 4.85 6.06 Term + 172203 172342 140 1 2 66 41 138 0.945 5.03 6.07 PlyA + 172758 172763 6 1.05 7.00 Prom + 194600 194639 40 -6.26 7.01 Init + 194658 195080 423 2 0 61 98 457 0.594 40.35 7.02 Intr + 195302 195455 154 1 1 87 94 54 0.289 5.55 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 184663 184728 66 0 0 84 39 117 0.920 4.34 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591r:99828853_100029620|GENSCAN_predicted_peptide_1|261_aa MKTHQKDNELLSVHRQKLYEGQQPMLVIMDPDMIKTVLVKECYSVFTNQMHRVDFFQQMI DSQNSKETKSHKALSDLELVAQSIIIIFAAYDTTSTTLPFIMYELATHPDVQQKLQEEID AVLPNKAPVTYDALVQMEYLDMVVNETLRLFPVVSRVTRVCKKDIEINGVFIPKGLAVMV PIYALHHDPKYWTEPEKFCPERFSKKNKDSIDLYRYIPFGAGPRNCIGMRFALTNIKLAV IRALQNFSFKPCKETQAVIIT >gi568815591r:99828853_100029620|GENSCAN_predicted_CDS_1|786_bp atgaaaacacatcagaaagacaatgagttgcttagtgttcatagacagaagctgtatgag gggcaacagcccatgctggtcatcatggatcccgacatgatcaaaacagtgttagtgaaa gaatgttactctgtcttcacaaaccagatgcatcgagtagatttctttcaacagatgatc gactcccagaattccaaagaaacaaagtcccataaagctctgtctgatctggagcttgtg gcccagtcaattatcatcatttttgctgcctatgacacaactagcaccactctccccttc attatgtatgaactggccactcaccctgatgtccagcagaaactgcaggaggagattgac gcagttttacccaataaggcacctgtcacctacgatgccctggtacagatggagtacctt gacatggtggtgaatgaaacgctcagattattcccagttgttagtagagttacgagagtc tgcaagaaagatattgaaatcaatggagtgttcattcccaaagggttagcagtgatggtt ccaatctatgctcttcaccatgacccaaagtactggacagagcctgagaagttctgccct gaaaggttcagtaagaagaacaaggacagcatagatctttacagatacataccttttgga gctggaccccgaaactgcattggcatgaggtttgctctcacaaacataaaacttgctgtc attagagcactgcagaacttctccttcaaaccttgtaaagagactcaggctgttatcatc acctaa >gi568815591r:99828853_100029620|GENSCAN_predicted_peptide_2|323_aa MWQKNQTSLADFILEGLFDDSLTHLFLFSLTMVVFLIAVSGNTLTILLICIDPQLHTPMY FLLSQLSLMDLMHVSTIILKMATNYLSGKKSISFVGCATQHFLYLCLGGAECFLLAVMSY DRYVAICHPLRYAVLMNKKVGLMMAVMSWLGASVNSLIHMAILMHFPFCGPRKVYHFYCE FPAVVKLVCGDITVYETTVYISSILLLLPIFLISTSYVFILQSVIQMRSSGSKRNAFATC GSHLTVVSLWFGACIFSYMRPRSQCTLLQNKVGSVFYSIITPTLNSLIYTLRNKDVAKAL RRVLRRDVITQCIQRLQLWLPRV >gi568815591r:99828853_100029620|GENSCAN_predicted_CDS_2|972_bp atgtggcagaagaatcagacctctctggcagacttcatccttgaggggctcttcgatgac tcccttacccaccttttccttttctccttgaccatggtggtcttccttattgcggtgagt ggcaacaccctcaccattctcctcatctgcattgatccccagcttcatacaccaatgtat ttcctgctcagccagctctccctcatggatctgatgcatgtctccacaatcatcctgaag atggctaccaactacctatctggcaagaaatctatctcctttgtgggctgtgcaacccag cacttcctctatttgtgtctaggtggtgctgaatgttttctcttagctgtcatgtcctat gaccgctatgttgccatctgtcatccactgcgctatgctgtgctcatgaacaagaaggtg ggactgatgatggctgtcatgtcatggttgggggcatccgtgaactccctaattcacatg gcgatcttgatgcacttccctttctgtgggcctcggaaagtctaccacttctactgtgag ttcccagctgttgtgaagttggtatgtggcgacatcactgtgtatgagaccacagtgtac atcagcagcattctcctcctcctccccatcttcctgatttctacatcctatgtcttcatc cttcaaagtgtcattcagatgcgctcatctgggagcaagagaaatgcctttgccacttgt ggctcccacctcacggtggtttctctttggtttggtgcctgcatcttctcctacatgaga cccaggtcccagtgcactctattgcagaacaaagttggttctgtgttctacagcatcatt acgcccacattgaattctctgatttatactctccggaataaagatgtagctaaggctctg agaagagtgctgaggagagatgttatcacccagtgcattcaacgactgcaattgtggttg ccccgagtgtag >gi568815591r:99828853_100029620|GENSCAN_predicted_peptide_3|114_aa MTKLSWNPKSWDPVKRSSGREDEDWDESIEVPILSACLMVTTKIKEDQPHPQRVALQDLP PPERRQTTTMQDYTAVEMVKLGNLLFGLSDTVPSLMKSVGDWHKYTPFYQSPFK >gi568815591r:99828853_100029620|GENSCAN_predicted_CDS_3|345_bp atgactaaattgtcctggaaccctaagtcctgggaccccgtgaagagatctagtggcagg gaggatgaggactgggatgagtccatagaggttcccatcctctcagcatgtctgatggtc actaccaagataaaagaagaccaaccacacccacagagggtagccctacaagacttgcct ccacctgaaagaaggcagaccaccacaatgcaggattataccgctgtggaaatggtgaag ctgggaaatctcctcttcggcctttcagacacggtgccatccttaatgaagtcagtaggg gactggcacaagtatacacctttttatcagtctcccttcaagtga >gi568815591r:99828853_100029620|GENSCAN_predicted_peptide_4|564_aa MGFHHVGQAGLKLLTSGSQRLSLPAFKTIVEEGLIAGVGSGLKPGEDFTGPKHSGVEKVT KMCGRFLRRLLAEESRRSTPVGRLLLPVLLGFRLVLLAASGPGVYGDEQSEFVCHTQQPG CKAACFDAFHPLSPLRFWVFQVILVAVPSALYMGFTLYHGSSMEAEDIQEELTCPICLDY FQDPVSIECGHNFCRGCLHRNWAPGGGPFPCPECRHPSAPAALRPNWALARLTEKTQRRR LGPVPPGLCGRHWEPLRLFCEDDQRPVCLVCRESQEHQTHAMAPIDEAFESYRTGNFDIH VDEWKRRLIRLLLYHFKQEEKLLKSQRNLVAKMKKVMHLQDVEVKNATQWKDKIKSQRMR ISTEFSKLHNFLVEEEDLFLQRLNKEEEETKKKLNENTLKLNQTIASLKKLILEVGEKSQ APTLELLQNPKEVLTRSEIQDVNYSLEAVKVKTVCQIPLMKEMLKRFQERSGEGDGIGKI LEDIMAKDFLNIIKTKSTDPRCSTNPEYEKYEEKYAKAHHNLLETSVKKKILNATRGEDS YLQWNKDEVDITLLIGNNASKQTL >gi568815591r:99828853_100029620|GENSCAN_predicted_CDS_4|1695_bp atggggtttcaccatgttggccaggctggtctcaaactcctgacctcaggatcccagaga ctcagccttcctgcattcaaaaccatagtggaagaaggcctgattgctggggtgggcagt ggcctgaaacctggggaggacttcactgggcctaaacattctggtgtagagaaagtcact aagatgtgtggcaggttcctgcggcggctgctggcggaggagagccggcgctccaccccc gtggggcgcctcttgcttcccgtgctcctgggattccgccttgtgctgctggctgccagt gggcctggagtctatggtgatgagcagagtgaattcgtgtgtcacacccagcagccgggc tgcaaggctgcctgcttcgatgccttccaccccctctccccgctgcgtttctgggtcttc caggtcatcttggtggctgtacccagcgccctctatatgggtttcactctgtatcacgga agcagcatggaagctgaggacatccaggaggagttgacctgccccatctgcctggactat ttccaggacccggtgtccatcgagtgcggccacaacttctgccgcggctgcctgcaccgc aactgggcgccgggcggcggcccgttcccctgccccgaatgtcggcacccatcggcgccc gccgcgctgcgacccaactgggccctggccaggctgactgagaagacgcagcgccggcgc ctgggccccgtgcccccgggcctgtgcggccgccactgggagccgctgcggctcttctgc gaggacgaccagcggccagtgtgcctggtgtgcagggagtcccaggagcaccagactcac gccatggcacccatcgacgaggccttcgagagctaccggacaggtaactttgacatccac gtggatgaatggaagagaagactaattaggctgctcttgtaccattttaagcaggaggag aaacttcttaagtctcagcgtaatctcgtggccaagatgaagaaagtcatgcatttacag gatgtagaagtgaagaacgccacacagtggaaggataagataaagagtcagcgaatgaga atcagcacggagttttcaaagctgcacaacttcctggttgaagaagaggacctgtttctt cagagattgaacaaagaagaagaagagacgaagaagaagctgaatgagaacacgttaaaa ctcaatcaaactatcgcttcattgaagaagctcatcttagaggtgggggagaagagccag gctcccaccctggagctgcttcagaatccaaaagaagtgttgaccaggagtgagatccag gatgtgaactattctcttgaagctgtaaaggtgaagacagtgtgccagataccattgatg aaggaaatgctaaagcgattccaagagaggagtggggagggagatggaataggaaaaata cttgaagacataatggccaaagattttctgaatataataaaaactaagtccacagatcca agatgctcaacaaaccctgagtatgagaaatatgaagaaaaatatgccaaggctcatcat aacttgctggaaaccagtgttaaaaagaaaatcttaaatgcaaccaggggagaggatagt tacttgcagtggaacaaagacgaagttgacattacacttctcattggaaacaatgccagt aagcaaactctttaa >gi568815591r:99828853_100029620|GENSCAN_predicted_peptide_5|298_aa MVRMVPVLLSLLLLLGPAVPQENQDGRYSLTYIYTGLSKHVEDVPAFQALGSLNDLQFFR YNSKDRKSQPMGLWRQVEGMEDWKQDSQLQKAREDIFMETLKDIVEYYNDSNGSHVLQGR FGCEIENNRSSGAFWKYYYDGKDYIEFNKEIPAWVPFDPAAQITKQKWEAEPVYVQRAKA YLEEECPATLRKYLKYSKNILDRQDPPSVVVTSHQAPGEKKKLKCLAYDFYPGKIDVHWT RAGEVQEPELRGDVLHNGNGTYQSWVVVAVPPQDTAPYSCHVQHSSLAQPLVVPWEAS >gi568815591r:99828853_100029620|GENSCAN_predicted_CDS_5|897_bp atggtaagaatggtgcctgtcctgctgtctctgctgctgcttctgggtcctgctgtcccc caggagaaccaagatggtcgttactctctgacctatatctacactgggctgtccaagcat gttgaagacgtccccgcgtttcaggcccttggctcactcaatgacctccagttctttaga tacaacagtaaagacaggaagtctcagcccatgggactctggagacaggtggaaggaatg gaggattggaagcaggacagccaacttcagaaggccagggaggacatctttatggagacc ctgaaagacatcgtggagtattacaacgacagtaacgggtctcacgtattgcagggaagg tttggttgtgagatcgagaataacagaagcagcggagcattctggaaatattactatgat ggaaaggactacattgaattcaacaaagaaatcccagcctgggtccccttcgacccagca gcccagataaccaagcagaagtgggaggcagaaccagtctacgtgcagcgggccaaggct tacctggaggaggagtgccctgcgactctgcggaaatacctgaaatacagcaaaaatatc ctggaccggcaagatcctccctctgtggtggtcaccagccaccaggccccaggagaaaag aagaaactgaagtgcctggcctacgacttctacccagggaaaattgatgtgcactggact cgggccggcgaggtgcaggagcctgagttacggggagatgttcttcacaatggaaatggc acttaccagtcctgggtggtggtggcagtgcccccgcaggacacagccccctactcctgc cacgtgcagcacagcagcctggcccagcccctcgtggtgccctgggaggccagctag >gi568815591r:99828853_100029620|GENSCAN_predicted_peptide_6|420_aa MVRMVSVLLSLLLLLGPAVLQETRDGHYSLTYLYTGLSRSGKGTHRLQGTVFLNGHAFFH YNSEDRKAEPLGPWRHAEGVEDWEKQSQVQKAREDIFMETLNNIMEYYNDGNDPPSVVVT SHQAPGEKKKLKCLAYDFYPGKIDVHWTRAGEVQEPELRGDVLHGGNGTYLTWLLVHVPP QDTAPYSCHVQHSSLAQPLVVPGEARMCGRFLRWWLLAEESWHSTPVGRLLFPVLLGFRL VLLAASGPGVYGDEQSEFVCHTQQPGCKAACFDAFHPLSPLRFWVFQVILVAVPSVLYMG FTLYHAPVVGYTPWESGVFQYREPPRCGPSEVLERAPRPPAPATLPAFVGRSDGGLRLSV LFVAVTAVCSCQRLASKMYYGNLDIYELPCNGKVVTATSRVPAKFYPEWNKLLIVPFPGF >gi568815591r:99828853_100029620|GENSCAN_predicted_CDS_6|1263_bp atggtaagaatggtgtctgtcctgctgtctctgctgctgcttctgggtcctgctgtcctc caggagacccgagatggtcattactctctgacctatctctacactgggctgtccaggtct ggcaaaggcacccacaggctgcagggcactgtcttcctcaatggccatgccttcttccac tacaacagtgaagacaggaaggctgagcccctgggaccatggagacatgcggaaggagta gaggactgggagaagcagagccaagttcagaaggccagggaggacatctttatggagacc ctgaacaacatcatggagtattacaacgacggtaacgatcctccctctgtggtggtcacc agccaccaggccccaggagaaaagaagaaactgaagtgcctggcctacgacttctaccca gggaaaattgatgtgcactggactcgggccggcgaggtgcaggagcctgagttacgggga gatgttcttcacggtggaaacggcacttacctgacctggttgttggtgcatgtgcccccg caggacacagccccctactcctgccacgtgcagcacagcagcctggcccagcccctcgtg gtgcccggggaggccaggatgtgtggcaggttcctgaggtggtggctgctggcggaggag agctggcactccacccccgtggggcgcctcctgtttcccgtgctcctgggattccgcctt gtgctgctggctgccagtgggcctggagtctatggcgatgagcagagtgaattcgtgtgt cacacccagcagccgggctgcaaggctgcctgcttcgatgccttccacccgctctccccg ctgcgtttctgggtcttccaggtcatcttggtggctgtacctagcgtcctctacatgggt ttcactctgtatcacgcccccgtcgtagggtacacgccctgggaatcgggggtcttccag tatagggaaccaccgagatgcgggccctccgaagtcctagagagggcgcctcggccgccg gcacccgccacacttccggcctttgtgggccgcagcgacggcggtctgcggctgtcggtt ctgtttgttgctgtcactgctgtttgttcttgccagcggctagcctccaagatgtactat ggaaatttagacatttatgaattgccttgcaatggaaaggtagtaacagcaacttcgaga gttcctgccaagttctaccccgagtggaacaagttgctgattgttcccttccctggattt taa >gi568815591r:99828853_100029620|GENSCAN_predicted_peptide_7|193_aa MTAESREATGLSPQAAQEKDGIVIVKVEEEDEEDHMWGQDSTLQDTPPPDPEIFRQRFRR FCYQNTFGPREALSRLKELCHQWLRPEINTKEQILELLVLEQFLSILPKELQVWLQEYRP DSGEEAVTLLEDLELDLSGQQVPGQVHGPEMLARGMVPLDPVQESSSFDLHHEATQSHFK HSSRKPRLLQSRX >gi568815591r:99828853_100029620|GENSCAN_predicted_CDS_7|579_bp atgactgctgaatcacgggaagccacgggtctgtccccacaggctgcacaggagaaggat ggtatcgtaatagtgaaggtggaagaggaagatgaggaagaccacatgtgggggcaggat tccaccctacaggacacgcctcctccagacccagagatattccgccaacgcttcaggcgc ttctgttaccagaacacttttgggccccgagaggctctcagtcggctgaaggaactttgt catcagtggctgcggccagaaataaacaccaaggaacagatcctggagcttctggtgcta gagcagtttctttccatcctgcccaaggagctccaggtctggctgcaggaataccgcccc gatagtggagaggaggccgtgacccttctagaagacttggagcttgatttatcaggacaa caggtcccaggtcaagttcatggacctgagatgctcgcaagggggatggtgcctctggat ccagttcaggagtcctcgagctttgaccttcatcacgaggccacccagtcccacttcaaa cattcgtctcggaaaccccgcctcttacagtcacgagnn