GENSCAN 1.0 Date run: 6-Nov-116 Time: 16:49:20 Sequence gi568815592f:47607246_47818439 : 211194 bp : 38.86% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1878 2059 182 2 2 65 114 176 0.878 15.64 1.02 Intr + 5228 5291 64 2 1 45 76 85 0.211 0.80 1.03 Intr + 9564 9648 85 2 1 78 76 8 0.013 -2.93 1.04 Term + 14963 15150 188 0 2 65 44 116 0.007 1.57 1.05 PlyA + 15762 15767 6 1.05 2.04 PlyA - 16789 16784 6 1.05 2.03 Term - 24814 24574 241 0 1 41 41 159 0.299 1.01 2.02 Intr - 26557 26433 125 1 2 52 44 71 0.347 -2.44 2.01 Init - 28299 28144 156 0 0 89 93 107 0.827 11.26 2.00 Prom - 28878 28839 40 -3.65 3.05 PlyA - 29637 29632 6 1.05 3.04 Term - 31739 30979 761 0 2 2 44 473 0.135 26.90 3.03 Intr - 33801 33533 269 2 2 12 71 156 0.051 2.55 3.02 Intr - 38380 38240 141 0 0 58 31 161 0.066 5.95 3.01 Init - 53591 53539 53 2 2 59 72 56 0.050 1.78 3.00 Prom - 55923 55884 40 -3.75 4.03 PlyA - 57735 57730 6 1.05 4.02 Term - 58985 58800 186 2 0 86 39 355 0.996 26.91 4.01 Init - 59775 59743 33 0 0 64 110 20 0.970 1.72 4.00 Prom - 64845 64806 40 -3.25 5.00 Prom + 68726 68765 40 -6.55 5.01 Init + 68899 68905 7 0 1 78 71 5 0.153 -1.33 5.02 Intr + 71738 71889 152 0 2 116 75 59 0.681 6.36 5.03 Intr + 72826 73059 234 0 0 89 109 73 0.942 6.46 5.04 Intr + 74020 75381 1362 0 0 66 111 791 0.093 67.07 5.05 Intr + 79747 79788 42 0 0 117 79 40 0.053 3.52 5.06 Intr + 84589 84665 77 0 2 23 95 39 0.026 -4.51 5.07 Intr + 86427 86503 77 0 2 91 105 46 0.075 4.84 5.08 Intr + 100059 100093 35 1 2 109 106 20 0.651 2.92 5.09 Intr + 100979 101033 55 1 1 49 95 45 0.764 -1.07 5.10 Intr + 103490 103641 152 0 2 137 59 74 0.991 8.36 5.11 Intr + 105112 105363 252 0 0 70 61 237 0.981 15.91 5.12 Intr + 106553 107932 1380 1 0 101 111 888 0.975 80.59 5.13 Intr + 110047 110106 60 0 0 80 99 40 0.703 2.31 5.14 Intr + 129652 129786 135 0 0 51 53 127 0.016 5.34 5.15 Intr + 150179 150276 98 1 2 63 83 121 0.281 6.99 5.16 Intr + 161498 161556 59 2 2 63 92 42 0.050 -0.29 5.17 Intr + 174799 174951 153 2 0 86 99 72 0.074 7.22 5.18 Intr + 179270 179389 120 0 0 74 92 46 0.866 3.15 5.19 Intr + 180762 180938 177 1 0 15 31 143 0.002 0.27 5.20 Intr + 184563 184727 165 1 0 28 115 126 0.002 8.31 5.21 Intr + 187984 188318 335 2 2 99 115 144 0.000 12.57 5.22 Intr + 200909 201150 242 1 2 99 98 170 0.549 14.53 5.23 Intr + 204429 204486 58 0 1 72 64 49 0.044 -1.13 5.24 Intr + 210618 210748 131 2 2 86 92 56 0.041 4.37 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 74020 75441 1422 0 0 66 38 851 0.891 67.85 S.002 Init + 174822 174951 130 2 1 74 99 83 0.861 8.36 S.003 Term + 180762 181114 353 1 2 15 42 243 0.910 6.06 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592f:47607246_47818439|GENSCAN_predicted_peptide_1|172_aa PSVYLSTPSSASKANTTAFLTPLEIKAKVETDDVKKNSLDELRAQIIELLCIVEALKKDH GKELEKLRKDLEEEKTMRSNLEVRFTDAKFKVSSSVPATDHIFYHCHPVKALFPDSGRDG LENLPQATCLLAAKEKGLVLPSPVESANQICRPTQSCGLEVSRPVQIVTKFS >gi568815592f:47607246_47818439|GENSCAN_predicted_CDS_1|519_bp ccatctgtgtacctttcaacaccttccagtgcttctaaagcaaatacaactgctttcctg actccattagaaatcaaagctaaagtggaaacagatgatgtgaaaaaaaattccctggat gaacttagagcccagattattgaattgttgtgcattgtagaagcactgaaaaaggatcac gggaaagaactggaaaaactgcgaaaagatttggaagaagagaagacaatgagaagtaat ctagaggttaggtttactgatgccaaattcaaggtctcttcttcagtgcctgcaacagat cacattttttaccattgccaccctgtcaaagccctgtttccagacagtgggcgagatggg cttgaaaacttgccccaggctacctgccttctagctgcgaaagaaaagggcttggttctt ccctcgcctgtggagtctgcaaaccagatttgccgccctacccagagttgtggcctggag gtctctcgccccgttcaaattgttacaaagtttagctag >gi568815592f:47607246_47818439|GENSCAN_predicted_peptide_2|173_aa MGDQATLPLKWSGRVPKTSLTKFQMSRLLVPVPSRCSATALILHGGLPGTALVALKMGKG GHELRIASSLKKLEKARKGIFPLELPLEALILAQSTLQDKMEEREVPPENQGQEVANNTC EKRFRTHCSNLGSKEPSRRQHPTEMTNKSKMVLLLLSDLFSQNSGPPFSLFPE >gi568815592f:47607246_47818439|GENSCAN_predicted_CDS_2|522_bp atgggtgatcaggccacgcttccactcaaatggagtgggcgagttccgaagaccagtctt accaagtttcagatgtccagactcctagtgccagttccttcccggtgttcagccactgcg ttgatcctccacgggggcctgccaggcactgctctggtggctttgaaaatggggaaaggg ggccatgagctaagaattgccagcagcctaaagaagctggaaaaggcaaggaaaggaatt ttccccctagagcttcctttggaagccttaattttagcccagagcacattgcaggacaaa atggaggaacgggaggtgccaccagagaatcaaggacaggaagtggcaaataacacatgt gagaaaaggttcaggactcactgcagcaacctggggtcaaaggagccctcaaggcgtcaa catccaacagaaatgacaaacaagagcaagatggttttgttgctgctgtctgacttgttc tcccagaactcagggcctcctttctccctcttccctgagtaa >gi568815592f:47607246_47818439|GENSCAN_predicted_peptide_3|407_aa MRKTSMDEASTVVASDKSHTWTSQLMGVTGAGLVVFFRVILQRLAKLLPSTYSSQSADVQ GWSQRGRLTPHTAGSHSETKLPEERSGSNICRSAIFAVLQPPLVIPRQTGSGVDLQQTPT DLQLRVLTVRRKTNKQKGHPQHNPISTSPSTKTKDHGAITLELRIKKLTQNCSTTRKLKN LLLNVYWVRNEMKAEIKMFFETNENKDTTYQNLWDTFKAGSRGKFIALNAHKRKQERSKI DTLTSQLKELEKQEQTHSKASRRQEITKIQAELKDIETQKTLQKISESRSWFFEKINKID RLLARLIKKKREKNQIDTIKNDKGDITTDPTEIQTTIREYYKHLYANKLEIREEMDEFLD TYTLPRLNQEEAESLNRSITGSEIEAIINSLPTKKSQGPDRFTSEFY >gi568815592f:47607246_47818439|GENSCAN_predicted_CDS_3|1224_bp atgagaaagacatccatggatgaagcctccacagtggtagcttctgacaagagtcatacg tggactagtcagcttatgggtgtgactggagcagggcttgtcgtcttcttcagagtcatt ttgcagaggttggcgaagctgctcccgtccacgtacagctcacagtctgctgatgttcaa ggatggtctcagaggggccgactgacacctcacacagccgggagccactctgagacaaag cttccagaggaacgatcaggcagcaacatttgccgttctgcaatatttgcagttctgcag cctcctctggtgatacccaggcaaacagggtctggagtggacctccagcaaactccaaca gacctgcagctgagggtcctgactgttagaaggaaaactaacaaacagaaaggacatcca caacacaaccccatctctacgtcaccatcaacaaagaccaaagaccatggtgcaatcaca ttagaactcaggattaagaaactcactcaaaactgctcaactacacggaagctgaaaaac ctgctcctgaatgtctactgggtccgtaacgaaatgaaggcagaaataaagatgttcttt gaaaccaatgagaacaaagacacaacataccagaatctctgggacacatttaaagcaggg agtagagggaaatttatagcactaaatgcccacaagagaaagcaggaaagatctaaaatt gacaccctaacatcacaattaaaagaactagagaagcaagagcaaacacattcaaaagct agcagaaggcaagaaataactaagatccaagcagaactgaaggacatagagacacaaaaa acccttcaaaaaatcagtgaatccaggagttggttttttgaaaagatcaacaaaatcgat agactgctagcaagactaataaagaagaaaagagagaagaatcaaatagacacaataaaa aatgataaaggggatatcaccaccgatcccacagaaatacaaactaccatcagagaatac tataaacacctctatgcaaataaactagaaattcgagaagaaatggatgaattcctggac acatacaccctcccaagactaaaccaggaagaagctgaatccctgaatagatcaataaca ggctctgaaattgaggcaataattaatagcctaccaaccaaaaaaagtcaaggaccagac agattcacatccgaattctactag >gi568815592f:47607246_47818439|GENSCAN_predicted_peptide_4|72_aa MLTLISGPNQQVLLSVLIGHAKCWSSSLARRRRRGGIGGGGKEEEEEEEEEEEEEEEEEE EEEERRKEEGED >gi568815592f:47607246_47818439|GENSCAN_predicted_CDS_4|219_bp atgttgactctgatatctggacccaatcaacaggtactcctgtctgtcctcattggacat gcaaagtgctggtcatcatctctagcaagaagaagaagaagaggaggaataggaggagga gggaaggaggaggaggaagaggaggaggaggaagaggaggaggaagaggaggaggaggaa gaggaggaagaaagaaggaaggaggaaggagaagattaa >gi568815592f:47607246_47818439|GENSCAN_predicted_peptide_5|1853_aa MAGVCDGVCTDYSQCTQPCPPDTQGNMGFSCRQKTWHKITDTCQTLNALNIFEEDSRLVQ PFEDNIKISVYTGKSETITDMLLQKCPTDLSCVIRNIQQSPWIPGNIAVIVQLLHNISTA IWTGVDEAKMQSYSTIANHILNSKSISNWTFIPDRNSSYILLHSVNSFARRLFIDKHPVD ISDVFIHTMGTTISGDNIGKNFTFSMRINDTSNEVTGRVLISRDELRKVPSPSQVISIAF PTIGAILEASLLENVTVNGLVLSAILPKELKRISLIFEKISKSEERRTQCVGWHSVENRW DQQACKMIQENSQQAVCKCRPSKLFTSFSILMSPHILESLILTYITYVGLGISICSLILC LSIEVLVWSQVTKTEITYLRHVCIVNIAATLLMADVWFIVASFLSGPITHHKGCVAATFF VHFFYLSVFFWMLAKALLILYGIMIVFHTLPKSVLVASLFSVGYGCPLAIAAITVAATEP GKGYLRPEICWLNWDMTKALLAFVIPALAIVVVNLITVTLVIVKTQRAAIGNSMFQEVRA IVRISKNIAILTPLLGLTWGFGVATVIDDRSLAFHIIFSLLNAFQGFFILVFGTILDPKV EGHSSKDSLKCSGIENICLQQASLIMLLLNYLVIPSKAQVNKPGYIPNLGECSHYRSKIH LKAGDKLQSPEGKPKTGRIQEKCEGPCISSSNCSQPCAKDFHGEIGFTCNQKKWQKSAET CTSLSVEKLFKDSTGASRLSVAAPSIPLHILDFRAPETIESVAQGIRKNCPFDYACITDM VKSSETTSGNIAFIVELLKNISTDLSDNVTREKMKSYSEVANHILDTAAISNWAFIPNKN ASSDLLQSVNLFARQLHIHNNSENIVNELFIQTKGFHINHNTSEKSLNFSMSMNNTTEDI LGMVQIPRQELRKLWPNASQAISIAFPTLGAILREAHLQNVSLPRQVNGLVLSVVLPERL QEIILTFEKINKTRNARAQCVGWHSKKRRWDEKACQMMLDIRNEVKCRCNYTSVVMSFSI LMSSKSMTDKVLDYITCIGLSVSILSLVLCLIIEATVWSRVVVTEISYMRHVCIVNIAVS LLTANVWFIIGSHFNIKAQDYNMCVAVTFFSHFFYLSLFFWMLFKALLIIYGILVIFRRM MKSRMMVIGFAIGYGCPLIIAVTTVAITEPEKGYMRPEACWLNWDNTKALLAFAIPAFVI VAVNLIVVLVVAVNTQRPSIGSSKSQDVVIIMRISKNVAILTPLLGLTWGFGIATLIEGT SLTFHIIFALLNAFQIRDALRMRMSSLKGKSRAAEFNSRYNCLSDVIKLIHILPYVCVFE VVFKEILLYPQPVKMFPQIETEREQEEELEMVNSIGADECGCRHWKKTPKDLRDTWADDC KSTLLPRAFAYKHPSWFENRMALNHTALPQDERLPHYLRDGDPFASKLSWEADLVAGFYL TIIGILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGISAHSRCVLTQSLDGLGYL GRLYSVWTCINIMNAYGECTRTLRRARHAWFCENKQDEPQGICKPFTIISCFCHRWVFGW IGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSYGVWLKRKHAYICLAAIWAYASFW TTMPLVGLGDYVPEPFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKII AKVKSSSKEVAHFDSRIHSSHVLEMKLTKVAMLICAGFLIAWIPYAVVSVWSAFGRPDSI PIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKATKKKSLEGFRLHTVTTVRKS SAVLEIHEEDKGRLWHQLSCGENSEMTASSETLGTKAHSPLQETREGENEAVG >gi568815592f:47607246_47818439|GENSCAN_predicted_CDS_5|5559_bp atggcaggtgtatgcgatggtgtctgtacagactactcccagtgtactcaaccttgccct ccagacactcagggaaatatggggttttcatgcaggcaaaagacatggcacaagatcact gacacctgccagactcttaatgccctcaacatctttgaggaggattcacgtttggttcag ccatttgaagacaatataaaaataagtgtatatactggaaagtctgagaccataacagat atgttgctacaaaagtgtcccacagatctgtcttgtgtaattagaaacattcagcagtct ccctggataccaggaaacattgccgtaattgtgcagctcttacacaacatatcaacagca atatggacaggtgttgatgaggcaaagatgcagagttacagcaccatagccaaccacatt cttaacagcaaaagcatctccaactggactttcattcctgacagaaacagcagctatatc ctgctacattcagtcaactcctttgcaagaaggctattcatagataaacatcctgttgac atatcagatgtcttcattcatactatgggcaccaccatatctggagataacattggaaaa aatttcactttttctatgagaattaatgataccagcaatgaagtcactgggagagtgttg atcagcagagatgaacttcggaaggtgccttccccttctcaggtcatcagcattgcattt ccaactattggggctattttggaagccagtcttttggaaaatgttactgtaaatgggctt gtcctgtctgccattttgcccaaggaacttaaaagaatctcactgatttttgaaaagatc agcaagtcagaggagaggaggacacagtgtgttggctggcactctgtggagaacagatgg gaccagcaggcctgcaaaatgattcaagaaaactcccagcaagctgtttgcaaatgtagg ccaagcaaattgtttacctctttctcaattcttatgtcacctcacatcttagagagtctg attctgacttacatcacatatgtaggcctgggcatttctatttgcagcctgatcctttgc ttgtccattgaggtcctagtctggagccaagtgacaaagacagagatcacctatttacgc catgtgtgcattgttaacattgcagccactttgctgatggcagatgtgtggttcattgtg gcttcctttcttagtggcccaataacacaccacaagggatgtgtggcagccacatttttt gttcatttcttttacctttctgtatttttctggatgcttgccaaggcactccttatcctc tatggaatcatgattgttttccataccttgcccaagtcagtcctggtggcatctctgttt tcagtgggctatggatgccctttggccattgctgccatcactgttgctgccactgaacct ggcaaaggctatctacgacctgagatctgctggctcaactgggacatgaccaaagccctc ctggccttcgtgatcccagctttggccatcgtggtagtaaacctgatcacagtcacactg gtgattgtcaagacccagcgagctgccattggcaattccatgttccaggaagtgagagcc attgtgagaatcagcaagaacatcgccatcctcacaccacttctgggactgacctgggga tttggagtagccactgtcatcgatgacagatccctggccttccacattatcttctccctg ctcaatgcattccagggtttcttcatcctagtgtttggaaccatcctggatccaaaggtt gagggtcattcatctaaagactccctgaagtgttctggcattgaaaatatttgtctccaa caagcatccctgataatgcttcttctgaattatcttgtcatcccatcaaaggcccaagtt aacaaaccaggctacatccctaacctaggagaatgttcccactatagatccaagattcac ctaaaagctggagataaacttcaaagccctgaagggaaacccaagactggaaggatccaa gagaaatgcgaaggaccttgtatttcttcttccaactgcagccagccctgtgctaaggac tttcatggagaaataggatttacatgtaatcaaaaaaagtggcaaaaatcagctgaaaca tgtacaagcctttctgtggaaaaactctttaaggactcaactggtgcatctcgcctttct gtagcagcaccatctatacctctgcatattctagactttcgagctccagagaccattgag agtgtagctcaaggaatccgtaagaactgcccctttgattatgcctgcatcactgacatg gtgaaatcatcagaaacaacatctggaaatattgcatttatagtggagttattaaaaaat atttctacagacttgtctgataatgttactcgagagaaaatgaagagctatagtgaagtg gccaaccacatcctcgacacagcagccatttcaaactgggctttcattcccaacaaaaat gccagctcggatttgttgcagtcagtgaatttgtttgccagacaactccacatccacaat aattctgagaacattgtgaatgaactcttcattcagacaaaagggtttcacatcaaccat aatacctcagagaaaagcctcaatttctccatgagcatgaacaataccacagaagatatc ttaggaatggtacagattcccaggcaagagctaaggaagctgtggccaaatgcatcccaa gccattagcatagctttcccaaccttgggggctatcctgagagaagcccacttgcaaaat gtgagtcttcccagacaggtaaatggtctggtgctatcagtggttttaccagaaaggttg caagaaatcatactcaccttcgaaaagatcaataaaacccgcaatgccagagcccagtgt gttggctggcactccaagaaaaggagatgggatgagaaagcgtgccaaatgatgttggat atcaggaacgaagtgaaatgccgctgtaactacaccagtgtggtgatgtctttttccatt ctcatgtcctccaaatcgatgaccgacaaagttctggactacatcacctgcattgggctc agcgtctcaatcctaagcttggttctttgcctgatcattgaagccacagtgtggtcccgg gtggttgtgacggagatatcatacatgcgtcacgtgtgcatcgtgaatatagcagtgtcc cttctgactgccaatgtgtggtttatcataggctctcactttaacattaaggcccaggac tacaacatgtgtgttgcagtgacatttttcagccactttttctacctctctctgtttttc tggatgctcttcaaagcattgctcatcatttatggaatattggtcattttccgtaggatg atgaagtcccgaatgatggtcattggctttgccattggctatgggtgcccattgatcatt gctgtcactacagttgctatcacagagccagagaaaggctacatgagacctgaggcctgt tggcttaactgggacaataccaaagcccttttagcatttgccatcccggcgttcgtcatt gtggctgtaaatctgattgtggttttggttgttgctgtcaacactcagaggccctctatt ggcagttccaagtctcaggatgtggtcataattatgaggatcagcaaaaatgttgccatc ctcactccactgctgggactgacctggggttttggaatagccactctcatagaaggcact tccttgacgttccatataatttttgccttgctcaatgctttccagataagagatgctttg aggatgaggatgtcttcactgaaggggaaatcgagggcagctgagtttaattctcgatac aactgcctaagtgatgtcatcaaattgatccacattttgccttatgtttgtgtttttgaa gttgtgtttaaagaaattcttctgtacccgcagcctgtaaagatgtttccccagattgag actgaaagagagcaggaagaggagttggaaatggtcaacagtattggagcagatgagtgt gggtgcagacactggaaaaagacaccaaaggacctgagggacacctgggcagatgattgt aagtcaacgctacttccccgggccttcgcatataaacatccctcgtggttcgagaacaga atggcgttaaatcacactgccctgcctcaggacgagcgcctgccccattaccttcgagat ggggatccttttgcttccaaactttcttgggaagcggatttagtggctggcttttaccta acaataattgggattctgtccacatttggaaatggatatgtcctttacatgtcttctaga cgaaagaagaagctgagacccgctgaaataatgactatcaatttagcagtctgtgatctg gggatttcagcacacagcagatgtgtgcttactcagagtttggatggtcttggttatctg ggaaggctttattcagtatggacctgcataaacattatgaacgcttatggtgaatgcaca cgcacattaagaagagcaagacatgcctggttttgtgaaaataaacaggacgaaccccag gggatttgcaagccgttcaccatcatctcttgcttttgtcaccgctgggtgtttggctgg atcggctgccgctggtatggatgggctggatttttctttggctgtggaagccttatcacc atgactgctgtcagcctggatcgatatttgaaaatctgctatttatcttatggggtttgg ctgaaaagaaagcacgcctacatctgcctggcagccatctgggcctatgcttccttctgg accaccatgcccttggtaggtctgggggactacgtacctgagcccttcggaacctcgtgc accctggactggtggctggcccaggcctcggtagggggccaggttttcatcctgaacatc ctcttcttctgcctcttgctcccaacggctgtgatcgtgttctcctacgtaaagatcatt gccaaggttaagtcctcttccaaagaagtagctcatttcgacagtcggatccatagcagc catgtgctggaaatgaaactgacaaaggtagcgatgttgatttgtgctggattcctgatt gcctggattccttatgcagtggtgtctgtgtggtcagcttttggaaggccagactccatt cccatacagctctctgtggtgccaaccctacttgcaaaatctgcagcgatgtacaatccc atcatttaccaagttattgattacaaatttgcctgttgccaaactggtggtttgaaagca accaagaagaagtctctggaaggcttcaggctgcacaccgtaaccacagtcaggaagtct tctgctgtgctggaaattcatgaagaggacaaagggagattatggcatcagctatcttgt ggggagaattctgagatgacagcctccagtgaaacactagggacaaaggcccattcacct cttcaagaaaccagggaaggggaaaatgaagctgttggn