GENSCAN 1.0 Date run: 3-Nov-116 Time: 02:56:56

Sequence gi568815578r:49805622_50008490 : 202869 bp : 47.55% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   7229   7327   99  2  0   87   64    97 0.913   6.43
 1.02 Intr +   9387   9568  182  0  2  101   36   163 0.936  11.91
 1.03 Intr +  17440  17520   81  2  0   98   97    15 0.801   3.01
 1.04 Intr +  24155  24370  216  0  0   53   39   150 0.595   5.08
 1.05 Intr +  24480  24710  231  1  0   26    7   205 0.582   3.94
 1.06 Intr +  25089  25165   77  1  2   70   38    69 0.507  -0.67
 1.07 Term +  25318  25449  132  0  0   16   52   179 0.452   5.19
 1.08 PlyA +  27818  27823    6                               1.05

 2.02 PlyA -  28292  28287    6                               1.05
 2.01 Sngl -  39937  39542  396  1  0   78   54   253 0.959  17.15
 2.00 Prom -  42623  42584   40                              -6.36

 3.00 Prom +  47016  47055   40                              -3.16
 3.01 Init +  49297  49520  224  0  2   78  -13   153 0.776   0.26
 3.02 Intr +  49817  49960  144  2  0   95  108   189 0.958  21.00
 3.03 Intr +  53879  53982  104  2  2  110   61   -12 0.216  -1.88
 3.04 Intr +  59107  59223  117  2  0   84  111    35 0.794   5.84
 3.05 Intr +  69084  69200  117  1  0  112   89   149 0.486  17.84
 3.06 Intr +  78225  78445  221  1  2   89   49   607 0.915  54.92
 3.07 Intr +  81131  81277  147  1  0   69   59   322 0.846  27.83
 3.08 Term +  82118  82315  198  1  0  113   48   245 0.760  20.40
 3.09 PlyA +  85796  85801    6                               1.05

 4.03 PlyA -  90085  90080    6                               1.05
 4.02 Term - 101224  99998 1227  1  0  126   41  1451 0.999 135.82
 4.01 Init - 102869 102534  336  2  0   33  100   473 0.379  40.28
 4.00 Prom - 104351 104312   40                              -7.76

 5.00 Prom + 104652 104691   40                              -3.16
 5.01 Init + 106090 106231  142  0  1   68   77    52 0.510   2.40
 5.02 Term + 109865 110091  227  0  2   59   43   208 0.718  10.34
 5.03 PlyA + 110564 110569    6                               1.05

 6.00 Prom + 113019 113058   40                              -6.76
 6.01 Init + 115080 115153   74  2  2   63   68    70 0.282   1.05
 6.02 Intr + 123104 123194   91  2  1  117  117     9 0.771   6.70
 6.03 Intr + 130657 130914  258  0  0   69   49   214 0.014  13.26
 6.04 Intr + 135962 136090  129  1  0   84   48   128 0.501   9.29
 6.05 Intr + 139776 139867   92  2  2   63   94    85 0.528   5.39
 6.06 Intr + 140515 140629  115  1  1   92   89    56 0.998   6.55
 6.07 Intr + 143627 143734  108  1  0   46  103   125 0.994  10.28
 6.08 Intr + 146455 146490   36  0  0  100   41    87 0.011   3.66
 6.09 Intr + 151182 151616  435  2  0   74  -56   582 0.002  36.18
 6.10 Term + 151944 152411  468  2  0  -32   46   422 0.014  20.97
 6.11 PlyA + 152451 152456    6                               1.05

 7.00 Prom + 158030 158069   40                              -3.76
 7.01 Init + 165132 165266  135  2  0   84   93    53 0.477   5.68
 7.02 Intr + 175897 175979   83  0  2   94   77    62 0.878   4.14
 7.03 Intr + 177107 177265  159  2  0   57   46   151 0.924   6.90
 7.04 Intr + 177462 177520   59  0  2  102   96   127 0.740  13.43
 7.05 Intr + 178203 178730  528  1  0  137   92   512 0.910  49.41
 7.06 Term + 182251 182435  185  2  2  123   46   228 0.999  19.81
 7.07 PlyA + 183237 183242    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 146455 146520   66  0  0  100   53   167 0.955  12.34
S.002 Sngl + 152001 152411  411  2  0   74   46   362 0.976  26.89

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815578r:49805622_50008490|GENSCAN_predicted_peptide_1|339_aa
XGSRKQKRVLLAPRLRTRWSWKLRRMGEKMAEEERFPNTTHEGFNVTLHTTLVVTTKLVL
PTPGKPILPVQTGEQAQQEEQSSGMTIFFSLLVLAICIILVHLLIRYRLHFLPESVAVVS
LVRREAGQTQVMRSANSLSEGKVKKIMGDGKKGVSSGSSETGSKSPLKRIQEQSPQKRGQ
PPKNEKNVTIVESAKQAISVCYQAITKKLKICEEETGSTSIRAADSTAVNGSITDKKMGF
GGLGLMRSGIVSNLLKMGHTVTVWNRTAKKREVGNAAKMMLIVNIVQGSFIAMITEKDLC
LAIALGNAVNSLTPMAAAANQVYKRAKALDQSNNMSAVS

>gi568815578r:49805622_50008490|GENSCAN_predicted_CDS_1|1020_bp
nncggaagccggaagcaaaagcgggtcctgctagccccgcggctccgaactcggtggtcc
tggaagctccgcaggatgggggagaagatggcggaagaggagaggttccccaatacaact
catgagggtttcaatgtcaccctccacaccaccctggttgtcacgacgaaactggtgctc
ccgacccctggcaagcccatcctccccgtgcagacaggggagcaggcccagcaagaggag
cagtccagcggcatgaccattttcttcagcctccttgtcctagctatctgcatcatattg
gtgcatttactgatccgatacagattacatttcttgccagagagtgttgctgttgtttct
ttagtgaggagagaagcaggccaaactcaggtgatgagaagtgcaaacagcctgtctgaa
ggaaaagtgaagaagatcatgggagatggaaagaagggggtgtcttcgggctcttcagag
acaggctccaaatcccctctgaaaagaatccaagagcaaagtccccagaagcggggtcag
cccccaaagaatgagaagaatgtcaccatcgtggagtccgccaaacaagccatctctgtc
tgttaccaggcaatcacaaagaagttgaaaatatgtgaagaggaaactggttccacctcc
atccgggcagctgacagcacggccgtgaatggcagcatcacagacaaaaagatgggattt
gggggccttggtctcatgagaagtggaatcgtctctaacttgctaaaaatgggtcacaca
gtgactgtctggaaccgcactgccaagaaacgtgaagttggcaacgcagccaagatgatg
ctgattgtaaacatagtccaagggagcttcatagccatgatcactgagaaggatctctgc
ttagccattgcgctgggcaatgcggtcaactctctgactcccatggcagctgcagccaac
caggtgtacaaaagagccaaggcactggaccagtccaacaatatgtccgctgtgtcctga

>gi568815578r:49805622_50008490|GENSCAN_predicted_peptide_2|131_aa
MDGPEKVRKAPVGELKETKVLGHMSPSSQDTVYPLYIYQSVSRKTSQCIICEPLNITSAI
FLGKAPIYGKHWLSEPYCLKPLKQPIHHVCSSPEVRHCSPASTEHNQEGAELLLLLLLNY
TQAFKELALQT

>gi568815578r:49805622_50008490|GENSCAN_predicted_CDS_2|396_bp
atggatgggcctgagaaggttagaaaggccccagtgggggagctgaaagaaacaaaagtc
ctggggcacatgtccccgtcgtcccaggacacagtctatcctctttacatctatcagtct
gtgtccaggaagacgagtcaatgcatcatctgtgagccacttaacatcacttcagccatc
ttcctgggcaaagcacccatttatggcaaacactggctcagtgagccatactgcctgaaa
ccattaaagcagccaattcatcatgtctgctcctctccagaagtcaggcactgcagtcca
gcttccactgagcacaaccaggaaggagcagagctgctgctgctgctgctgctaaattat
acccaagcttttaaggaacttgccttacaaacatga

>gi568815578r:49805622_50008490|GENSCAN_predicted_peptide_3|423_aa
MASVFLSAVYATHPQSTLGLLLLFDTFLDLANLSAGWGRSRTAFQGAGGEGQSRGQMQEL
MGHLALAALTRGTGRFAFGSLISAVDPVATIAIFNALHVDPVLNMLVFGESILNDAVSIV
LTNTEEKYAPGSCCHLSLVIRIIPGFEEVYFINDHLQIVIYVLKHIDLRKTPSLEFGMMI
IFAYLPYGLAEGISLSGIMAILFSGIVMSHYTHHNLSPVTQILMQQTLRTVAFLCGLRGA
IPYALSLHLDLEPMEKRQLIGTTTIVIVLFTILLLGGSTMPLIRLMDIEDAKAHRRNKKD
VNLSKTEKMGNTVESEHLSELTEEEYEAHYIRRQDLKGFVWLDAKYLNPFFTRRLTQETP
HTKHPVASPPFHGPAPDALTAWLCLSTQDLHHGRIQMKTLTNKWYEEVRQGPSGSEDDEQ
ELL

>gi568815578r:49805622_50008490|GENSCAN_predicted_CDS_3|1272_bp
atggcctcagtgtttctctcggctgtctacgccactcacccccaaagcacactggggttg
ctgctgctttttgacactttcttagatcttgcaaatctgagtgcaggctggggtcggtca
cggacagcattccaaggggctggtggcgaggggcagagcagaggtcagatgcaggagctc
atgggccatttggctctagctgctctgacccgtggaactgggcgttttgcgtttggctcc
ctaatatctgctgtcgatccagtggccactattgccattttcaatgcacttcatgtggac
cccgtgctcaacatgctggtctttggagaaagtattctcaacgatgcagtctccattgtt
ctgaccaatacagaagaaaaatatgctcctgggagctgttgtcacctttccttggttatc
agaatcatacctggttttgaagaggtatacttcataaacgatcatctccaaattgtcatt
tacgtgctgaagcatattgacttgaggaaaacgccttccttggagtttggcatgatgatc
atttttgcttatctgccttatgggcttgcagaaggaatctcactctcaggcatcatggcc
atccttttctcaggcatcgtgatgtcccactacacgcaccataacctctccccagtcacc
cagatcctcatgcagcagaccctccgcaccgtggccttcttatgtggcctgcggggagcc
atcccctatgccctgagcctacacctggacctggagcccatggagaagcggcagctcatc
ggcaccaccaccatcgtcatcgtgctcttcaccatcctgctgctgggcggcagcaccatg
cccctcattcgcctcatggacatcgaggacgccaaggcacaccgcaggaacaagaaggac
gtcaacctcagcaagactgagaagatgggcaacactgtggagtcggagcacctgtcggag
ctcacggaggaggagtacgaggcccactacatcaggcggcaggaccttaagggcttcgtg
tggctggacgccaagtacctgaaccccttcttcactcggaggctgacgcaggagacaccc
cacacaaaacacccagtagcatcccctcccttccatggccctgcccctgacgccctgacg
gcttggttgtgtctctcgacccaggacctgcaccacgggcgcatccagatgaaaactctc
accaacaagtggtacgaggaggtacgccagggcccctccggctccgaggacgacgagcag
gagctgctctga

>gi568815578r:49805622_50008490|GENSCAN_predicted_peptide_4|520_aa
MGKPSSMDTKFKDDLFRKYVQFHESKVDTTTSRQRPGSDECLRVAASTLLSLHKVDPFYR
FRLIQFYEVVESSLRSLSSSSLRALHGAFSMLETVGINLFLYPWKKEFRSIKTYTGPFVY
YVKSTLLEEDIRAILSCMGYTPELGTAYKLRELVETLQVKMVSFELFLAKVECEQMLEIH
SQVKDKGYSELDIVSERKSSAEDVRGCSDALRRRAEGREHLTASMSRVALQKSASERAAK
DYYKPRVTKPSRSVDAYDSYWESRKPPLKASLSLRKEPVATDVGDDLKDEIIRPSPSLLT
MASSPHGSPDVLPPASPSNGPALLRGTYFSTQDDVDLYTDSEPRATYRRQDALRPDVWLL
RNDAHSLYHKRSPPAKESALSKCQSCGLSCSSSLCQRCDSLLTCPPASKPSAFPSKASTH
DSLAHGASLREKYPGQTQGLDRLPHLHSKSKPSTTPTSRCGFCNRPGATNTCTQCSKVSC
DACLSAYHYDPCYKKSELHKFMPNNQLNYKSTQLSHLVYR

>gi568815578r:49805622_50008490|GENSCAN_predicted_CDS_4|1563_bp
atggggaagcccagttcaatggatactaaattcaaggatgacttatttcggaagtacgtg
cagttccatgagagcaaagtggataccaccaccagcaggcagcggcctggcagcgatgag
tgcctgcgggtggcagcctcaaccctgctcagcctgcacaaggtggatcccttttatcga
ttccggctgatccagttctatgaggtggtggagagctccttgcgctcgctcagctcctct
agcctgcgggctctgcacggcgccttcagcatgctggagacggtgggcatcaacctcttc
ctctacccgtggaagaaggaattcagaagcatcaagacctacacgggcccttttgtttat
tatgtcaagtcgacattactggaagaggacatccgagccatcctgagctgcatgggctac
acacctgagctgggcactgcatacaagctcagagagctcgtggagaccctccaggtgaag
atggtctcctttgagctctttctggccaaagtcgagtgtgagcagatgctagaaatccac
tcacaagtgaaggacaagggctactccgagctggacattgtgagcgagcgcaagagcagt
gcagaggatgtgcgcggctgctcggacgccctgcggcggcgggcagagggccgggagcac
ctgacggcctccatgtcacgagtggcactccagaagtcggccagcgagcgggcggccaag
gactactacaagccccgcgtgaccaagccctcgaggtcagtggatgcctatgacagctac
tgggagagccggaagccacccctgaaggcctcattgagtcttcggaaggagcctgtggca
acggatgtgggggacgacctcaaggatgagatcatccgcccatccccttcgctgctgacc
atggccagctccccccacggcagcccggatgtgcttccacccgcctcccccagcaacggc
ccggccctgctgcgcggtacctacttctccactcaggatgacgtggatctgtacacagac
tctgaacccagggccacctaccgtcggcaggatgctctgcggccggatgtgtggctgctc
agaaacgatgcccactccctctaccacaagcgctcgccccctgccaaagagtccgccctc
tccaagtgccaaagctgcgggctgtcctgcagctcctccctctgccagcgctgtgacagc
ctgctcacctgtcctccagcttccaagcccagcgccttccccagcaaggcctcgactcat
gacagcctggcccacggggcatctctgcgggagaagtacccaggccagactcagggcctc
gaccgcctcccgcaccttcactccaaatccaagccctccaccacgcccacttcccgctgt
ggcttctgcaaccgcccaggcgccaccaacacctgcacccagtgttcaaaagtctcatgt
gacgcctgcctcagcgcttaccattatgacccctgctacaaaaagagtgagctgcacaag
ttcatgcccaacaaccagctgaactacaagtccacccagctctcccatctcgtgtacaga
tag

>gi568815578r:49805622_50008490|GENSCAN_predicted_peptide_5|122_aa
MDAAKDVCIHDAITLVSHWYLPRILGDQLESKETGSERWHEVRQVAQGVLRPVPGGQQRP
GNTRPGNDVTMRRARREGGPRLRAGSGKRSMPAADGDYNPEAEDKAEGRRARTKPAEPHF
GA

>gi568815578r:49805622_50008490|GENSCAN_predicted_CDS_5|369_bp
atggatgctgcaaaggatgtctgcattcacgatgccatcactttagtttctcattggtac
ctcccaagaatcctgggagatcagctggagagcaaagaaacgggctcagaaagatggcat
gaggtacgacaagtagctcaaggggtactgagaccagtacccgggggccagcagcgaccc
ggaaacactcggcccggaaatgatgtcaccatgaggcgggcccgaagagagggtggacca
cggctgcgcgctggctccgggaagcggtcgatgcccgcggccgacggagactacaaccca
gaggcggaggacaaagcggaaggccgaagagcgaggacgaaaccggcggaaccgcacttt
ggagcctaa

>gi568815578r:49805622_50008490|GENSCAN_predicted_peptide_6|601_aa
MVITGVWLRLCLALGARSAPCTKDGQCTVSHDGQRQEILRELERIKEPVFSSQGPLACVR
GAPRGGPPASPAPNRSPPREPRPLGLLLIGRRCAAQSGSKMAAQQRDCGGAAQLAGPAAE
ADPLGRFTCPVCLEVYEKPVQECLKPKKPVCGVCRSALAPGVRAVELERQIESTETSCHG
CRKNIRSHVATCSKYQNYIMEGVKATIKDASLQPRNVPNRYTFPCPYCPEKNFDQEGLVE
HCKLFHSTDTKSVVCPICASMPWGDPNYRSANFREHIQRRHRFSYDTFVDYDVDEEDMMN
QAPSYGARPVSSMVSVYAGARGSGSRISESHSTSFWGGMGSGDLAGGMAGDLAGMGGIQN
EKETMQSLNDHLASYLDRMRSLETKNWKPESKIREHLEKKGPQVRDWSHHFKTIEDLRAQ
IFTNTVDNACIVLQINNACLAADDFTIEENTTEVTTQSTEVGTAEMTHRTETCSPVLGDR
PGLHEKSEGQLGEQPEGGGGPLFPADGTAQWDTAVPGVRAGTDPGRGTVPGPGVGSPAEH
KVKLEAEITTYCRLLEDSEDFNLGDALDSRNSMQTIQKTTTRQTVDGKVVSETNDTKVLR
H

>gi568815578r:49805622_50008490|GENSCAN_predicted_CDS_6|1806_bp
atggttatcacaggagtgtggctgaggctgtgtttggccctgggtgccaggtctgcaccc
tgcactaaggatgggcaatgtacagtaagccatgatgggcagaggcaggaaatacttaga
gaactggaaagaataaaggagccggtgttctcttcgcagggcccgctcgcttgcgtcaga
ggggccccgaggggcggcccacccgctagccccgcccccaaccgctcaccgccccgcgag
ccccgccccctcggcctcctcctcatcggccgccgttgcgcggcgcagagcggcagcaag
atggcggcgcaacagcgggactgcgggggtgctgcgcagctggcggggccggcggcggag
gctgaccccctaggacgcttcacgtgtcccgtgtgcttagaggtgtacgagaagccggta
caggaatgtctgaagccgaagaagcctgtctgtggggtgtgtcgcagcgctctggcacct
ggcgtccgagccgtggagctcgagcggcagatcgagagcacagagacttcttgccatggc
tgccgtaagaatatccggtcccacgtggctacttgttccaaataccagaattacatcatg
gaaggtgtgaaggccaccattaaggatgcatctcttcagccaaggaatgttccaaaccgt
tacacctttccttgtccttactgtcctgagaagaactttgatcaggaaggacttgtggaa
cactgcaaattattccatagcacggataccaaatctgtggtttgtccgatatgtgcctcg
atgccctggggagaccccaactaccgcagcgccaacttcagagagcacatccagcgccgg
caccggttttcttatgacacttttgtggattatgatgttgatgaagaggacatgatgaat
caggcgcccagctatggcgcccggccggtcagcagtatggttagcgtctatgcaggtgcc
cggggctctggttcccggatctccgagtcccactccaccagcttctggggcggcatgggg
tccggggacctggccggggggatggctggggatctggcaggaatgggaggcatccagaac
gagaaggagaccatgcaaagcctgaacgaccatctggcctcctacctggacagaatgagg
agcctggagaccaagaactggaagccggagagcaaaatccgggagcacctggagaagaag
ggaccccaggtcagagactggagccatcacttcaaaaccatcgaggacctgagggctcag
atcttcacaaatactgtggacaatgcctgcattgttctgcagatcaacaatgcctgtctt
gctgctgatgactttacaattgaggagaacactacagaagtcaccacgcagtccaccgag
gttggaactgctgagatgactcacagaactgagacatgcagtccagtccttggagatcga
cctggactccatgagaaatctgaaggccagcttggagaacagcctgagggaggtggaggc
ccgctatttcctgcagatggaacagctcagtgggatactgctgtacctggagtcagagct
ggcacagacccaggcagagggacagtgccaggcccaggagtaggaagccctgctgaacat
aaggtcaagctggaggctgagatcaccacctactgccgcctgctggaagacagcgaggac
ttcaatcttggtgatgccctggacagccgcaactccatgcaaaccatccaaaagaccacc
acccgccagacagtggatggcaaagtggtgtctgagaccaacgacaccaaagttctgaga
cattaa

>gi568815578r:49805622_50008490|GENSCAN_predicted_peptide_7|382_aa
MERAGLYQCSLMGKGPRPETVCEQVCVYEPLSPPTLQVPGDSLGKLEPGEDDFVHGCHTR
HQVTKQTVVLPFSPSTGDDPRCASEPRLGGVPARALTATRREPGQQPAHLLGEWPSAETS
LRLARRKPSDPNRKPNYSELQDSNPEFTFQQPYDQAHLLAAIPPPEILNPTASLPMLIWD
SVLAPQAQPIAWASLRLQESPRVAELTSLSDEDSGKGSQPPSPPSPAPSSFSSTSVSSLE
AEAYAAFPGLGQVPKQLAQLSEAKDLQARKAFNCKYCNKEYLSLGALKMHIRSHTLPCVC
GTCGKAFSRPWLLQGHVRTHTGEKPFSCPHCSRAFADRSNLRAHLQTHSDVKKYQCQACA
RTFSRMSLLHKHQESGCSGCPR

>gi568815578r:49805622_50008490|GENSCAN_predicted_CDS_7|1149_bp
atggagagagcagggctctatcagtgcagtctgatgggtaaggggcctaggcccgagaca
gtctgcgagcaagtgtgcgtgtatgagcccctgagcccgcccaccctgcaagtgcctgga
gactcactggggaagctagaaccaggggaggacgattttgttcacggctgtcacacccgg
caccaagtgactaaacagacagtagttctgcccttcagccccagcaccggggacgacccg
cgctgcgccagcgaaccccgcctcggaggagtccccgcccgggctctcaccgccacgcgg
cgcgagcccggccagcagccggcgcacctgctcggggagtggccttcggcggagacgagc
ctccgattggcgcggaggaagccctccgaccccaatcggaagcctaactacagcgagctg
caggactctaatccagagtttaccttccagcagccctacgaccaggcccacctgctggca
gccatcccacctccggagatcctcaaccccaccgcctcgctgccaatgctcatctgggac
tctgtcctggcgccccaagcccagccaattgcctgggcctcccttcggctccaggagagt
cccagggtggcagagctgacctccctgtcagatgaggacagtgggaaaggctcccagccc
cccagcccaccctcaccggctccttcgtccttctcctctacttcagtctcttccttggag
gccgaggcctatgctgccttcccaggcttgggccaagtgcccaagcagctggcccagctc
tctgaggccaaggatctccaggctcgaaaggccttcaactgcaaatactgcaacaaggaa
tacctcagcctgggtgccctcaagatgcacatccgaagccacacgctgccctgcgtctgc
ggaacctgcgggaaggccttctctaggccctggctgctacaaggccatgtccggacccac
actggcgagaagcccttctcctgtccccactgcagccgtgccttcgctgaccgctccaac
ctgcgggcccacctccagacccactcagatgtcaagaagtaccagtgccaggcgtgtgct
cggaccttctcccgaatgtccctgctccacaagcaccaagagtccggctgctcaggatgt
ccccgctga
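
The "Predicted genes/exons" table near the top of this report uses a fixed column order, so its rows can be read programmatically. The following is a minimal Python sketch and is not part of GENSCAN. The names `Feature`, `parse_row`, and `span_matches_length` are hypothetical, chosen only for illustration. It assumes whitespace-separated columns in the order given by the table header (Gn.Ex, Type, S, Begin, End, Len, then Fr through Tscr), and that Prom and PlyA rows carry only the Tscr value after Len, as they do in this report. Rows of the suboptimal exon list (S.001, S.002) are deliberately not matched.

```python
import re
from typing import NamedTuple, Optional


class Feature(NamedTuple):
    """One row of the exon table; field names are illustrative."""
    gene: int
    exon: int
    kind: str               # Init, Intr, Term, Sngl, Prom or PlyA
    strand: str             # "+" or "-"
    begin: int
    end: int
    length: int
    prob: Optional[float]   # P column; None for Prom and PlyA rows
    score: float            # Tscr column


# Gn.Ex, Type, S, Begin, End, Len, then the remaining columns as one group.
ROW = re.compile(
    r"^\s*(\d+)\.(\d+)\s+(Init|Intr|Term|Sngl|Prom|PlyA)\s+([+-])"
    r"\s+(\d+)\s+(\d+)\s+(\d+)\s+(.+)$"
)


def parse_row(line: str) -> Optional[Feature]:
    """Parse one table row; return None for headers, rules, and other text."""
    m = ROW.match(line)
    if m is None:
        return None
    gene, exon, kind, strand, begin, end, length, rest = m.groups()
    tail = rest.split()
    # Exon rows have seven trailing columns (Fr Ph I/Ac Do/T CodRg P Tscr);
    # Prom and PlyA rows have only Tscr.
    prob = float(tail[-2]) if len(tail) == 7 else None
    return Feature(int(gene), int(exon), kind, strand,
                   int(begin), int(end), int(length), prob, float(tail[-1]))


def span_matches_length(f: Feature) -> bool:
    """Check that Len equals the inclusive span between Begin and End."""
    return abs(f.end - f.begin) + 1 == f.length
```

As a usage example, `parse_row("2.01 Sngl - 39937 39542 396 1 0 78 54 253 0.959 17.15")` returns a minus-strand feature whose Begin is greater than its End. `span_matches_length` still holds for it because 39937 - 39542 + 1 = 396.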