GENSCAN 1.0 Date run: 8-Nov-116 Time: 12:32:58 Sequence gi568815585r:102946136_103166249 : 220114 bp : 39.27% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 5681 6004 324 1 0 29 49 213 0.455 5.28 1.02 PlyA + 6095 6100 6 1.05 2.08 PlyA - 6733 6728 6 1.05 2.07 Term - 7370 7205 166 1 1 75 33 192 0.771 8.81 2.06 Intr - 10593 10357 237 2 0 45 65 124 0.454 1.61 2.05 Intr - 11158 10974 185 1 2 7 93 165 0.271 6.56 2.04 Intr - 12874 12771 104 2 2 109 84 -37 0.125 -3.03 2.03 Intr - 13699 13613 87 2 0 92 76 30 0.181 1.22 2.02 Intr - 15709 15554 156 2 0 91 12 84 0.109 0.16 2.01 Init - 17757 17610 148 0 1 71 99 99 0.991 9.60 2.00 Prom - 21902 21863 40 -5.85 3.06 PlyA - 23005 23000 6 1.05 3.05 Term - 24058 23913 146 2 2 80 42 147 0.904 6.39 3.04 Intr - 27389 27353 37 2 1 84 92 49 0.952 1.82 3.03 Intr - 28979 28689 291 2 0 65 93 159 0.805 10.51 3.02 Intr - 52730 52672 59 0 2 93 99 7 0.588 -0.02 3.01 Init - 53935 53734 202 1 1 70 48 128 0.537 6.09 3.00 Prom - 66543 66504 40 -5.35 4.03 PlyA - 67074 67069 6 1.05 4.02 Term - 69367 69275 93 1 0 100 44 74 0.662 1.05 4.01 Init - 71008 70880 129 1 0 73 75 126 0.702 10.00 4.00 Prom - 78781 78742 40 -3.75 5.02 PlyA - 78999 78994 6 1.05 5.01 Sngl - 81828 81322 507 0 0 68 44 252 0.955 14.72 5.00 Prom - 84605 84566 40 -5.55 6.07 PlyA - 84651 84646 6 1.05 6.06 Term - 100125 99998 128 1 2 103 42 86 0.944 2.96 6.05 Intr - 103311 103154 158 2 2 79 67 195 0.998 15.23 6.04 Intr - 105297 105122 176 0 2 110 80 51 0.640 4.32 6.03 Intr - 106573 106485 89 2 2 127 106 35 0.998 7.97 6.02 Intr - 112247 112129 119 1 2 68 97 58 0.003 3.89 6.01 Init - 120114 119738 377 0 2 69 94 338 0.364 28.95 6.00 Prom - 126744 126705 40 -6.05 7.00 Prom + 126855 126894 40 -5.35 7.01 Init + 134315 134429 115 1 1 85 110 19 0.694 4.22 7.02 Term + 150235 150368 134 2 2 103 32 128 0.543 5.97 7.03 PlyA + 150377 150382 6 1.05 8.03 PlyA - 153088 153083 6 1.05 8.02 Term - 166363 166114 250 0 1 78 45 144 0.152 3.29 8.01 Init - 173261 172318 944 2 2 86 53 222 0.170 11.69 8.00 Prom - 199647 199608 40 -3.05 9.04 PlyA - 200682 200677 6 1.05 9.03 Term - 210698 210567 132 2 0 44 45 156 0.777 4.01 9.02 Intr - 211068 210900 169 2 1 71 105 75 0.995 6.53 9.01 Intr - 211194 211125 70 1 1 63 74 94 0.435 2.82 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 36613 36694 82 0 1 42 30 172 0.985 7.98 S.002 Term + 41516 41736 221 0 2 87 44 142 0.955 5.92 S.003 Init - 112240 112129 112 1 1 92 97 124 0.961 12.14 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815585r:102946136_103166249|GENSCAN_predicted_peptide_1|107_aa LPKEKGERNGISRLSLKGSDGFSNSRRKSSFLIEAAVPQQRCGATGLSGLQLLKAGRDHT NQSAGSHTAVSRKAMTTNHRIRDETGPGRQLQHRVADRKTIDHAASA >gi568815585r:102946136_103166249|GENSCAN_predicted_CDS_1|324_bp ttacctaaggagaaaggagagaggaatggaataagcaggctttcactaaaaggctcagat ggctttagtaattcaaggaggaaatcttcctttcttatagaagcagcagtgccgcagcag cgttgcggggcaacaggactcagtggcctgcaactcttaaaggcggggagagaccacact aaccagagtgctggcagccacacagcagtgtcaaggaaggcgatgacaaccaaccatcgt atcagggatgaaacagggcctggcaggcagctgcaacaccgagtagcggataggaagact atagatcatgcagcatctgcttag >gi568815585r:102946136_103166249|GENSCAN_predicted_peptide_2|360_aa MKETKHHNNPVSLIEVNGLRHLHDPKGAQTSKPVSLMSDVQLMSESSKGGFVGVSNPVTN VCRDQEADWREMCMASSDPNRCHKNMLEEIRLVLGSASHLLGLQLIIKMFTADRILLQHR TILELADTEIAHIYTMSIKVKFQALILRKLTSESSILLVFSVSMEKLIYDADVMLLIHPL RMPTEAAPTASAVHGINLTQELVRNAGPWASSQTYWGKKLQFNKIPSLIDRNRLKLTLRW VLEGATALAILPEKRQSESISGRILCLDFTANHISPDLRRDSAESKGPRGESGIINPFIG DVIGFLTNNQGPQDMTLRVLLDVVLIRFTQIIHHEVLLVELPGFSFITTTDSLIDIGPVT >gi568815585r:102946136_103166249|GENSCAN_predicted_CDS_2|1083_bp atgaaagaaacaaaacaccacaacaatcctgtgtccttgattgaggtgaatgggctgaga cacctccacgaccccaagggtgcacaaacatcaaaacctgtcagtttaatgtctgatgtt cagctaatgtctgagtcatcaaaaggaggatttgtgggagtatccaaccctgtaacaaac gtatgcagggaccaggaagccgactggagagaaatgtgcatggcctcatctgatccaaat cggtgtcataagaacatgttggaagaaatcagattggtcctgggaagtgccagtcatctt ttagggctgcagctcattataaaaatgttcacagcagatagaatcttgcttcagcaccgc actattttagagctggcagacacagaaatcgctcatatttacacaatgagcattaaggtc aagtttcaagccctaatcctcagaaaactgacatcagaatcatcaatactgttagtattt agtgtgtccatggagaaacttatctacgatgctgatgtgatgctgctgatccatcctttg aggatgcctacagaagctgctcctactgcaagtgcggttcatggcatcaacctcacccag gagcttgttagaaatgcaggaccttgggcctcctcccagacttactggggtaagaagctg cagtttaacaagatccccagtttgatagataggaatagattaaaactgactctcagatgg gttctggaaggagccacagccctggccatcctccctgagaagagacagagtgagtccatc agcggcaggatactctgccttgacttcacagctaatcatatttccccagatctgagaaga gactcagcagaatctaagggccccagaggagaatcaggtatcattaacccttttattgga gatgttattggttttctcaccaacaaccagggccctcaggacatgaccctgagagtcctc cttgacgtggttctcatccgcttcactcaaataattcaccatgaagtcctactggtagaa ctacctggcttcagtttcatcaccaccactgactcgctgattgacattggaccagtcact taa >gi568815585r:102946136_103166249|GENSCAN_predicted_peptide_3|244_aa MNPGGHSSAQYRSYLWKGIHVVCVGCLTIHACWQLKALENKPPFVMNCIVQLAAAAQGYS RLDTRTRYASCSKAGLKSLSIHEIGFLIKRKSARSSKETYSIKDPNSTCENQETKESQIH TPVTFYKGMFGSSDLRLTQPDSDQAAFANPVLEGRGVWEKQLLGCIGDTIVSPAFGDRVS LFSTVSDCPECDDSAQVSFKARLAQTYTHLQACQGLRVHMDVLNDKPVPVLALTNTERFG KSNA >gi568815585r:102946136_103166249|GENSCAN_predicted_CDS_3|735_bp atgaatcctgggggacactcttcagcccagtacagaagctacctctggaaaggtatccac gtggtatgtgtgggctgcttgaccatacacgcatgttggcaactcaaagctctagaaaat aagcctccctttgtcatgaactgtattgttcagcttgctgcagctgcccagggctacagt cgtctggacactagaacaagatatgcttcatgtagtaaagctggactcaagagcctcagc atacatgagataggatttttgataaagagaaaatcagccaggagctccaaagagacctac tctatcaaagatccaaattccacgtgtgagaaccaggaaacgaaagagtctcagatccac actcctgtaacattttataagggcatgtttggcagctctgatctgaggctcacgcagcct gactcagatcaagctgcatttgcgaatccagtgctggagggaaggggagtttgggagaag caattactgggttgcatcggagacaccattgtatctcctgcattcggtgacagagtctca cttttctccactgtcagtgattgccctgagtgtgatgacagtgctcaagtgtctttcaaa gcacgtttggctcaaacatatacacatttgcaagcttgccaaggccttcgagtacacatg gatgttctaaatgataaaccagtgccagtattagctttaactaacactgaacgctttggg aaaagtaatgcatga >gi568815585r:102946136_103166249|GENSCAN_predicted_peptide_4|73_aa MEKKKEKERKKELQGGRGRRRGGGWGKGRERGGRGKKRRGGGETSALLVIGPWDSDWDFH HQAPWFSGLHTQT >gi568815585r:102946136_103166249|GENSCAN_predicted_CDS_4|222_bp atggagaagaagaaagagaaggaaaggaagaaagagctacagggaggaagaggaagacga agaggaggaggatgggggaaaggaagagaaagaggaggaagaggaaaaaagagaagagga ggaggagagacatcggcactcctggttattgggccttgggactcagactgggactttcac catcaggccccctggttctcaggtcttcacactcagacttag >gi568815585r:102946136_103166249|GENSCAN_predicted_peptide_5|168_aa MPCGHLWTQALGLLSARWVPAVSPTTSAPIAPCFNRPSWSLGGLGRQTSSHGLRIQVPLS RSRTQDHSYGLRCQAHLSGPRYQANPGTWPVLADSGSEPKPPEQPLWTLASQIQAASISS LWTQPTGGSSGSKPQAQPHGPAPRRTSRELQQQARPWIMPNSLPKIFE >gi568815585r:102946136_103166249|GENSCAN_predicted_CDS_5|507_bp atgccctgtggacacctgtggacacaagctctgggcttgcttagtgccagatgggtccct gcagtctcacccaccacgtcggcccctatagccccatgcttcaacagacccagctggagc ctgggaggccttggtagacagaccagttcccatggactgagaatccaggtacccctcagt agatccaggacccaggaccattcctatggactcaggtgccaggcccacctcagtggaccc aggtaccaagccaatcctggcacctggccagtccttgcagactcaggctcagaacccaaa ccaccagaacagcctctatggaccctggcttcacagatacaggctgcaagcatatcctct ctgtggacccaaccaacaggtgggtccagtggatccaagcctcaggctcaacctcatggc ccagcaccaaggcgaacttccagagaactccagcagcaagcccgtccatggatcatgcca aatagcctgcccaaaatctttgagtag >gi568815585r:102946136_103166249|GENSCAN_predicted_peptide_6|348_aa MNDPNSCVDNATVCSGASCVVPESNFNNILSVVLSTVLTILLALVMFSMGCNVEIKKFLG HIKRPWGICVGFLCQFGIMPLTGFILSVAFDILPLQAVVVLIIGCCPGGTASNILAYWVD GDMDLSVSMTTCSTLLALGMMPLCLLIYTKMWVDSGSIVIPYDNIGTSLVSLVVPVSIGM FVNHKWPQKAKIILKIGSIAGAILIVLIAVVGGILYQSAWIIAPKLWIIGTIFPVAGYSL GFLLARIAGLPWYRCRTVAFETGMQNTQLCSTIVQLSFTPEELNVVFTFPLIYSIFQLAF AAIFLGFYVAYKKCHGKNKAEIPESKENGTEPESSFYKANGGFQPDEK >gi568815585r:102946136_103166249|GENSCAN_predicted_CDS_6|1047_bp atgaatgatccgaacagctgtgtggacaatgcaacagtttgctctggtgcatcctgtgtg gtacctgagagcaatttcaataacatcctaagtgtggtcctaagtacggtgctgaccatc ctgttggccttggtgatgttctccatgggatgcaacgtggaaatcaagaaatttctaggg cacataaagcggccgtggggcatttgtgttggcttcctctgtcagtttggaatcatgccc ctcacaggattcatcctgtcggtggcctttgacatcctcccgctccaggccgtagtggtg ctcattataggatgctgccctggaggaactgcctccaatatcttggcctattgggtcgat ggcgacatggacctgagcgtcagcatgaccacatgctccacactgcttgccctcggaatg atgccgctgtgcctccttatctataccaaaatgtgggtcgactctgggagcatcgtaatt ccctatgataacataggtacatctctggtttctctcgttgttcctgtttccattggaatg tttgttaatcacaaatggccccaaaaagcaaagatcatacttaaaattgggtccatcgcg ggcgccatcctcattgtgctcatagctgtggttggaggaatattgtaccaaagcgcctgg atcattgctcccaaactgtggattataggaacaatatttcctgtggcgggttactccctg gggtttcttctggctagaattgctggtctaccctggtacaggtgccgaacggttgctttt gaaacggggatgcagaacacgcagctatgttccaccatcgttcagctctccttcactcct gaggagctcaatgtcgtattcaccttcccgctcatctacagcattttccagctcgccttt gccgcaatattcttaggattttatgtggcatacaagaaatgtcatggaaaaaacaaggca gaaattccagagagcaaagaaaatggaacggagccagagtcatcgttttataaggcaaat ggaggatttcaacctgacgaaaagtag >gi568815585r:102946136_103166249|GENSCAN_predicted_peptide_7|82_aa MEQSLSHLAYWKRQFLTLVKEKQTKPNPVVSFLITPVKGSFTDNGRSEQVAVAGQPAFPA LTPWHSRWSVPRIILLCLRSNP >gi568815585r:102946136_103166249|GENSCAN_predicted_CDS_7|249_bp atggagcaaagcttgtcacatttagcttattggaaaaggcagtttcttactttagtcaaa gagaaacaaaccaaacccaatcctgtggttagctttctaataactccagtgaagggatcc ttcactgacaatggccgctcggagcaagtggcagttgcaggccagcccgccttccctgcc ctcactccctggcacagtcgctggagtgtccctcggatcatacttctgtgtctgcggtcc aacccttaa >gi568815585r:102946136_103166249|GENSCAN_predicted_peptide_8|397_aa MATLPKVIYRFSAIPIKLPMTFFTELEKTTLKFIQNQKRALIAKSILSQKNKAGGITLPD FKLYYKATVTKTAWYWYQNRGIDQWNRTEPSEIMPHIYNCLIFDKPDKNKKWGNDSLFNK WCWENWLAICRKLKLDPFLTPYTKINSRRIKDLHVRPKTIKTLEENLGNTIQDIGMGKDF MSKTPKAMSTKAEIDKWDLIKLKSFCTAKETTIRVNRQPTEWEKIFATYLSDKGLISRIY NELQQIYKKKTNNPIKKWAKDMNRHFSKEDIYAAKKHMKKCSSSLAIRGMQIKTTMRYHL TPVRMAIIKKSGNNSVVPSNYNTERVRNSVWLNEIENTKQEWLKQEVIYFSLKLTKYNGS TVIRKPGHSQHLTAALWIKMAALALTAGPMESDRRRD >gi568815585r:102946136_103166249|GENSCAN_predicted_CDS_8|1194_bp atggccacactgcccaaggtaatttatagattcagtgccatccccatcaagctaccaatg actttcttcacagaattggaaaaaactactttaaagttcatacagaaccaaaaaagagcc ctcatcgccaagtcaatcctaagccaaaagaacaaagctggaggcatcacactacctgac ttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacaga ggtatagaccaatggaacagaacagagccctcagaaataatgccgcatatctacaactgt ctgatctttgacaaacctgacaaaaacaagaaatggggaaacgattccctatttaataaa tggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttaca ccttatacaaaaattaattcaagacggattaaagacttacatgttagacctaaaaccata aaaaccctagaagaaaacctaggcaataccattcaggacataggcatgggcaaggacttc atgtctaaaactccaaaagcaatgtcaacaaaagccgaaattgacaaatgggatctaatt aaactaaagagcttctgcacagcaaaagaaaccaccatcagagtgaacaggcaacctaca gaatgggagaaaatttttgcaacctacttatctgacaaagggctaatatccagaatctac aatgaactccaacaaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggcaaag gatatgaacagacacttctcaaaagaagacatttatgcagccaaaaaacacatgaaaaaa tgctcatcatcactggccatcagaggaatgcaaatcaaaaccacaatgagataccatctc acaccagttagaatggcaatcattaaaaagtcaggaaacaacagcgtggtcccatccaat tataatacagaacgggtaaggaattcagtttggcttaatgagatagaaaacacaaaacaa gagtggctgaagcaagaagtgatttatttctctctcaaattgactaagtataatggctcc acagtcattaggaaaccaggccattctcagcacttgacagcagctttgtggatcaagatg gcagccttggctctaacagcaggtcccatggagtcggatagaagaagggactga >gi568815585r:102946136_103166249|GENSCAN_predicted_peptide_9|123_aa XQKIPERFATDPSKQVKAGSDTALEFHTEVTTLTRVVNAHIHPFTILDSVERDGDRAQQS CITAFKVTCFIYVSLSPKELWVTAVADSAGKAGAGWTEPPEDAAPGFGGREPLEQMILKN EVT >gi568815585r:102946136_103166249|GENSCAN_predicted_CDS_9|372_bp nctcaaaaaattcctgaacgttttgcaacagatccttccaagcaggtaaaagctggctct gacacagcactggagttccatacagaggttacaacactaaccagagtggtgaacgctcac atccatcccttcaccatactggacagcgtagagagggatggagacagagctcagcagagc tgcatcacagccttcaaagttacctgctttatatatgtgtctctctcccccaaagagctg tgggtcacagcagtagcagatagtgcaggcaaagctggggcagggtggactgagccacca gaagacgcagcacctggctttggaggaagagagcctttagaacagatgatcctcaaaaat gaggtcacctga