GENSCAN 1.0    Date run: 4-Nov-116    Time: 16:50:36

Sequence gi568815593f:170005958_170208608 : 202651 bp : 45.57% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   1759   1790   32  2  2  123   88     0 0.698   1.35
 1.02 Intr +   2540   2640  101  1  2  129   47   104 0.963   9.31
 1.03 Intr +   2731   2789   59  1  2   75   97    56 0.936   3.73
 1.04 Intr +  13003  13151  149  2  2  131   93    71 0.568  11.85
 1.05 Intr +  20059  20313  255  0  0   80   15   121 0.559   1.64
 1.06 Intr +  21906  21991   86  2  2   72  101   146 0.916  12.92
 1.07 Intr +  28442  28598  157  2  1  111   78   300 0.982  31.31
 1.08 Intr +  30558  30598   41  2  2   95  116    55 0.973   6.02
 1.09 Intr +  35098  35188   91  1  1   77   87    41 0.973   2.90
 1.10 Intr +  36056  36175  120  1  0  119   84   216 0.999  24.99
 1.11 Intr +  39859  39948   90  0  0   79   95   188 0.999  18.79
 1.12 Intr +  41553  41657  105  2  0   92  100   111 0.994  13.11
 1.13 Intr +  44299  44438  140  0  2  105  -59   249 0.536  11.16
 1.14 Intr +  50727  50811   85  0  1   85   68   128 0.973  10.32
 1.15 Intr +  51623  51709   87  1  0   58  109   124 0.999  11.67
 1.16 Intr +  52529  52696  168  1  0   -1   96    91 0.623   1.14
 1.17 Intr +  61553  61729  177  1  0   69  100   317 0.780  31.12
 1.18 Intr +  63180  63263   84  2  0  134   94   113 0.999  16.52
 1.19 Intr +  69990  70127  138  2  0  117   56   173 0.996  17.66
 1.20 Intr +  71753  71880  128  1  2  107   78    90 0.999   9.38
 1.21 Intr +  73018  73189  172  1  1   72  100   203 0.999  19.85
 1.22 Intr +  74206  74326  121  0  1   65   85   126 0.999  10.07
 1.23 Intr +  75885  76027  143  1  2   87   77   138 0.905  12.67
 1.24 Intr +  76839  76897   59  2  2   32   50   120 0.936   0.28
 1.25 Intr +  78457  78607  151  1  1   60  113    82 0.940   8.06
 1.26 Intr +  79815  79878   64  2  1   59   89    52 0.618   0.69
 1.27 Intr +  85151  85259  109  0  1   52   78    67 0.252   1.44
 1.28 Term +  89742  89895  154  0  1   86   48    91 0.646   2.29
 1.29 PlyA +  93894  93899    6                               1.05

 2.00 Prom +  93912  93951   40                              -8.16
 2.01 Init + 100001 100574  574  1  1   84   90  1058 0.984 100.88
 2.02 Term + 102092 102654  563  0  2   75   43   420 0.991  30.74
 2.03 PlyA + 103753 103758    6                              -0.45

 3.00 Prom + 104357 104396   40                             -12.40
 3.01 Init + 104589 104667   79  2  1   79   95   231 0.887  24.12
 3.02 Intr + 112544 112616   73  0  1   78   80    34 0.022   0.06
 3.03 Intr + 118521 118590   70  0  1   73   94    38 0.024   2.08
 3.04 Term + 130331 130366   36  1  0  114   48     1 0.080  -3.96
 3.05 PlyA + 131102 131107    6                               1.05

 4.03 PlyA - 131430 131425    6                               1.05
 4.02 Term - 135992 135175  818  0  2   91   34  1199 0.983 108.10
 4.01 Init - 148111 147913  199  1  1   70   76    96 0.197   3.75
 4.00 Prom - 162372 162333   40                              -4.76

 5.00 Prom + 171856 171895   40                              -2.26
 5.01 Init + 182604 182735  132  2  0   84   83    70 0.877   6.18
 5.02 Intr + 183432 183545  114  2  0  105   84    48 0.936   6.74
 5.03 Intr + 186778 186815   38  0  2   60  119    11 0.377  -1.54
 5.04 Intr + 189915 190104  190  0  1   60  -10   148 0.542   1.79
 5.05 Intr + 190799 190917  119  1  2   81  100    63 0.907   6.06
 5.06 Intr + 191007 191025   19  0  1   84   98     5 0.457  -2.29
 5.07 Intr + 192847 192959  113  0  2   81   85    45 0.542   2.68
 5.08 Term + 194560 194665  106  1  1   89   48    50 0.524  -1.02
 5.09 PlyA + 196524 196529    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593f:170005958_170208608|GENSCAN_predicted_peptide_1|1088_aa
XWWSGLAPTDQLWNNYFHLAVAFITQDSLQLEQFSHAKYNKILNKYGDMRRLIGFSIRDM
WYKLGQNKICFIPGMVGPILEMTLIPEAELRKATIPIFFDMMLCEYQRSGDFKKARYISF
NKSSPTEATHRKFFGICAVSAQAFGPSLRSFVHKSEGTMASRMMCEPEGRANVVPAFSSP
KDSNSMELYPLCPGKSLPKFENEIILKLDHEVEGGRGDEQYMQLLESILMECAAEHPTIA
KSVENFVNLVKGLLEKLLDYRGVMTDESKDNRMSCTVNLLNFYKDNNREEMYIRYLYKLR
DLHLDCDNYTEAAYTLLLHTWLLKWSDEQCASQVMQTGQQHPQTHRQLKETLYETIIGYF
DKGKMWEEAISLCKELAEQYEMEIFDYELLSQNLIQQAKFYESIMKILRPKPDYFAVGYY
GQGFPSFLRNKVFIYRGKEYERREDFQMQLMTQFPNAEKMNTTSAPGDDVKNAPGHFYKS
NYVQRFHYSRPVRRGTVDPENEFASMWIERTSFVTAYKLPGILRWFEVVHMSQGSQRMLP
SILAVEERPGGNEEGVHADIEERDIPAEETASEKPWAQSMLVSRSEQVRTTISPLENAIE
TMSTANEKILMMINQYQSDETLPINPLSMLLNGIVDPAVMGGFAKYEKAFFTEEYVRDHP
EDQDKLTHLKDLIAWQIPFLGAGIKIHEKRVSDNLRPFHDRMEECFKNLKMKVEKEYGVR
EMPDFDDRRVGRPRSMLRSYRQMSIISLASMNSDCSTPSKPTSESFDLELASPKTPRVEQ
EEPISPGSTLPEVKLRRSKKRTKRSSVVFADEKAAAESDLKRLSRKHEFMSDTNLSEHAA
IPLKASVLSQMSFASQSMPTIPALALSVAGIPGLDEANTSPRLSQTFLQLSDGDKKTLTR
KKVNQFFKTMLASKSAEEGKQIPDSLSTDLLSLVASMSLLQAPFEDVLPSSASSFLNSLK
SPVAMIVGVWATTNTFGFPKLSDLSNGCDQQDPIWNPGLLPAVFISAILVSDLDKTGFHS
ELPTVMGLTAQVNKQMHKKHPEIPADINEDISFQKNLQFLQRCNQITVTFGKACDLSKPN
FLICEKGQ

>gi568815593f:170005958_170208608|GENSCAN_predicted_CDS_1|3267_bp
ngctggtggagtgggttagcaccaacagatcagctgtggaacaactattttcatctggca
gtggcttttatcacccaggattctctgcagctggagcagttctcacacgccaaatacaac
aaaatcctgaataagtatggggacatgagacggctaattggcttctccatccgtgatatg
tggtacaagcttggtcagaacaaaatctgcttcatcccaggcatggtaggacctatatta
gagatgacacttatccctgaggctgagctccggaaagccaccataccaatcttcttcgac
atgatgctgtgtgaatatcaaagaagtggggatttcaaaaaggcccggtatatctccttc
aataagtcttccccaactgaggcaactcaccggaaattctttggcatttgtgctgtatca
gcacaagcttttggtccctcccttagatcctttgtgcacaaaagtgagggaacgatggca
tcacggatgatgtgtgaaccagagggaagagcaaatgtggtccctgccttttcatcacct
aaagattccaactccatggagctctatcctttatgcccaggaaagagcctgccaaagttt
gaaaacgaaatcatcctgaagctggaccacgaggtagaagggggccgaggcgacgagcag
tacatgcagctcctggagtcaatcctgatggaatgtgctgcagagcacccaaccattgcc
aagtcggtggagaacttcgtgaacctggtcaaaggcctcctggagaagctgctggattac
cggggtgtgatgacagatgagagcaaagacaaccgcatgagctgcaccgtgaacctgctg
aatttctacaaagataacaacagggaggagatgtacataaggtacctgtacaaactccgc
gatcttcacctggactgtgacaattacacagaggctgcctacacgctccttctccacacc
tggcttctcaagtggtcggatgagcagtgtgcatcacaggtcatgcagacaggccagcag
cacccccagacacaccggcagctgaaggagacgctctacgagaccatcataggctacttt
gacaaaggaaagatgtgggaagaggccataagtctgtgcaaggagctggcggaacagtac
gagatggagatctttgactatgagctgctcagccagaacctgatccagcaggcaaaattc
tatgaaagcatcatgaaaatcctcaggcccaaaccagactactttgctgttggatactac
ggccagggattcccctccttcctgcggaacaaagtgttcatctaccgcgggaaggaatat
gagcgaagagaagatttccagatgcagctgatgacccagttccccaatgcagagaagatg
aacaccacctctgccccgggagatgatgtgaagaatgccccaggccacttctacaaatcc
aactacgtgcaaaggttccactactcccggcccgtgcgcagggggaccgtagacccagag
aatgagtttgcttccatgtggattgagagaacctccttcgtgactgcatacaagctgccg
gggatcctgcgctggtttgaggtggtgcacatgtcgcagggtagtcagagaatgcttccc
tctatactggctgttgaggagaggcctggaggaaatgaggaaggggtccatgctgacatt
gaagagagagacattccagcagaggagacagcaagtgaaaagccctgggcccagagtatg
cttgtgtccagatcagaacaagtgaggaccacaattagtcctctggagaatgccatagaa
accatgtccacggccaatgagaagatcctgatgatgataaaccagtaccagagtgatgag
accctccccatcaacccactctccatgctcctgaacgggattgtggaccctgctgtcatg
ggaggcttcgccaagtatgagaaggccttcttcactgaagagtatgtcagggaccaccct
gaggaccaggacaagctgacccacctcaaggacctgattgcatggcagatccccttcttg
ggagctgggattaagatccatgagaaaagggtgtcagataacttgcgacccttccatgac
cggatggaggaatgtttcaagaacctgaaaatgaaggtggagaaggagtacggtgtccga
gagatgcctgactttgacgacaggagagtgggccgtcccaggtctatgctgcgctcatac
agacagatgtccatcatctctctggcttccatgaattctgactgcagcacccccagcaag
cctacctcagagagctttgacctggaattagcatcacccaagacgccgagagtggagcag
gaggaaccgatctccccggggagcaccctgcctgaggtcaagctgcggaggtccaagaag
aggacaaagagaagcagcgtagtttttgcggatgagaaagcagctgcagagtcggacctg
aagcggctttccaggaagcatgagttcatgagtgacaccaacctctcggagcatgcggcc
atccccctcaaggcgtctgtcctctctcaaatgagctttgccagccagtccatgcctacc
atcccagccctggcgctctcagtggcaggcatccctgggttggatgaggccaacacatct
ccccgcctcagccagaccttcctccaactctcagatggtgacaagaagacactcacacgg
aagaaggtcaatcagttcttcaagacaatgctggccagcaaatcggctgaagaaggcaaa
cagatcccagactcgctgtccacggacctcctgtccctggtggcctctatgtccctgctg
caagccccatttgaagacgtgttgcctagttcagcttctagtttcctcaacagtctaaaa
agtcctgttgcgatgattgttggggtttgggctacaacaaacacctttgggttccctaag
ttaagtgatctctccaatggctgtgatcagcaggacccaatctggaatccaggtcttctt
ccagctgtgtttatctctgcaattcttgtgtctgatcttgacaaaacaggcttccactca
gaacttcctactgtaatgggtctaactgcccaggtcaacaaacagatgcataagaagcac
ccagaaatccctgcagacattaatgaggacatatcattccagaagaatctccagttcctg
cagagatgtaatcagataactgtgacctttggaaaagcatgtgacctctctaagcccaat
ttcctcatatgcgaaaaaggacaatga

>gi568815593f:170005958_170208608|GENSCAN_predicted_peptide_2|378_aa
MSSFDLPAPSPPRCSPQFPSIGQEPPEMNLYYENFFHPQGVPSPQRPSFEGGGEYGATPN
PYLWFNGPTMTPPPYLPGPNASPFLPQAYGVQRPLLPSVSGLGGSDLGWLPIPSQEELMK
LVRPPYSYSALIAMAIHGAPDKRLTLSQIYQYVADNFPFYNKSKAGWQNSIRHNLSLNDC
FKKVPRDEDDPGKGNYWTLDPNCEKMFDNGNFRRKRKRKSDVSSSTASLALEKTESSLPV
DSPKTTEPQDILDGASPGGTTSSPEKRPSPPPSGAPCLNSFLSSMTAYVSGGSPTSHPLV
TPGLSPEPSDKTGQNSLTFNSFSPLTNLSNHSGGGDWANPMPTNMLSYGGSVLSQFSPHF
YNSVNTSGVLYPREGTEV

>gi568815593f:170005958_170208608|GENSCAN_predicted_CDS_2|1137_bp
atgagctccttcgacctgccggcgccctccccacctcgctgcagcccccagttccccagc
atcggccaggagccccccgagatgaacctctactatgagaacttcttccacccacagggc
gtgcccagccctcagcggccctccttcgaggggggcggcgagtatggggccacccccaac
ccctacctctggttcaacgggcccaccatgaccccgccaccctacctgcccggccccaac
gccagccccttcctgccccaggcctatggagtgcagaggccgctgctgcccagcgtgtcg
gggcttggggggagcgacctgggctggctgcccatcccctcgcaggaggagctgatgaag
ctggtgcggccaccctattcctactcggctctcatcgccatggccatccacggggcaccc
gacaagcgcctcactctcagccagatctaccagtacgtggccgacaacttccccttctac
aacaagagcaaggccggctggcagaactccatccgccacaacctgtcgctcaacgactgc
ttcaagaaggtgccccgcgacgaggacgacccgggcaaagggaattactggaccctggac
cccaactgtgagaaaatgttcgacaatggaaatttccgcaggaaaaggaagagaaaatca
gatgtttcctctagcacagcctccttggccttagagaagacagagagcagtctcccggtg
gacagccccaagaccacggagcctcaggacatcttggatggagcctcaccagggggcacc
accagctccccagagaagcggccctcccctcccccatcaggcgccccttgccttaacagc
ttcctttcctctatgacagcctatgtgagcggggggagccccacgagccaccccttggtc
acaccaggactgagccctgagcccagtgacaagacggggcagaactcactgaccttcaac
tccttctccccgctcaccaacctcagcaaccacagcggtgggggtgactgggcgaacccc
atgcccaccaacatgctcagctatggaggatctgtgctcagccaattcagccctcacttc
tacaacagtgtcaacaccagtggtgtcctctaccccagggagggcaccgaggtctag

>gi568815593f:170005958_170208608|GENSCAN_predicted_peptide_3|85_aa
MAMIIDDDDDDDDDNDGDDDVDGDDDVRDQPNLFNNRGVPSSPWLEDVDNRAAERLAFWK
ELVDQHLWEMLLTEMRKCRLKDQAT

>gi568815593f:170005958_170208608|GENSCAN_predicted_CDS_3|258_bp
atggcaatgattattgatgatgatgatgatgatgatgatgataatgatggtgatgatgat
gttgatggtgatgatgatgtgagggaccaacccaacttgtttaacaacaggggtgtccca
agtagcccttggctggaggacgttgataatagggctgctgagagacttgcattctggaag
gagttggttgaccagcacctatgggagatgctgctgacagagatgaggaagtgcaggctg
aaagatcaagccacttaa

>gi568815593f:170005958_170208608|GENSCAN_predicted_peptide_4|338_aa
MALTRSSAVLGKAEVAAAPEFGIQGLFQLSFLTAFPVIPISHTLGNMSPSAKDSAPLLPA
FALLIPYSTRFTTRFTFSTNYQSLGSVQPPSYGVQLVSSAASVYAGAGGSGSRISMSRST
NFQGCLGSGGLAAGLAGGLERMGGIHKEKETMQSLNDCLASYLDRVRSQETENQKLENKI
WEHLEKKGPQVRDWGHYFKTIEELRTQIFANTVDNARIVLQINNARLAADDFRVKYETGL
AMRQSVESEIHGLCKVTDDTNVTRLQLETEIEALKEELLFMKKNHKEEVKGLQAQIASSG
LTMEVDAPKSQNLTKIMADIRAQYKELARKNQEELDKY

>gi568815593f:170005958_170208608|GENSCAN_predicted_CDS_4|1017_bp
atggcactgacacggtcctccgcagtcctggggaaggctgaggtagctgctgcgcctgaa
tttggcattcaaggccttttccagcttagttttctgactgcctttccagttatccccata
tcccacactcttgggaacatgtcaccttctgccaaagattctgccccgctcctccctgcc
tttgcgcttcttattccctacagcacgagattcaccactcgcttcaccttctccaccaac
taccagtccctaggctccgttcagcctcccagctatggtgtccagctggtcagcagcgca
gccagcgtctatgcaggcgctgggggctcgggctcccggatctctatgtcccgctccacc
aacttccaaggctgcttggggtctgggggcctggctgcggggctggccgggggtctggag
agaatgggaggcatccacaaggagaaggagaccatgcaaagcctgaacgactgcctggcc
tcctacctggacagagtgaggagccaggagaccgagaaccagaagctggaaaacaaaatc
tgggagcacctggagaagaagggaccccaggtcagagactggggccattacttcaagacc
atcgaggaactgaggactcagatctttgcaaatactgtggacaatgcccgcattgttctg
cagatcaacaatgcccgtcttgctgctgatgactttagagtcaagtatgagacagggctg
gccatgcgccagtctgtggagagcgaaatccatgggctctgcaaagtcactgatgacaca
aatgtcactcggctgcagctggagacagagatcgaagctctcaaggaggagctgctcttc
atgaagaaaaaccacaaagaggaagtaaaaggcctacaagcccagattgccagctctggg
ttgaccatggaggtagatgcccccaaatctcagaacctcaccaagatcatggcagacatc
cgggcccaatacaaggagctggctcggaagaaccaagaggagctggacaagtactag

>gi568815593f:170005958_170208608|GENSCAN_predicted_peptide_5|276_aa
MTWAVRAAFAKGLWLKGILSGPLKGAMRIEKKEGRVVEGDASPEIHRETTHYGLEDPTQV
PASLEAFSAYLTPLATTSYLPQRPTTLLGAVNVPWTPVMLRADGIDPDFAAMKSPQESPT
SGVLSLQTVIRDILVWETAALHPDFTAGETGPGEETHLVRKLKLKKATAHTGSEGSRKIG
NQLCSQCLCPLCSNTAFQSSYAVKFSAHHIGQRALARVEASADHFPDNLAGRDGTVCSTG
KSSLGQGMSRMFEKMVQKGAFGNPQGFCTQDGLEGS

>gi568815593f:170005958_170208608|GENSCAN_predicted_CDS_5|831_bp
atgacttgggccgtgagagcagcatttgctaagggcctgtggctgaaagggatcctgtca
ggaccattgaaaggagcaatgaggatagagaagaaggaggggagggtggtggaaggggat
gcctctccggagatccaccgggaaaccactcactatggccttgaagacccaactcaagta
cctgcttctttagaagccttttctgcctacctcaccccattagctactacctcatacctg
cctcagaggccgacaacactcctgggggcggtgaatgtgccctggacccctgtcatgcta
cgagcagatggcatcgatcctgactttgctgccatgaaatcccctcaagagagccccaca
tcaggggtcctgagtttgcagacagtcattagagacatccttgtttgggaaacagctgcc
ctccaccctgatttcacagctggggaaacagggcctggagaggagactcatctggtgaga
aaactaaagctcaagaaggccacggcacacacaggcagcgagggcagcaggaagattgga
aatcagctctgctcccaatgcctgtgcccactctgttccaacacagccttccaaagctca
tacgctgtgaagttttctgcccatcacattggccaaagagctctagccagagttgaagcc
agtgctgatcactttcctgacaacctggctggcagagatgggaccgtatgtagcacaggg
aagagctcacttggacagggaatgtcccgcatgtttgaaaagatggttcagaaaggagcc
tttggcaatccccagggtttctgcacccaagatggactggagggatcatga