GENSCAN 1.0 Date run: 8-Nov-116 Time: 12:44:42 Sequence gi568815588f:69032897_69271266 : 238370 bp : 44.16% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 9904 9922 19 0 1 99 94 20 0.306 3.93 1.02 Term + 14867 14907 41 0 2 128 43 38 0.162 0.65 1.03 PlyA + 16023 16028 6 1.05 2.05 PlyA - 20503 20498 6 1.05 2.04 Term - 29095 28822 274 0 1 32 40 217 0.805 6.24 2.03 Intr - 29279 29246 34 0 1 86 88 46 0.499 1.68 2.02 Intr - 36789 36636 154 0 1 58 55 87 0.325 2.05 2.01 Init - 38808 38767 42 0 0 57 64 52 0.358 0.12 2.00 Prom - 41484 41445 40 -3.16 3.00 Prom + 50727 50766 40 -7.46 3.01 Init + 55265 55340 76 1 1 80 116 130 0.801 14.25 3.02 Intr + 64188 64335 148 1 1 53 94 60 0.302 2.39 3.03 Intr + 70975 71148 174 1 0 72 -12 162 0.024 3.65 3.04 Intr + 77831 77975 145 2 1 -3 74 114 0.013 1.18 3.05 Intr + 79545 79652 108 2 0 75 32 90 0.415 2.58 3.06 Intr + 81768 81897 130 2 1 74 39 84 0.593 2.37 3.07 Intr + 81936 82073 138 1 0 -18 18 220 0.474 4.84 3.08 Term + 82458 82663 206 1 2 31 39 169 0.555 3.83 3.09 PlyA + 82786 82791 6 1.05 4.00 Prom + 93097 93136 40 -1.96 4.01 Init + 100074 100151 78 2 0 69 94 71 0.297 6.86 4.02 Intr + 122916 122991 76 2 1 73 97 5 0.317 -1.01 4.03 Intr + 124111 124267 157 2 1 54 64 177 0.996 10.97 4.04 Intr + 125151 125315 165 0 0 117 106 105 0.893 14.28 4.05 Intr + 129510 129616 107 0 2 84 111 16 0.544 3.36 4.06 Intr + 133146 133214 69 1 0 55 87 84 0.528 4.15 4.07 Intr + 135593 135735 143 0 2 78 80 34 0.195 1.67 4.08 Term + 138260 138373 114 1 0 86 43 22 0.166 -3.93 4.09 PlyA + 138806 138811 6 1.05 5.00 Prom + 144703 144742 40 -3.96 5.01 Init + 147396 147666 271 2 1 57 70 218 0.221 12.03 5.02 Intr + 153091 153168 78 2 0 77 91 75 0.261 6.22 5.03 Intr + 166208 166301 94 0 1 90 78 78 0.954 6.02 5.04 Intr + 167384 167603 220 2 1 76 90 40 0.723 1.30 5.05 Intr + 169543 169623 81 0 0 61 87 63 0.906 3.33 5.06 Intr + 169971 170147 177 2 0 84 115 24 0.855 4.82 5.07 Intr + 174897 175045 149 2 2 109 76 119 0.997 11.83 5.08 Term + 175704 176139 436 0 1 87 29 242 0.999 12.96 5.09 PlyA + 176179 176184 6 1.05 6.00 Prom + 184341 184380 40 -6.66 6.01 Init + 187540 187602 63 0 0 52 116 113 0.934 11.65 6.02 Intr + 194311 194473 163 0 1 99 117 222 0.994 25.75 6.03 Intr + 199868 200016 149 0 2 60 67 208 0.969 15.85 6.04 Intr + 200118 200237 120 2 0 105 94 3 0.812 3.29 6.05 Intr + 206146 206241 96 0 0 52 100 25 0.263 0.31 6.06 Intr + 207756 207855 100 2 1 98 18 274 0.817 21.08 6.07 Intr + 210286 210469 184 2 1 80 74 301 0.856 26.75 6.08 Intr + 213183 213338 156 0 0 57 74 158 0.976 10.43 6.09 Intr + 214464 214697 234 0 0 67 70 344 0.983 27.30 6.10 Intr + 215528 215832 305 2 2 78 80 567 0.990 51.13 6.11 Intr + 217394 217539 146 0 2 88 107 235 0.913 25.40 6.12 Intr + 217637 217756 120 1 0 108 116 107 0.997 16.19 6.13 Intr + 224140 224235 96 0 0 115 24 98 0.987 6.31 6.14 Intr + 224431 224530 100 0 1 111 96 152 0.999 17.98 6.15 Intr + 225880 226063 184 2 1 61 57 251 0.966 18.15 6.16 Intr + 226167 226231 65 0 2 44 103 84 0.976 3.86 6.17 Intr + 228209 228398 190 0 1 109 55 286 0.890 26.04 6.18 Intr + 232689 232922 234 0 0 98 70 239 0.895 19.90 6.19 Term + 233714 233861 148 2 1 98 49 141 0.890 8.57 6.20 PlyA + 234629 234634 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 23204 23245 42 1 0 77 98 47 0.855 5.02 S.002 Term + 70975 71224 250 1 1 72 42 170 0.959 5.98 S.003 Init + 77910 77975 66 2 0 63 74 67 0.810 3.87 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588f:69032897_69271266|GENSCAN_predicted_peptide_1|19_aa MATPNASDSQDQPDPGTTL >gi568815588f:69032897_69271266|GENSCAN_predicted_CDS_1|60_bp atggccacacccaatgctagtgattctcaggaccagccagatcctgggaccaccttgtaa >gi568815588f:69032897_69271266|GENSCAN_predicted_peptide_2|167_aa MRENVLISTFPSPQAGETGIKATHCLGPMSPGCPNPFATLGHLRPVLVRGCQHVTSSRPG LSDLQVLKVFGKKNKWRRSSRPSDLVGISSDLQKLLVWWAPDTWCFCCLGAAEDPEPCLS STLTEAAKYRKYTRSKGKHALQRRPLITIVYVGSNGTEFAARVFFLS >gi568815588f:69032897_69271266|GENSCAN_predicted_CDS_2|504_bp atgcgagagaacgtgctgatcagtacttttccatctccacaggcaggggagacaggaatc aaggccactcactgcctgggcccaatgtcccctggatgtcccaacccttttgctaccctg ggccatctacgaccagttctggtgcgaggatgccagcatgtaacctccagtaggccaggg ctgtcggatctccaggtgctaaaagtttttggcaagaaaaacaagtggaggaggagcagc cgcccgtcagacctggtggggatcagcagtgacctgcagaagttgctggtgtggtgggca ccggatacctggtgtttctgctgcctcggtgctgctgaggacccagagccctgcctgagc agcacgctcacagaggcagcgaagtacaggaagtacacaaggtcaaaaggaaaacacgct ctccagagacgtcctctgatcactatcgtgtacgttgggtcgaatggaacagaatttgct gctagagtgtttttcctctcctaa >gi568815588f:69032897_69271266|GENSCAN_predicted_peptide_3|374_aa MQKLLKCSRLVLALALILVLESSVQGYPTRRARYQWVRCNPDSNSANCLEEKGPMFELLP GESNKIPRLRTDLFPKTRIQDLNRIFPLSEDYSGSGFGSGSGSGSGSGSGFLTEMEQDYQ LVDESDAFHDNLRKIGRLGTKQLRVKACGWTNERRQSVKMYVSNVNAHQKTSITEEALNN QERLADGCKFLDAESQFIQGHAPFSWTAHSQRLIYVGTTEIMLRQKQIQAIFLFEFKTGL KAAETTRNINNPFGAGIANKHESLEDEKRSGRPWEFDNDQLRAIIEADPLTTTQEVAEKL NIDHSTAGIGPQKGPVLLHKNAQVQVVEPMLQKLNELGYEVLPHPPYLPDFLPTDYHSFK HLGNFLQGKRFHNQ >gi568815588f:69032897_69271266|GENSCAN_predicted_CDS_3|1125_bp atgcagaagctactcaaatgcagtcggcttgtcctggctcttgccctcatcctggttctg gaatcctcagttcaaggttatcctacgcggagagccaggtaccaatgggtgcgctgcaat ccagacagtaattctgcaaactgccttgaagaaaaaggaccaatgttcgaactacttcca ggtgaatccaacaagatcccccgtctgaggactgacctttttccaaagacgagaatccag gacttgaatcgtatcttcccactttctgaggactactctggatcaggcttcggctccggc tccggctctggatcaggatctgggagtggcttcctaacggaaatggaacaggattaccaa ctagtagacgaaagtgatgctttccatgacaaccttagaaagattggaagattgggcacg aagcagctcagggtaaaggcatgtggatggaccaatgagagaagacaaagtgtgaagatg tacgtatcaaatgtgaatgcccaccagaaaacatccatcacagaagaggcactaaacaac caggaaaggttggctgacggctgcaagttcttagatgcagagagccagttcatccaaggt catgctcccttctcatggacagctcacagtcaaagactcatctatgtggggactacggaa ataatgttaagacaaaagcaaattcaagctattttcttattcgagttcaaaacaggtctt aaagcagcagagacaactcgcaacatcaacaacccatttggcgcaggaattgctaacaaa catgagagccttgaagatgagaagcgtagtggccggccatgggaatttgacaatgaccaa ttgagagcaatcattgaagctgatcctcttacaactacacaagaagttgctgaaaaactc aacatcgaccattctacagccggcataggtccacagaaaggcccagttctgctccataag aacgcccaagtgcaggtcgtagaaccaatgcttcaaaagttgaatgaattgggctacgaa gttttgcctcatccgccatatttacctgacttcttgccaactgactaccactccttcaag catctcggcaactttttgcagggaaaacgcttccacaaccagtag >gi568815588f:69032897_69271266|GENSCAN_predicted_peptide_4|302_aa MAEMKTEDGKVEKHYLFYDGESVSGKVNLAFKQPGKRLEHQGIRIEFVGQIELFNDKSNT HEFVNLVKELALPGELTQSRSYDFEFMQVEKPYESYIGANVRLRYFLKVTIVRRLTDLVK EYDLIVHQLATYPDVNNSIKMEVGIEDCLHIEFEYNKSKYHLKDVIVGKIYFLLVRIKIQ HMELQLIKKEITGIGPSTTTETETIAKYEIMDGAPVKGESIPIRLFLAGYDPTPTMRDVN KKFSVRYFLNLVLVDEEDRRYFKQQEIILWRKAPEKLRKQRTNFHQRFESPESQASAEQP EM >gi568815588f:69032897_69271266|GENSCAN_predicted_CDS_4|909_bp atggcagaaatgaaaactgaagatggcaaagtagaaaaacactatctcttctatgacgga gaatccgtttcaggaaaggtaaacctagcctttaagcaacctggaaagaggctagaacac caaggaattagaattgaatttgtaggtcaaattgaacttttcaatgacaagagtaatact catgaatttgtaaacctagtgaaagaactagccttacctggagaactgactcagagcaga agttatgattttgaatttatgcaagttgaaaagccatatgaatcttacatcggtgccaat gtccgcttgaggtattttcttaaagtgacaatagtgagaagactgacagatttggtaaaa gagtatgatcttattgttcaccagcttgccacctatcctgatgttaacaactctattaag atggaagtgggcattgaagattgtctacatatagaatttgaatataataaatcaaagtat catttaaaggatgtgattgttggaaaaatttacttcttattagtaagaataaaaatacaa catatggagttacagctgatcaaaaaagagatcacaggaattggacccagtaccacaaca gaaacagaaacaatcgccaaatatgaaataatggatggtgcaccagtaaaaggtgaatca attccaataaggctatttttagcaggatatgacccaactccaacaatgagagatgtgaac aaaaaattttcagtaaggtactttttgaatttagtgcttgttgatgaggaagaccggagg tacttcaaacagcaggagataattttatggagaaaagctcctgaaaaactgaggaaacag agaacaaactttcaccagcgatttgaatctccagaatcacaggcatctgccgaacagcct gaaatgtga >gi568815588f:69032897_69271266|GENSCAN_predicted_peptide_5|501_aa MSFSRALLWARLPAGRQAGHRAAICSALRPHFGPFPGVLGQVSVLATASSSASGGSKIPN TSLFVPLTVKPQGPSADGDVGAELTRPLDKNEVKKVLDKFYKRKEIQKLGADYGLDGTKL AQAKKFNDPNDPCKILVATDAIGMGLNLSIRRIIFYSLIKPSINEKGERELEPITTSQAL QIAGRAGRFSSRFKEGEVTTMNHEDLSLLKEILKRPVDPIRAAGLHPTAEQIEMFAYHLP DATLSNLIDIFVDFSQVDGQYFVCNMDDFKFSAELIQHIPLSLRVRYVFCTAPINKKQPF VCSSLLQFARQYSRNEPLTFAWLRRYIKWPLLPPKNIKDLMDLEAVHDVLDLYLWLSYRF MDMFPDASLIRDLQKELDGIIQDGVHNITKLIKMSETHKLLNLEGFPSGSQSRLSGTLKS QARRTRGTKALGSKATEPPSPDAGELSLASRLVQQGLLTPDMLKQLEKEWMTQQTEHNKE KTESGTHPKGTRRKKKEPDSD >gi568815588f:69032897_69271266|GENSCAN_predicted_CDS_5|1506_bp atgtccttctcccgtgccctattgtgggctcggctcccggcggggcgccaggctggccac cgggcagccatctgctctgcccttcgtccccactttgggccctttcccggggttctgggg caagtttctgtccttgccaccgcctcctcctctgcctccggtggctccaaaataccaaac acgtccttgttcgtgcccctgactgtgaaacctcagggccccagcgccgacggcgacgtc ggggccgagctaacccggcctctggacaagaatgaagtaaagaaggtcttagacaaattt tacaagaggaaagaaattcagaaactgggtgctgattatggacttgatgggaccaaactt gctcaagcaaaaaagtttaatgatcccaatgacccatgcaaaatcttggttgctacagat gcaattggcatgggacttaatttgagcataaggagaattattttttactcccttataaag cccagtatcaatgaaaagggagagagagaactagaaccaatcacaacctctcaagccctg cagattgctggcagagctggcagattcagctcacggtttaaagaaggagaggttacaaca atgaatcatgaagatctcagtttattaaaggaaattttgaagaggcctgtggatcctata agggcagctggtcttcatccaactgctgagcagattgaaatgtttgcctaccatctccct gatgcaacactgtccaatctcattgatatttttgtagacttttcacaagttgatgggcag tattttgtctgcaatatggatgattttaaattttctgcagagttgatccagcatattcca ctaagtctgcgagtgaggtatgttttctgcacagctcctatcaacaagaagcagcctttt gtgtgttcttcactgttacagtttgccaggcagtatagcaggaatgagcccctgaccttt gcatggttacgccgatacatcaaatggcctttacttccacctaagaatattaaagacctc atggatcttgaagctgtccacgatgtcttggatctttacttgtggctaagctaccgattt atggatatgtttccagatgccagccttattcgagatctccagaaagaactagatggtatt atccaagatggtgtgcacaatatcactaaattgattaaaatgtctgagacgcataagctg ttgaatttggagggctttccatcagggagccagtcacgattgtcaggaaccttaaagagc caagctagaaggacacgcggcaccaaagctctagggagtaaagctactgagccacccagc cccgatgcaggagagctgtcccttgcttccagattggtgcagcaaggactcctcactcca gacatgctgaaacagctagaaaaagagtggatgacacaacaaactgaacacaacaaagaa aaaacagagtctgggactcatccaaaagggacgagaagaaagaagaaggaacctgattcg gactag >gi568815588f:69032897_69271266|GENSCAN_predicted_peptide_6|950_aa MFAVHLMAFYFSKLKEDQIKKVDRFLYHMRLSDDTLLDIMRRFRAEMEKGLAKDTNPTAA VKMLPTFVRAIPDGSENGEFLSLDLGGSKFRVLKVQVAEEGKRHVQMESQFYPTPNEIIR GNGTELFEYVADCLADFMKTKDLKHKKLPLGLTFSFPCRQTKLEEGVLLSWTKKFKARGV QDTDVVSRLTKAMRRHKDMDVDILALVNDTVGTMMTCAYDDPYCEVGVIIGTGTNACYME DMSNIDLVEGDEGRMCINTEWGAFGDDGALEDIRTEFDRELDLGSLNPGKQLFEKMISGL YLGELVRLILLKMAKAGLLFGGEKSSALHTKGKIETRHVAAMEKYKEGLANTREILVDLG LEPSEADCIAVQHVCTIVSFRSANLCAAALAAILTRLRENKKVERLRTTVGMDGTLYKIH PQYPKRLHKVVRKLVPSCDVRFLLSESGSTKGAAMVTAVASRVQAQRKQIDRVLALFQLT REQLVDVQAKMRAELEYGLKKKSHGLATVRMLPTYVCGLPDGTEKGKFLALDLGGTNFRV LLVKIRSGRRSVRMYNKIFAIPLEIMQGTGEELFDHIVQCIADFLDYMGLKGASLPLGFT FSFPCRQMSIDKGTLIGWTKGFKATDCEGEDVVDMLREAIKRRNEFDLDIVAVVNDTVGT MMTCGYEDPNCEIGLIAGTGSNMCYMEDMRNIEMVEGGEGKMCINTEWGGFGDNGCIDDI WTRYDTEVDEGSLNPGKQSCRMSVKYPIGTKEKYDDKFATGLPQPYPSSPNRYEKMTSGM YLGEIVRQILIDLTKQGLLFRGQISERLRTRGIFETKFLSQIESDRLALLQVRRILQQLG LDSTCEDSIVVKEVCGAVSRRAAQLCGAGLAAIVEKRREDQGLEHLRITVGVDGTLYKLH PHFSRILQETVKELAPRCDVTFMLSEDGSGKGAALITAVAKRLQQAQKEN >gi568815588f:69032897_69271266|GENSCAN_predicted_CDS_6|2853_bp atgtttgcggtccacttgatggcattttacttcagcaagctgaaggaggaccagatcaag aaggtggacaggttcctgtatcacatgcggctctccgatgacacccttttggacatcatg aggcggttccgggctgagatggagaagggcctggcaaaggacaccaaccccacggctgca gtgaagatgttgcccaccttcgtcagggccattcccgatggttccgaaaatggggagttc ctttccctggatctcggagggtccaagttccgagtgctgaaggtgcaagtcgctgaagag gggaagcgacacgtgcagatggagagtcagttctacccaacgcccaatgaaatcatccgc gggaacggcacagagctgtttgaatatgtagctgactgtctggcagatttcatgaagacc aaagatttaaagcataagaaattgccccttggcctaactttttctttcccctgtcgacag actaaactggaagagggtgtcctactttcgtggacaaaaaagtttaaggcacgaggagtt caggacacggatgtggtgagccgtctgaccaaagccatgagaagacacaaggacatggac gtggacatcctggccctggtcaatgacaccgtggggaccatgatgacctgtgcctatgac gacccctactgcgaagttggtgtcatcatcggaactggcaccaatgcgtgttacatggag gacatgagcaacattgacctggtggagggcgacgagggcaggatgtgcatcaacacagag tggggggccttcggggacgacggggccctggaggacattcgcactgagttcgacagggag ctggacctcggctctctcaacccaggaaagcaactgttcgagaagatgatcagtggcctg tacctgggggagcttgtcaggcttatcttgctgaagatggccaaggctggcctcctgttt ggtggtgagaaatcttctgctctccacactaagggcaagatcgaaacacggcacgtggct gccatggagaagtataaagaaggccttgctaatacaagagagatcctggtggacctgggt ctggaaccgtctgaggctgactgcattgccgtccagcatgtctgtaccatcgtctccttc cgctcggccaatctctgtgcagcagctctggcggccatcctgacacgcctccgggagaac aagaaggtggaacggctccggaccacagtgggcatggacggcaccctctacaagatacac cctcagtacccaaaacgcctgcacaaggtggtgaggaaactggtcccaagctgtgatgtc cgcttcctcctgtcagagagtggcagcaccaagggggccgccatggtgaccgcggtggcc tcccgcgtgcaggcccagcggaagcagatcgacagggtgctggctttgttccagctgacc cgagagcagctcgtggacgtgcaggccaagatgcgggctgagctggagtatgggctgaag aagaagagccacgggctggccacggtcaggatgctgcccacctacgtctgcgggctgccg gacggcacagagaaaggaaagtttctcgccctggatcttgggggaaccaacttccgggtc ctcctggtgaagatcagaagtggacggaggtcagtgcgaatgtacaacaagatcttcgcc atccccctggagatcatgcagggcactggtgaggagctctttgatcacattgtgcagtgc atcgccgacttcctggactacatgggcctcaagggagcctccctacctttgggcttcaca ttctcatttccctgcaggcagatgagcattgacaagggaacactcatagggtggaccaaa ggtttcaaggccactgactgtgaaggggaggacgtggtggacatgctcagggaagccatc aagaggagaaacgagtttgacctggacattgttgcagtcgtgaatgatacagtggggacc atgatgacctgtggctatgaagatcctaattgtgagattggcctgattgcaggaacaggc agcaacatgtgctacatggaggacatgaggaacatcgagatggtggaggggggtgaaggg aagatgtgcatcaatacagagtggggaggatttggagacaatggctgcatagatgacatc tggacccgatacgacacggaggtggatgaggggtccttgaatcctggcaagcagagctgt cgaatgtcagtgaaatatcctattggaacaaaagaaaaatatgatgacaagtttgcaaca ggtctgccccaaccttatccttcttctccaaacagatacgagaaaatgaccagtgggatg tacttgggggagattgtgcggcagatcctgatcgacctgaccaagcagggtctcctcttc cgagggcagatttcagagcgtctccggaccaggggcatcttcgaaaccaagttcctgtcc cagatcgaaagcgatcggctggcccttctccaggtcaggaggattctgcagcagctgggc ctggacagcacgtgtgaggacagcatcgtggtgaaggaggtgtgcggagccgtgtcccgg cgggcggcccagctctgcggtgctggcctggccgctatagtggaaaaaaggagagaagac caggggctagagcacctgaggatcactgtgggtgtggacggcaccctgtacaagctgcac cctcacttttctagaatattgcaggaaactgtgaaggaactagcccctcgatgtgatgtg acattcatgctgtcagaagatggcagtggaaaaggggcagcactgatcactgctgtggcc aagaggttacagcaggcacagaaggagaactag