GENSCAN 1.0 Date run: 6-Nov-116 Time: 17:51:44 Sequence gi568815595f:46398063_46600896 : 202834 bp : 44.96% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 3099 3182 84 2 0 83 94 55 0.517 6.42 1.02 Intr + 9327 9440 114 2 0 79 103 32 0.944 4.44 1.03 Term + 10006 11052 1047 0 0 82 37 444 0.970 30.84 1.04 PlyA + 11078 11083 6 1.05 2.03 PlyA - 13778 13773 6 1.05 2.02 Term - 24048 23547 502 2 1 40 36 275 0.802 11.45 2.01 Init - 25350 25211 140 0 2 63 25 113 0.330 2.11 2.00 Prom - 32063 32024 40 -5.36 3.18 PlyA - 32932 32927 6 1.05 3.17 Term - 35538 35372 167 1 2 43 42 110 0.370 -0.02 3.16 Intr - 40067 39878 190 2 1 112 105 77 0.963 10.96 3.15 Intr - 41418 41234 185 1 2 120 80 179 0.902 19.81 3.14 Intr - 43421 43354 68 1 2 55 78 53 0.991 -0.45 3.13 Intr - 45520 45379 142 2 1 110 117 187 0.993 23.21 3.12 Intr - 47374 47219 156 2 0 54 87 141 0.838 10.58 3.11 Intr - 48431 48378 54 0 0 103 113 22 0.972 5.15 3.10 Intr - 49336 49246 91 1 1 72 98 82 0.996 7.17 3.09 Intr - 50955 50801 155 1 2 103 58 200 0.985 18.29 3.08 Intr - 51966 51792 175 0 1 87 87 163 0.999 15.61 3.07 Intr - 52611 52433 179 1 2 98 70 218 0.998 20.64 3.06 Intr - 56298 56243 56 2 2 128 91 -7 0.975 2.22 3.05 Intr - 57380 57233 148 0 1 90 89 59 0.958 5.49 3.04 Intr - 57916 57734 183 2 0 104 110 151 0.996 18.66 3.03 Intr - 58336 58228 109 1 1 83 86 97 0.999 8.86 3.02 Intr - 61757 61594 164 0 2 93 101 47 0.990 6.19 3.01 Init - 66805 66763 43 1 1 84 109 110 0.960 11.38 3.00 Prom - 70825 70786 40 -5.66 4.00 Prom + 78460 78499 40 -7.66 4.01 Init + 80119 80228 110 0 2 91 19 95 0.351 0.59 4.02 Intr + 80275 80371 97 1 1 84 61 68 0.354 3.71 4.03 Intr + 87153 87265 113 2 2 60 81 69 0.134 2.58 4.04 Intr + 90972 91218 247 0 1 78 97 108 0.387 8.16 4.05 Intr + 99947 100155 209 1 2 74 94 77 0.408 4.78 4.06 Intr + 102294 102598 305 0 2 125 46 186 0.583 14.33 4.07 Intr + 103146 103284 139 1 1 58 55 101 0.267 3.32 4.08 Term + 113182 113293 112 1 1 84 37 74 0.291 -0.07 4.09 PlyA + 115249 115254 6 1.05 5.09 PlyA - 117346 117341 6 1.05 5.08 Term - 121001 120952 50 0 2 112 40 -12 0.213 -6.13 5.07 Intr - 123596 123460 137 1 2 63 90 194 0.981 17.31 5.06 Intr - 129519 129364 156 2 0 89 87 68 0.966 5.93 5.05 Intr - 131988 131843 146 0 2 57 70 114 0.977 5.58 5.04 Intr - 134847 134711 137 1 2 72 63 86 0.898 4.79 5.03 Intr - 141758 141635 124 2 1 63 37 49 0.246 -2.54 5.02 Intr - 147191 146984 208 1 1 99 113 208 0.993 23.48 5.01 Init - 153529 153405 125 1 2 76 62 162 0.973 12.04 5.00 Prom - 156974 156935 40 -3.36 6.00 Prom + 169362 169401 40 -5.66 6.01 Init + 175732 175795 64 0 1 75 119 -2 0.370 2.91 6.02 Intr + 181170 181304 135 1 0 42 61 73 0.580 0.54 6.03 Intr + 181677 181791 115 1 1 130 63 96 0.742 10.81 6.04 Intr + 181889 181998 110 2 2 93 91 3 0.561 1.03 6.05 Term + 183070 183188 119 2 2 89 37 38 0.308 -2.50 6.06 PlyA + 184226 184231 6 1.05 7.03 PlyA - 185131 185126 6 1.05 7.02 Term - 195589 195529 61 0 1 134 44 40 0.197 1.48 7.01 Intr - 195773 195675 99 1 0 71 71 84 0.197 4.23 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:46398063_46600896|GENSCAN_predicted_peptide_1|414_aa MTEWEYDGIKWLYVFQKDWSTQATGKWEGQEQSDQKEGIHCPGPFPQLPDAGSGGCALPL QELSPVGSLKMANYTLAPEDEYDVLIEGELESDEAEQCDKYDAQALSAQLVPSLCSAVFV IGVLDNLLVVLILVKYKGLKRVENIYLLNLAVSNLCFLLTLPFWAHAGGDPMCKILIGLY FVGLYSETFFNCLLTVQRYLVFLHKGNFFSARRRVPCGIITSVLAWVTAILATLPEFVVY KPQMEDQKYKCAFSRTPFLPADETFWKHFLTLKMNISVLVLPLFIFTFLYVQMRKTLRFR EQRYSLFKLVFAIMVVFLLMWAPYNIAFFLSTFKEHFSLSDCKSSYNLDKSVHITKLIAT THCCINPLLYAFLDGTFSKYLCRCFHLRSNTPLQPRGQSAQGTSREEPDHSTEV >gi568815595f:46398063_46600896|GENSCAN_predicted_CDS_1|1245_bp atgacagaatgggaatatgatggaattaagtggctttatgtgttccaaaaggattggtcc acgcaggccacaggaaaatgggagggtcaggagcagtctgatcaaaaggagggcatccac tgtccggggccattcccacagctcccggatgctgggtctggaggctgcgcccttcccctg caggagctcagcccagtgggcagtctgaagatggccaattacacgctggcaccagaggat gaatatgatgtcctcatagaaggtgaactggagagcgatgaggcagagcaatgtgacaag tatgacgcccaggcactctcagcccagctggtgccatcactctgctctgctgtgtttgtg atcggtgtcctggacaatctcctggttgtgcttatcctggtaaaatataaaggactcaaa cgcgtggaaaatatctatcttctaaacttggcagtttctaacttgtgtttcttgcttacc ctgcccttctgggctcatgctgggggcgatcccatgtgtaaaattctcattggactgtac ttcgtgggcctgtacagtgagacatttttcaattgccttctgactgtgcaaaggtaccta gtgtttttgcacaagggaaactttttctcagccaggaggagggtgccctgtggcatcatt acaagtgtcctggcatgggtaacagccattctggccactttgcctgaattcgtggtttat aaacctcagatggaagaccagaaatacaagtgtgcatttagcagaactcccttcctgcca gctgatgagacattctggaagcattttctgactttaaaaatgaacatttcggttcttgtc ctccccctatttatttttacatttctctatgtgcaaatgagaaaaacactaaggttcagg gagcagaggtatagccttttcaagcttgtttttgccataatggtagtcttccttctgatg tgggcgccctacaatattgcatttttcctgtccactttcaaagaacacttctccctgagt gactgcaagagcagctacaatctggacaaaagtgttcacatcactaaactcatcgccacc acccactgctgcatcaaccctctcctgtatgcgtttcttgatgggacatttagcaaatac ctctgccgctgtttccatctgcgtagtaacaccccacttcaacccagggggcagtctgca caaggcacatcgagggaagaacctgaccattccaccgaagtgtaa >gi568815595f:46398063_46600896|GENSCAN_predicted_peptide_2|213_aa MSMEKALKQLEVQSTKKERAFAGRVGWAFLTVLRKVHTQSLRDADWRELAKGVWLGGTPD DQRPHVELAIHWSPTNVQWVLVLVDTGIDCSLVYGNPVKFLGKSAYIKGYGGQSVKVKPV SLYLGIGHLAPCLYTVYVSPIPEYILEVDILHGLEAVPSIMDLMNHSTMELGQYHYVVDL ANAFFSVDLALESQEQFALMTMDFHSVAAGLCA >gi568815595f:46398063_46600896|GENSCAN_predicted_CDS_2|642_bp atgagcatggagaaggcgctgaagcagctggaagtgcagagcaccaagaaggagagagcc tttgctggcagagttggatgggcatttttaactgtgctaagaaaagtacacacccagtcc ctgagggatgcagactggagggaactggccaaaggtgtctggcttggggggacaccagat gaccagaggccacatgtggaattggcaatccactggtcccccaccaatgtacagtgggtg ctggtgctggtagatactggcatagattgtagccttgtttatgggaacccagttaagttt ttgggcaaatctgcatatattaaaggttacggaggccagtcagtgaaagtgaaacctgta tctctgtaccttggcattggccacttggctccttgcttatacactgtgtatgtctctccc atacctgaatacattctggaggtggatattttacatggcttggaagctgtgccatctatc atggatttgatgaaccactcgacaatggaattaggacagtaccactatgtggtggacttg gccaatgcattcttctcagttgaccttgctctagagagccaggaacagtttgccttgatg acaatggactttcacagtgttgctgcagggctatgtgcatag >gi568815595f:46398063_46600896|GENSCAN_predicted_peptide_3|754_aa MKLVFLVLLFLGALGLCLAGRRRSVQWCAVSQPEATKCFQWQRNMRKVRGPPVSCIKRDS PIQCIQAIAENRADAVTLDGGFIYEAGLAPYKLRPVAAEVYGTERQPRTHYYAVAVVKKG GSFQLNELQGLKSCHTGLRRTAGWNVPIGTLRPFLNWTGPPEPIEAAVARFFSASCVPGA DKGQFPNLCRLCAGTGENKCAFSSQEPYFSYSGAFKCLRDGAGDVAFIRESTVFEDLSDE AERDEYELLCPDNTRKPVDKFKDCHLARVPSHAVVARSVNGKEDAIWNLLRQAQEKFGKD KSPKFQLFGSPSGQKDLLFKDSAIGFSRVPPRIDSGLYLGSGYFTAIQNLRKSEEEVAAR RARVVWCAVGEQELRKCNQWSGLSEGSVTCSSASTTEDCIALVLKGEADAMSLDGGYVYT AGKCGLVPVLAENYKSQQSSDPDPNCVDRPVEGYLAVAVVRRSDTSLTWNSVKGKKSCHT AVDRTAGWNIPMGLLFNQTGSCKFDEYFSQSCAPGSDPRSNLCALCIGDEQGENKCVPNS NERYYGYTGAFRCLAENAGDVAFVKDVTVLQNTDGNNNEAWAKDLKLADFALLCLDGKRK PVTEARSCHLAMAPNHAVVSRMDKVERLKQVLLHQQAKFGRNGSDCPDKFCLFQSETKNL LFNDNTECLARLHGKTTYEKYLGPQYVAGITNLKKCSTSRCHYVTHMDLGKLNKGGKRGN KRQETKEYIWKKRSGGTLPLVDKGPELYTTLRSD >gi568815595f:46398063_46600896|GENSCAN_predicted_CDS_3|2265_bp atgaaacttgtcttcctcgtcctgctgttcctcggggccctcggactgtgtctggctggc cgtaggaggagtgttcagtggtgcgccgtatcccaacccgaggccacaaaatgcttccaa tggcaaaggaatatgagaaaagtgcgtggccctcctgtcagctgcataaagagagactcc cccatccagtgtatccaggccattgcggaaaacagggccgatgctgtgacccttgatggt ggtttcatatacgaggcaggcctggccccctacaaactgcgacctgtagcggcggaagtc tacgggaccgaaagacagccacgaactcactattatgccgtggctgtggtgaagaagggc ggcagctttcagctgaacgaactgcaaggtctgaagtcctgccacacaggccttcgcagg accgctggatggaatgtccctatagggacacttcgtccattcttgaattggacgggtcca cctgagcccattgaggcagctgtggccaggttcttctcagccagctgtgttcccggtgca gataaaggacagttccccaacctgtgtcgcctgtgtgcggggacaggggaaaacaaatgt gccttctcctcccaggaaccgtacttcagctactctggtgccttcaagtgtctgagagac ggggctggagacgtggcttttatcagagagagcacagtgtttgaggacctgtcagacgag gctgaaagggacgagtatgagttactctgcccagacaacactcggaagccagtggacaag ttcaaagactgccatctggcccgggtcccttctcatgccgttgtggcacgaagtgtgaat ggcaaggaggatgccatctggaatcttctccgccaggcacaggaaaagtttggaaaggac aagtcaccgaaattccagctctttggctcccctagtgggcagaaagatctgctgttcaag gactctgccattgggttttcgagggtgcccccgaggatagattctgggctgtaccttggc tccggctacttcactgccatccagaacttgaggaaaagtgaggaggaagtggctgcccgg cgtgcgcgggtcgtgtggtgtgcggtgggcgagcaggagctgcgcaagtgtaaccagtgg agtggcttgagcgaaggcagcgtgacctgctcctcggcctccaccacagaggactgcatc gccctggtgctgaaaggagaagctgatgccatgagtttggatggaggatatgtgtacact gcaggcaaatgtggtttggtgcctgtcctggcagagaactacaaatcccaacaaagcagt gaccctgatcctaactgtgtggatagacctgtggaaggatatcttgctgtggcggtggtt aggagatcagacactagccttacctggaactctgtgaaaggcaagaagtcctgccacacc gccgtggacaggactgcaggctggaatatccccatgggcctgctcttcaaccagacgggc tcctgcaaatttgatgaatatttcagtcaaagctgtgcccctgggtctgacccgagatct aatctctgtgctctgtgtattggcgacgagcagggtgagaataagtgcgtgcccaacagc aacgagagatactacggctacactggggctttccggtgcctggctgagaatgctggagac gttgcatttgtgaaagatgtcactgtcttgcagaacactgatggaaataacaatgaggca tgggctaaggatttgaagctggcagactttgcgctgctgtgcctcgatggcaaacggaag cctgtgactgaggctagaagctgccatcttgccatggccccgaatcatgccgtggtgtct cggatggataaggtggaacgcctgaaacaggtgttgctccaccaacaggctaaatttggg agaaatggatctgactgcccggacaagttttgcttattccagtctgaaaccaaaaacctt ctgttcaatgacaacactgagtgtctggccagactccatggcaaaacaacatatgaaaaa tatttgggaccacagtatgtcgcaggcattactaatctgaaaaagtgctcaacctcccgt tgccactatgtaacccacatggacctagggaaactgaacaaagggggcaaacgtgggaat aaaagacaagagacaaaagagtatatttggaagaagcggtcagggggcactttgcctcta gtggacaagggccctgagctttacacaaccctccgtagtgattag >gi568815595f:46398063_46600896|GENSCAN_predicted_peptide_4|443_aa MPSSRPTALRLFLMAPLSSLGSLLEFLLTSNPAASRLVSGLQGKLSQNPCGHSVTSPEAE GRYVGDKGQASCEGHVTVHWEVRRSSLLESQNGRECRYCWHSQREGRPAVRAGAQTSSPS SRLPCGPEGSMLSCSTSCKPPVAQALKRPPVPTVCPCHWFPIPCSFVGLQTVACCLPNKF LASASLTKPTCQGPGELDPPRHTIAPEMAGDTEVWKQMFQELMREVKPWHRWTLRPDKGL LPNVLKPGWMQYQQWTFARFQCSSCSRNWASAQVLVLFHMNWSEEKSRGQVKMRVFTQRC KKCPQPLFEDPEFTQENISRILKNLVFRILKKCYRGRFQLIEEVPMIKDISLEGPHNSDN LFFKKAFTIVFTLKRTESEVSMIHKPFGNLVKPVGPLTREFCTDPLRTCEHVTLCGTGDF TDVIKVKDLEMGRVSSIIQLGSV >gi568815595f:46398063_46600896|GENSCAN_predicted_CDS_4|1332_bp atgccctcttcccgccccactgccctgagacttttcctaatggcccctctcagcagcctg ggcagcctccttgaatttctgctcaccagtaacccagctgcgtcccgactggtctctggg ctacaagggaagctttcccagaacccatgtggccactcagtcacttccccagaggctgaa ggtcgctatgttggtgacaaagggcaggccagctgtgaaggccacgtcaccgtccactgg gaagtgaggcggagctccctgctggagagccagaatgggcgggaatgcaggtactgctgg cactcccagagggagggaaggcctgctgtccgtgctggcgcccagacttcctctccaagc tcccgcctgccctgtggccccgaaggttccatgctgagctgcagcacttcctgcaagcct ccagtggctcaggcgctcaaaaggcctcctgtccccactgtatgcccatgtcactggttc ccgatcccctgctcctttgtcggcttacagacagtggcctgttgcctgcccaacaaattc ctggccagcgccagcctgaccaaaccgacctgccagggcccaggagagctcgacccaccc aggcacaccatagccccagagatggctggggacacagaagtgtggaagcaaatgtttcag gagttaatgcgggaggtgaagccatggcacaggtggaccctgagaccagacaagggcctt cttcccaacgtcctgaagccaggctggatgcaataccagcagtggaccttcgccaggttc cagtgctcctcctgctctcgtaactgggcctctgcccaagttctggtccttttccacatg aactggagtgaggagaagtccaggggccaggtgaagatgagggtgtttacccagagatgt aagaagtgcccccaacctctgtttgaggaccctgagttcacacaagagaacatctcaagg atcctgaaaaacctggtgttccgaattctgaagaaatgctatagaggaagatttcagttg atagaggaggttcctatgatcaaggacatctctcttgaagggccacacaatagtgacaac ttattttttaaaaaggccttcacaattgtgttcactctcaagagaactgagtctgaagtg tctatgattcataaaccatttggaaatcttgtgaagcctgtgggccccctcaccagggaa ttttgcacagaccctctgagaacctgtgaacatgtaaccttatgtggcacaggggacttt acagatgtgattaaggtgaaggaccttgagatggggagagtgtcctccatcatccagctg ggctcagtataa >gi568815595f:46398063_46600896|GENSCAN_predicted_peptide_5|360_aa MGHKVVVFDISVIRALWETRVKKHKAWQKKEVERLEKSALEKIKEEWNFVAECRRKGIPQ AVYCKNGFIDTSVRLLDKIERNTLTRQSSLPKDRGKRSSAFVFELSGEHWTFYPRTLELG EALDTEFINVISAVETQGGGVTGKWQGLGRNSGCLKNLKELNVGFNYLKSIPPELGDCEN LERLDCSGNLELMELPFELSNLKQVTFVDISANKFSSVPICVLRMSNLQWLDISSNNLTD LPQDIDRLEELQSFLLYKNKLTYLPYSMLNLKKLTLLVVSGDHLVELPTALCDSSTPLKF VSLMDNPIDNAQCEDGNEIMESERDRQHFDKEVMKAYIEDLKERESVPSYTTKVSFSLQL >gi568815595f:46398063_46600896|GENSCAN_predicted_CDS_5|1083_bp atgggacataaagtggttgtcttcgacatttctgtcatcagagccttgtgggaaactcgt gtcaagaagcacaaagcttggcagaagaaggaggtggaaaggcttgagaagagcgccttg gagaagataaaggaggagtggaactttgtggccgaatgcaggaggaagggcatcccccag gctgtatactgcaagaatggcttcatagacaccagcgtgcggcttctggacaagattgaa aggaacactctcacaaggcagagttcacttcccaaggacagaggcaaacggagcagtgcg tttgtgtttgaactttctggggagcactggacgttttatcccagaacacttgagctaggg gaggccttggacactgagttcatcaatgtcatttcagcagtggagacccagggaggtggc gtgactggaaaatggcagggcctgggccggaactcaggttgtttgaagaacctgaaagaa ctcaatgtgggtttcaactatctgaagagcattcctccagaattgggagattgtgaaaat ctagagagactggattgttctggaaatctagaattaatggagctgccctttgaattaagt aatttgaagcaagttacatttgtagatatctcagcaaacaagttttccagtgtcccaatc tgtgtcctgcggatgtcgaatttgcagtggttggatatcagcagcaataacctgaccgac ctgccgcaagatatagacaggctagaggagctgcagagctttctcttgtataaaaacaag ttgacctaccttccctattccatgctgaacctgaagaagctcactctgttagtcgtcagt ggggaccatttggtggagctcccaactgccctttgtgactcatccacacctttaaaattt gtaagccttatggacaatcctattgataatgcccaatgtgaagatggcaatgaaataatg gaaagtgaacgggatcgccaacattttgataaagaagttatgaaagcctatattgaagac cttaaagaaagagaatctgttcccagctataccaccaaagtgtcttttagccttcaactt tga >gi568815595f:46398063_46600896|GENSCAN_predicted_peptide_6|180_aa MKYYFSHTVLTKFKMLAAARPGLGHQEFARPSRGYLAFRDDSIWPQEEPAIRPRSSQRVP PMGIQHSKELNRTCCLNGGTCMLGSFCACPPSFYGRNCEHDVRKENCGSVPHDTWLPKKC SLCKCWHGQLRCFPQAFLPGCDGLVMDEHLVASRTPELPPSARTTTFMLVGICLSIQSYY >gi568815595f:46398063_46600896|GENSCAN_predicted_CDS_6|543_bp atgaaatactacttttcacatactgtattaacaaaatttaagatgcttgctgcagccaga cctgggctgggccatcaggaatttgctcgtccatctcggggatacctggccttcagagat gacagcatttggccccaggaggagcctgcaattcggcctcggtcttcccagcgtgtgccg cccatggggatacagcacagtaaggagctaaacagaacctgctgcctgaatgggggaacc tgcatgctggggtccttttgtgcctgccctccctccttctacggacggaactgtgagcac gatgtgcgcaaagagaactgtgggtctgtgccccatgacacctggctgcccaagaagtgt tccctgtgtaaatgctggcacggtcagctccgctgctttcctcaggcatttctacccggc tgtgatggccttgtgatggatgagcacctcgtggcttccaggactccagaactaccaccg tctgcacgtactaccacttttatgctagttggcatctgcctttctatacaaagctactat taa >gi568815595f:46398063_46600896|GENSCAN_predicted_peptide_7|53_aa XIGVCHGELTIWNWDKSENYWFTPHNQHQKYLCRPVLTPGMRLARSAGKRRDS >gi568815595f:46398063_46600896|GENSCAN_predicted_CDS_7|162_bp nncattggcgtctgccatggagagctcactatatggaactgggacaaatctgagaactac tggtttacaccacacaatcagcatcagaaatacctgtgcagaccagtattgaccccgggg atgcggctggccaggtctgcagggaagcgcagagactcctag