GENSCAN 1.0    Date run: 4-Nov-116    Time: 19:37:31

Sequence gi568815595f:46272903_46473958 : 201056 bp : 43.86% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.03 PlyA -   2716   2711    6                               1.05
 1.02 Term -  10414  10124  291  1  0   72   42   196 0.893   8.74
 1.01 Init -  26120  26028   93  2  0   49   63    72 0.224   1.18
 1.00 Prom -  29476  29437   40                              -4.56

 2.03 PlyA -  29538  29533    6                               1.05
 2.02 Term -  38783  38573  211  1  1   56   49   150 0.443   4.67
 2.01 Init -  43250  43243    8  2  2   64  119     0 0.839   1.20
 2.00 Prom -  44168  44129   40                              -3.66

 3.00 Prom +  49677  49716   40                              -5.46
 3.01 Init +  52857  52917   61  2  1   90   37    50 0.612   1.52
 3.02 Intr +  53450  53557  108  0  0   69   52    95 0.036   4.26
 3.03 Intr +  59878  60548  671  2  2   82   11   152 0.208  -1.68
 3.04 Intr +  62246  62320   75  1  0   67   92    70 0.694   5.01
 3.05 Intr +  70099  70152   54  0  0   64  106    23 0.073   0.88
 3.06 Intr +  74535  74592   58  2  1   52  119    -8 0.031  -2.84
 3.07 Intr +  78421  78510   90  2  0   82   78    44 0.182   2.77
 3.08 Intr +  81211  81275   65  2  2   53  105    60 0.285   2.64
 3.09 Term +  84575  85708 1134  1  0   95   41   870 0.877  74.98
 3.10 PlyA +  87172  87177    6                               1.05

 4.00 Prom +  91449  91488   40                              -5.76
 4.01 Sngl + 100001 101059 1059  1  0   76   47   709 0.983  62.76
 4.02 PlyA + 103281 103286    6                               1.05

 5.00 Prom + 116452 116491   40                              -5.56
 5.01 Init + 128259 128342   84  2  0   83   94    55 0.514   6.42
 5.02 Intr + 134487 134600  114  2  0   79  103    32 0.935   4.44
 5.03 Term + 135166 136212 1047  0  0   82   37   444 0.961  30.84
 5.04 PlyA + 136238 136243    6                               1.05

 6.03 PlyA - 138938 138933    6                               1.05
 6.02 Term - 149208 148707  502  2  1   40   36   275 0.802  11.45
 6.01 Init - 150510 150371  140  0  2   63   25   113 0.330   2.11
 6.00 Prom - 157223 157184   40                              -5.36

 7.18 PlyA - 158092 158087    6                               1.05
 7.17 Term - 160698 160532  167  1  2   43   42   110 0.370  -0.02
 7.16 Intr - 165227 165038  190  2  1  112  105    77 0.963  10.96
 7.15 Intr - 166578 166394  185  1  2  120   80   179 0.902  19.81
 7.14 Intr - 168581 168514   68  1  2   55   78    53 0.991  -0.45
 7.13 Intr - 170680 170539  142  2  1  110  117   187 0.993  23.21
 7.12 Intr - 172534 172379  156  2  0   54   87   141 0.838  10.58
 7.11 Intr - 173591 173538   54  0  0  103  113    22 0.972   5.15
 7.10 Intr - 174496 174406   91  1  1   72   98    82 0.996   7.17
 7.09 Intr - 176115 175961  155  1  2  103   58   200 0.985  18.29
 7.08 Intr - 177126 176952  175  0  1   87   87   163 0.999  15.61
 7.07 Intr - 177771 177593  179  1  2   98   70   218 0.998  20.64
 7.06 Intr - 181458 181403   56  2  2  128   91    -7 0.975   2.22
 7.05 Intr - 182540 182393  148  0  1   90   89    59 0.958   5.49
 7.04 Intr - 183076 182894  183  2  0  104  110   151 0.996  18.66
 7.03 Intr - 183496 183388  109  1  1   83   86    97 0.999   8.86
 7.02 Intr - 186917 186754  164  0  2   93  101    47 0.990   6.19
 7.01 Init - 191965 191923   43  1  1   84  109   110 0.942  11.38

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  91026  91080   55  2  1   38   73    58 0.800   0.75

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595f:46272903_46473958|GENSCAN_predicted_peptide_1|127_aa
MSGFQGDLEWRQTDFSLSDFTLYDRNPQLVETCVLPFWEFFLNHNSNNNNYYYCYYCYYC
CCYYYYWVFYFAMTPVFAILELLESNFGYKPGQEHGSSPLEGEEMLHPFQQKNKSWVVTG
LQGTLRV

>gi568815595f:46272903_46473958|GENSCAN_predicted_CDS_1|384_bp
atgagcggcttccagggtgacctggagtggagacaaacagatttcagcctctcagatttc
accctttacgatagaaaccctcagctggtggaaacctgtgtacttccattctgggagttt
ttcctgaatcacaacagcaacaacaacaactactattactgctactactgctattactgc
tgctgctactactactactgggtattctactttgcgatgacacccgtttttgctattctt
gaactccttgagtcaaattttggatacaagcctggacaggagcatggcagcagccccttg
gagggcgaagaaatgctacatccctttcagcaaaaaaataagtcctgggttgttactgga
ctccaggggactctgagagtttga

>gi568815595f:46272903_46473958|GENSCAN_predicted_peptide_2|72_aa
MSCLTTKGASFFKIAHDIEAAGGKLSVIATRENIAYIMECLQSDIDILMEFLLSVATAPG
FGHWEVDAFSLS

>gi568815595f:46272903_46473958|GENSCAN_predicted_CDS_2|219_bp
atgagctgtttgactacgaaaggagcttcatttttcaagatagcccatgacattgaagca
gctggtggtaaattaagtgtaattgcaacaagagaaaacatagcttacatcatggaatgt
ctgcagagtgatattgatattctaatggagttcctgctcagtgtcgccacagcaccagga
tttggtcattgggaagtagatgccttcagtctcagctaa

>gi568815595f:46272903_46473958|GENSCAN_predicted_peptide_3|771_aa
MVLASAQLLERPQEDLLKAEVPLGPCGLYTGALSSNRIRLSTSVIRTWDFEDMNEEVSSG
HLDVHVKVTGIYDGLAWAQRPDTNLLQKPQKVITHKPWFPQAGTLDVENWDRAGLKQAHQ
KGLKVDSSVFSTWSLVHTVLLPLSPYYSAEQQAESKNWKEFVVLLTAPIEYKKQEREDKN
WPIPPPPDAETSVPSPSVAEIEIPVQRILCSAVIAGEPLGPCAFPISVRPDPNNPQQFIH
EHSPLEFKLLKELKTSVVNNGVQSQWFLEEGMLDIELWEQRLWVMVSTTTLVLAVPFSES
EKELYEFAQKGQLYYWDKSLNHKNAFKRFSALCGTGVPCPHSGPQGFKFVFSGEPLFTLA
EGVEGGADPFCLYLFGSVAEKPDIPGLPETSHKLNRESGLNKDAFPQYIHNMLSTSRSRF
IRNTNESGEEVTTFFDYDYGAPCHKFDVKQIGAQLLPPLYSLVFIFGFVGNMLVVLILIN
CKKLKCLTDIYLLNLAISDLLFLITLPLWAHSAANEWVFGNAMCKLFTGLYHIGYFGGIF
FIILLTIDRYLAIVHAVFALKARTVTFGVVTSVITWLVAVFASVPGIIFTKCQKEDSVYV
CGPYFPRGWNNFHTIMRNILGLVLPLLIMVICYSGILKTLLRCRNEKKRHRAVRVIFTIM
IVYFLFWTPYNIVILLNTFQEFFGLSNCESTSQLDQATQVTETLGMTHCCINPIIYAFVG
EKFRRYLSVFFRKHITKRFCKQCPVFYRETVDGVTSTNTPSTGEQEVSAGL

>gi568815595f:46272903_46473958|GENSCAN_predicted_CDS_3|2316_bp
atggtgctggcatctgctcagcttctggagaggccccaagaagatttgctcaaggcagaa
gttccactaggtccgtgcgggttatatactggtgccctgagcagcaacagaatcaggctc
tcaacaagtgtcatccgaacatgggactttgaggacatgaacgaagaagtttcttcaggc
catctggatgtacacgtgaaggttacggggatatatgatggcttagcttgggctcagagg
cctgacactaacctcctgcagaagccacaaaaggttattacacataaaccatggtttcca
caggcaggcactcttgatgtggaaaattgggatagagcaggattaaaacaagctcatcaa
aaaggtcttaaagttgattcttcagttttctccacttggagtttagttcatactgtactt
ctgccattatctccttattattctgcggaacagcaggctgaatctaaaaattggaaagaa
tttgttgtcctactcacagctccaattgaatataaaaaacaggagagggaggataaaaat
tggcctataccgcctcctccagatgcagaaacatctgtaccatctccttcagtggcagaa
atagagatcccagtacaaagaattttatgctctgctgtcatagctggagagcccttagga
ccttgtgcttttcctatttctgtaaggcctgatccaaataatccacagcagtttattcat
gaacactctccactagaatttaagttgttgaaggaattaaaaactagtgtggtcaataat
ggagtacaaagccaatggttcctggaggaaggaatgctagacatagaactttgggagcaa
cgcctctgggttatggtctccacaaccacgctggtcttggcagtgcccttctcagaaagt
gaaaaagagttgtatgaatttgcccaaaagggacaattatactactgggataaatctttg
aatcataaaaatgcctttaagcggttttctgccctgtgtgggacaggtgttccttgccct
cattccggtccacaaggatttaaatttgtcttcagtggtgaacctttgttcaccctggca
gaaggtgtggaggggggtgcagatccgttttgtctttatctgtttggttcagttgctgag
aagcctgacataccaggactgcctgagacaagccacaagctgaacagagaaagtggattg
aacaaggacgcatttccccagtacatccacaacatgctgtccacatctcgttctcggttt
atcagaaataccaacgagagcggtgaagaagtcaccaccttttttgattatgattacggt
gctccctgtcataaatttgacgtgaagcaaattggggcccaactcctgcctccgctctac
tcgctggtgttcatctttggttttgtgggcaacatgctggtcgtcctcatcttaataaac
tgcaaaaagctgaagtgcttgactgacatttacctgctcaacctggccatctctgatctg
ctttttcttattactctcccattgtgggctcactctgctgcaaatgagtgggtctttggg
aatgcaatgtgcaaattattcacagggctgtatcacatcggttattttggcggaatcttc
ttcatcatcctcctgacaatcgatagatacctggctattgtccatgctgtgtttgcttta
aaagccaggacggtcacctttggggtggtgacaagtgtgatcacctggttggtggctgtg
tttgcttctgtcccaggaatcatctttactaaatgccagaaagaagattctgtttatgtc
tgtggcccttattttccacgaggatggaataatttccacacaataatgaggaacattttg
gggctggtcctgccgctgctcatcatggtcatctgctactcgggaatcctgaaaaccctg
cttcggtgtcgaaacgagaagaagaggcatagggcagtgagagtcatcttcaccatcatg
attgtttactttctcttctggactccctataatattgtcattctcctgaacaccttccag
gaattcttcggcctgagtaactgtgaaagcaccagtcaactggaccaagccacgcaggtg
acagagactcttgggatgactcactgctgcatcaatcccatcatctatgccttcgttggg
gagaagttcagaaggtatctctcggtgttcttccgaaagcacatcaccaagcgcttctgc
aaacaatgtccagttttctacagggagacagtggatggagtgacttcaacaaacacgcct
tccactggggagcaggaagtctcggctggtttataa

>gi568815595f:46272903_46473958|GENSCAN_predicted_peptide_4|352_aa
MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR
LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMCQLLTGLYFIGFFSGIFFII
LLTIDRYLAVVHAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQKEGLHYTCSS
HFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIFTI
MIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQVTETLGMTHCCINPIIYAFV
GEKFRNYLLVFFQKHIAKRFCKCCSIFQQEAPERASSVYTRSTGEQEISVGL

>gi568815595f:46272903_46473958|GENSCAN_predicted_CDS_4|1059_bp
atggattatcaagtgtcaagtccaatctatgacatcaattattatacatcggagccctgc
caaaaaatcaatgtgaagcaaatcgcagcccgcctcctgcctccgctctactcactggtg
ttcatctttggttttgtgggcaacatgctggtcatcctcatcctgataaactgcaaaagg
ctgaagagcatgactgacatctacctgctcaacctggccatctctgacctgtttttcctt
cttactgtccccttctgggctcactatgctgccgcccagtgggactttggaaatacaatg
tgtcaactcttgacagggctctattttataggcttcttctctggaatcttcttcatcatc
ctcctgacaatcgataggtacctggctgtcgtccatgctgtgtttgctttaaaagccagg
acggtcacctttggggtggtgacaagtgtgatcacttgggtggtggctgtgtttgcgtct
ctcccaggaatcatctttaccagatctcaaaaagaaggtcttcattacacctgcagctct
cattttccatacagtcagtatcaattctggaagaatttccagacattaaagatagtcatc
ttggggctggtcctgccgctgcttgtcatggtcatctgctactcgggaatcctaaaaact
ctgcttcggtgtcgaaatgagaagaagaggcacagggctgtgaggcttatcttcaccatc
atgattgtttattttctcttctgggctccctacaacattgtccttctcctgaacaccttc
caggaattctttggcctgaataattgcagtagctctaacaggttggaccaagctatgcag
gtgacagagactcttgggatgacgcactgctgcatcaaccccatcatctatgcctttgtc
ggggagaagttcagaaactacctcttagtcttcttccaaaagcacattgccaaacgcttc
tgcaaatgctgttctattttccagcaagaggctcccgagcgagcaagctcagtttacacc
cgatccactggggagcaggaaatatctgtgggcttgtga

>gi568815595f:46272903_46473958|GENSCAN_predicted_peptide_5|414_aa
MTEWEYDGIKWLYVFQKDWSTQATGKWEGQEQSDQKEGIHCPGPFPQLPDAGSGGCALPL
QELSPVGSLKMANYTLAPEDEYDVLIEGELESDEAEQCDKYDAQALSAQLVPSLCSAVFV
IGVLDNLLVVLILVKYKGLKRVENIYLLNLAVSNLCFLLTLPFWAHAGGDPMCKILIGLY
FVGLYSETFFNCLLTVQRYLVFLHKGNFFSARRRVPCGIITSVLAWVTAILATLPEFVVY
KPQMEDQKYKCAFSRTPFLPADETFWKHFLTLKMNISVLVLPLFIFTFLYVQMRKTLRFR
EQRYSLFKLVFAIMVVFLLMWAPYNIAFFLSTFKEHFSLSDCKSSYNLDKSVHITKLIAT
THCCINPLLYAFLDGTFSKYLCRCFHLRSNTPLQPRGQSAQGTSREEPDHSTEV

>gi568815595f:46272903_46473958|GENSCAN_predicted_CDS_5|1245_bp
atgacagaatgggaatatgatggaattaagtggctttatgtgttccaaaaggattggtcc
acgcaggccacaggaaaatgggagggtcaggagcagtctgatcaaaaggagggcatccac
tgtccggggccattcccacagctcccggatgctgggtctggaggctgcgcccttcccctg
caggagctcagcccagtgggcagtctgaagatggccaattacacgctggcaccagaggat
gaatatgatgtcctcatagaaggtgaactggagagcgatgaggcagagcaatgtgacaag
tatgacgcccaggcactctcagcccagctggtgccatcactctgctctgctgtgtttgtg
atcggtgtcctggacaatctcctggttgtgcttatcctggtaaaatataaaggactcaaa
cgcgtggaaaatatctatcttctaaacttggcagtttctaacttgtgtttcttgcttacc
ctgcccttctgggctcatgctgggggcgatcccatgtgtaaaattctcattggactgtac
ttcgtgggcctgtacagtgagacatttttcaattgccttctgactgtgcaaaggtaccta
gtgtttttgcacaagggaaactttttctcagccaggaggagggtgccctgtggcatcatt
acaagtgtcctggcatgggtaacagccattctggccactttgcctgaattcgtggtttat
aaacctcagatggaagaccagaaatacaagtgtgcatttagcagaactcccttcctgcca
gctgatgagacattctggaagcattttctgactttaaaaatgaacatttcggttcttgtc
ctccccctatttatttttacatttctctatgtgcaaatgagaaaaacactaaggttcagg
gagcagaggtatagccttttcaagcttgtttttgccataatggtagtcttccttctgatg
tgggcgccctacaatattgcatttttcctgtccactttcaaagaacacttctccctgagt
gactgcaagagcagctacaatctggacaaaagtgttcacatcactaaactcatcgccacc
acccactgctgcatcaaccctctcctgtatgcgtttcttgatgggacatttagcaaatac
ctctgccgctgtttccatctgcgtagtaacaccccacttcaacccagggggcagtctgca
caaggcacatcgagggaagaacctgaccattccaccgaagtgtaa

>gi568815595f:46272903_46473958|GENSCAN_predicted_peptide_6|213_aa
MSMEKALKQLEVQSTKKERAFAGRVGWAFLTVLRKVHTQSLRDADWRELAKGVWLGGTPD
DQRPHVELAIHWSPTNVQWVLVLVDTGIDCSLVYGNPVKFLGKSAYIKGYGGQSVKVKPV
SLYLGIGHLAPCLYTVYVSPIPEYILEVDILHGLEAVPSIMDLMNHSTMELGQYHYVVDL
ANAFFSVDLALESQEQFALMTMDFHSVAAGLCA

>gi568815595f:46272903_46473958|GENSCAN_predicted_CDS_6|642_bp
atgagcatggagaaggcgctgaagcagctggaagtgcagagcaccaagaaggagagagcc
tttgctggcagagttggatgggcatttttaactgtgctaagaaaagtacacacccagtcc
ctgagggatgcagactggagggaactggccaaaggtgtctggcttggggggacaccagat
gaccagaggccacatgtggaattggcaatccactggtcccccaccaatgtacagtgggtg
ctggtgctggtagatactggcatagattgtagccttgtttatgggaacccagttaagttt
ttgggcaaatctgcatatattaaaggttacggaggccagtcagtgaaagtgaaacctgta
tctctgtaccttggcattggccacttggctccttgcttatacactgtgtatgtctctccc
atacctgaatacattctggaggtggatattttacatggcttggaagctgtgccatctatc
atggatttgatgaaccactcgacaatggaattaggacagtaccactatgtggtggacttg
gccaatgcattcttctcagttgaccttgctctagagagccaggaacagtttgccttgatg
acaatggactttcacagtgttgctgcagggctatgtgcatag

>gi568815595f:46272903_46473958|GENSCAN_predicted_peptide_7|754_aa
MKLVFLVLLFLGALGLCLAGRRRSVQWCAVSQPEATKCFQWQRNMRKVRGPPVSCIKRDS
PIQCIQAIAENRADAVTLDGGFIYEAGLAPYKLRPVAAEVYGTERQPRTHYYAVAVVKKG
GSFQLNELQGLKSCHTGLRRTAGWNVPIGTLRPFLNWTGPPEPIEAAVARFFSASCVPGA
DKGQFPNLCRLCAGTGENKCAFSSQEPYFSYSGAFKCLRDGAGDVAFIRESTVFEDLSDE
AERDEYELLCPDNTRKPVDKFKDCHLARVPSHAVVARSVNGKEDAIWNLLRQAQEKFGKD
KSPKFQLFGSPSGQKDLLFKDSAIGFSRVPPRIDSGLYLGSGYFTAIQNLRKSEEEVAAR
RARVVWCAVGEQELRKCNQWSGLSEGSVTCSSASTTEDCIALVLKGEADAMSLDGGYVYT
AGKCGLVPVLAENYKSQQSSDPDPNCVDRPVEGYLAVAVVRRSDTSLTWNSVKGKKSCHT
AVDRTAGWNIPMGLLFNQTGSCKFDEYFSQSCAPGSDPRSNLCALCIGDEQGENKCVPNS
NERYYGYTGAFRCLAENAGDVAFVKDVTVLQNTDGNNNEAWAKDLKLADFALLCLDGKRK
PVTEARSCHLAMAPNHAVVSRMDKVERLKQVLLHQQAKFGRNGSDCPDKFCLFQSETKNL
LFNDNTECLARLHGKTTYEKYLGPQYVAGITNLKKCSTSRCHYVTHMDLGKLNKGGKRGN
KRQETKEYIWKKRSGGTLPLVDKGPELYTTLRSD

>gi568815595f:46272903_46473958|GENSCAN_predicted_CDS_7|2265_bp
atgaaacttgtcttcctcgtcctgctgttcctcggggccctcggactgtgtctggctggc
cgtaggaggagtgttcagtggtgcgccgtatcccaacccgaggccacaaaatgcttccaa
tggcaaaggaatatgagaaaagtgcgtggccctcctgtcagctgcataaagagagactcc
cccatccagtgtatccaggccattgcggaaaacagggccgatgctgtgacccttgatggt
ggtttcatatacgaggcaggcctggccccctacaaactgcgacctgtagcggcggaagtc
tacgggaccgaaagacagccacgaactcactattatgccgtggctgtggtgaagaagggc
ggcagctttcagctgaacgaactgcaaggtctgaagtcctgccacacaggccttcgcagg
accgctggatggaatgtccctatagggacacttcgtccattcttgaattggacgggtcca
cctgagcccattgaggcagctgtggccaggttcttctcagccagctgtgttcccggtgca
gataaaggacagttccccaacctgtgtcgcctgtgtgcggggacaggggaaaacaaatgt
gccttctcctcccaggaaccgtacttcagctactctggtgccttcaagtgtctgagagac
ggggctggagacgtggcttttatcagagagagcacagtgtttgaggacctgtcagacgag
gctgaaagggacgagtatgagttactctgcccagacaacactcggaagccagtggacaag
ttcaaagactgccatctggcccgggtcccttctcatgccgttgtggcacgaagtgtgaat
ggcaaggaggatgccatctggaatcttctccgccaggcacaggaaaagtttggaaaggac
aagtcaccgaaattccagctctttggctcccctagtgggcagaaagatctgctgttcaag
gactctgccattgggttttcgagggtgcccccgaggatagattctgggctgtaccttggc
tccggctacttcactgccatccagaacttgaggaaaagtgaggaggaagtggctgcccgg
cgtgcgcgggtcgtgtggtgtgcggtgggcgagcaggagctgcgcaagtgtaaccagtgg
agtggcttgagcgaaggcagcgtgacctgctcctcggcctccaccacagaggactgcatc
gccctggtgctgaaaggagaagctgatgccatgagtttggatggaggatatgtgtacact
gcaggcaaatgtggtttggtgcctgtcctggcagagaactacaaatcccaacaaagcagt
gaccctgatcctaactgtgtggatagacctgtggaaggatatcttgctgtggcggtggtt
aggagatcagacactagccttacctggaactctgtgaaaggcaagaagtcctgccacacc
gccgtggacaggactgcaggctggaatatccccatgggcctgctcttcaaccagacgggc
tcctgcaaatttgatgaatatttcagtcaaagctgtgcccctgggtctgacccgagatct
aatctctgtgctctgtgtattggcgacgagcagggtgagaataagtgcgtgcccaacagc
aacgagagatactacggctacactggggctttccggtgcctggctgagaatgctggagac
gttgcatttgtgaaagatgtcactgtcttgcagaacactgatggaaataacaatgaggca
tgggctaaggatttgaagctggcagactttgcgctgctgtgcctcgatggcaaacggaag
cctgtgactgaggctagaagctgccatcttgccatggccccgaatcatgccgtggtgtct
cggatggataaggtggaacgcctgaaacaggtgttgctccaccaacaggctaaatttggg
agaaatggatctgactgcccggacaagttttgcttattccagtctgaaaccaaaaacctt
ctgttcaatgacaacactgagtgtctggccagactccatggcaaaacaacatatgaaaaa
tatttgggaccacagtatgtcgcaggcattactaatctgaaaaagtgctcaacctcccgt
tgccactatgtaacccacatggacctagggaaactgaacaaagggggcaaacgtgggaat
aaaagacaagagacaaaagagtatatttggaagaagcggtcagggggcactttgcctcta
gtggacaagggccctgagctttacacaaccctccgtagtgattag